perluniprops

Perl 5 version 22.0 documentation

NAME

perluniprops - Index of Unicode Version 7.0.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is, the portion that is defined on single code points. (The section "Other information in the Unicode data base" below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See Unicode character properties that are NOT accepted by Perl.)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section Properties accessible through Unicode::UCD.
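As a minimal sketch of the Unicode::UCD route, the following uses that module's charinfo() function to look up a few properties of a single code point without a regular expression. The choice of U+03B1 and the particular hash fields printed are only illustrative.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);

# charinfo() returns a hash reference describing one code point.
my $info = charinfo(0x3B1);

printf "name:     %s\n", $info->{name};       # GREEK SMALL LETTER ALPHA
printf "category: %s\n", $info->{category};   # Ll (the General_Category)
printf "script:   %s\n", $info->{script};
printf "block:    %s\n", $info->{block};
```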

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see Other Properties in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against. For example, \p{Script: Greek} and \p{Script=Greek} both mean to match characters whose Script property value is Greek.
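A short sketch of the two compound spellings, using an arbitrarily chosen Greek letter and a Latin letter as test characters:

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA
my $latin = "a";         # LATIN SMALL LETTER A

# The colon and equals-sign spellings are interchangeable.
my $colon_form  = $alpha =~ /\p{Script: Greek}/ ? 1 : 0;
my $equals_form = $alpha =~ /\p{Script=Greek}/  ? 1 : 0;
my $latin_match = $latin =~ /\p{Script=Greek}/  ? 1 : 0;

print "colon=$colon_form equals=$equals_form latin=$latin_match\n";
# prints "colon=1 equals=1 latin=0"
```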

Single forms, like \p{Greek}, are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is just a shortcut for \p{Script=Greek}.) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word}. These are also listed in the table.
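The following sketch compares a single form with its compound equivalent and exercises \p{Word}. The test characters are arbitrary, and the equivalence shown is the one described for the Perl version this document covers.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

# Single form and its compound equivalent.
my $single   = $alpha =~ /\p{Greek}/        ? 1 : 0;
my $compound = $alpha =~ /\p{Script=Greek}/ ? 1 : 0;

# \p{Word} is a Perl-defined single form with no compound counterpart.
my $underscore_is_word = "_" =~ /\p{Word}/ ? 1 : 0;
my $hyphen_is_word     = "-" =~ /\p{Word}/ ? 1 : 0;

print "single=$single compound=$compound\n";
print "underscore=$underscore_is_word hyphen=$hyphen_is_word\n";
```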

In parsing these constructs, Perl always ignores upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek}. But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{}) to "doesn't match" (for \P{}). Casing in this document is for improved legibility.
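A small sketch of both points: the case of the name inside the braces does not matter, while the case of the leading "p" inverts the match. The test character is an arbitrary Greek letter.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

my $mixed_case = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $lower_case = $alpha =~ /\p{greek}/ ? 1 : 0;
my $upper_case = $alpha =~ /\p{GREEK}/ ? 1 : 0;

# Capital P negates: it matches only characters WITHOUT the property.
my $negated = $alpha =~ /\P{Greek}/ ? 1 : 0;

print "mixed=$mixed_case lower=$lower_case upper=$upper_case negated=$negated\n";
# prints "mixed=1 lower=1 upper=1 negated=0"
```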

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But in the table below a 'T' at the beginning of an entry means that tighter (stricter) rules are used for that entry.
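As an illustration of the loose rules, the sketch below writes one compound form three ways and checks each against the letter "A". Bidi_Class=Left_To_Right is chosen only because none of its table entries carries a 'T' flag.

```perl
use strict;
use warnings;

my $letter = "A";   # Bidi_Class of "A" is Left_To_Right

# Same property and value, with separators added, removed, or swapped.
my $as_listed  = $letter =~ /\p{Bidi_Class: Left_To_Right}/     ? 1 : 0;
my $squeezed   = $letter =~ /\p{bidiclass=lefttoright}/         ? 1 : 0;
my $spaced_out = $letter =~ /\p{ Bidi Class = Left-To-Right }/  ? 1 : 0;

print "listed=$as_listed squeezed=$squeezed spaced=$spaced_out\n";
```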

Some properties are considered obsolete by Unicode, but are still available. There are several varieties of obsolescence; the table marks the affected entries with the flags explained in the legend summary below.

Matches in the Block property have shortcuts that begin with "In_". For example, \p{Block=Latin1} can be written as \p{In_Latin1}. For backward compatibility, if there is no conflict with another shortcut, these may also be written as \p{Latin1} or \p{Is_Latin1}. But, N.B., there are numerous such conflicting shortcuts. Use of these forms for Block is discouraged, and they are flagged as such, not only because of the potential confusion as to what is meant, but also because a later release of Unicode may preempt the shortcut, and your program would no longer be correct. Use the "In_" form instead to avoid this, or even more clearly, use the compound form, e.g., \p{blk:latin1}. See Blocks in perlunicode for more information about this.
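The sketch below shows why the distinction matters, using Greek as an example of a name that exists both as a script and as a block shortcut. U+1F00 is chosen because it is a Greek-script letter that lies in the Greek_Extended block rather than the Greek block.

```perl
use strict;
use warnings;

my $alpha       = "\x{3B1}";    # GREEK SMALL LETTER ALPHA, in the Greek block
my $alpha_psili = "\x{1F00}";   # GREEK SMALL LETTER ALPHA WITH PSILI,
                                # in the Greek_Extended block

# "In_" shortcut and the explicit compound form both test the Block.
my $in_block_short    = $alpha       =~ /\p{In_Greek}/     ? 1 : 0;
my $in_block_compound = $alpha       =~ /\p{Blk: Greek}/   ? 1 : 0;
my $ext_in_block      = $alpha_psili =~ /\p{In_Greek}/     ? 1 : 0;

# The Script property covers both characters.
my $ext_in_script     = $alpha_psili =~ /\p{Script=Greek}/ ? 1 : 0;

print "block(short)=$in_block_short block(compound)=$in_block_compound\n";
print "U+1F00: block=$ext_in_block script=$ext_in_script\n";
```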

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. The table shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., \pL), the left column contains both forms, with and without braces.
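A brief sketch of the brace-less form, using the one-letter General_Category abbreviation L (Letter) mentioned above:

```perl
use strict;
use warnings;

# \pL and \p{L} are the same test; \PL is its negation.
my $bare_form    = "a" =~ /\pL/   ? 1 : 0;
my $braced_form  = "a" =~ /\p{L}/ ? 1 : 0;
my $digit_is_L   = "1" =~ /\pL/   ? 1 : 0;
my $digit_not_L  = "1" =~ /\PL/   ? 1 : 0;

print "bare=$bare_form braced=$braced_form\n";
print "digit: is_letter=$digit_is_L not_letter=$digit_not_L\n";
```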

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.
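One way to reproduce such a count is sketched below, assuming a Perl whose Unicode::UCD provides prop_invlist(). The helper name count_code_points is hypothetical. It counts only code points up to U+10FFFF, which corresponds to the number shown before any "plus all above-Unicode code points" note.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Count the code points matched by a property from its inversion list.
# Even-indexed elements start a matching range; odd-indexed elements are
# the first code point after that range.
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # A final unpaired element means the range runs on past the end
        # of Unicode; stop counting after U+10FFFF.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $invlist[$i];
    }
    return $count;
}

my $ahex_count = count_code_points("ASCII_Hex_Digit");
print "ASCII_Hex_Digit: $ahex_count\n";   # 22: 0-9, A-F, a-f
```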

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation (/i= other_property) in the second column. Under case-insensitive matching they match the same code points as the property other_property.
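A sketch of the effect, assuming \p{Lu} (Uppercase_Letter) is one of the affected properties; its entry lies outside the part of the table shown here.

```perl
use strict;
use warnings;

my $lower = "a";

# Without /i, a lowercase letter is not an Uppercase_Letter.
my $plain_match = $lower =~ /\p{Lu}/  ? 1 : 0;

# With /i, \p{Lu} matches any cased letter, so "a" now matches.
my $folded_match = $lower =~ /\p{Lu}/i ? 1 : 0;

print "plain=$plain_match folded=$folded_match\n";
```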

There is no description given for most non-Perl defined properties (see http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  \p{Gc: *}  \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  \p{Is_*}  \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar}. "*" here is restricted to something not beginning with an underscore.
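A sketch of the "Is" prefix on a single form, with Greek standing in for the placeholder Foo:

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

# All three spellings name the same property.
my $plain         = $alpha =~ /\p{Greek}/    ? 1 : 0;
my $is_underscore = $alpha =~ /\p{Is_Greek}/ ? 1 : 0;
my $is_joined     = $alpha =~ /\p{IsGreek}/  ? 1 : 0;

print "plain=$plain Is_=$is_underscore Is=$is_joined\n";
```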

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary; those that are not have all their values spelled out without using this wildcard, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.
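The synonyms can be checked with a sketch like the following, which uses the binary Alphabetic property. The test characters "a" and "1" are arbitrary.

```perl
use strict;
use warnings;

# Every spelling of "yes" for the binary Alphabetic property.
my @yes_forms = ('\p{Alphabetic=Y}', '\p{Alphabetic=Yes}',
                 '\p{Alpha: T}',     '\p{Alpha=True}', '\p{Alpha}');

# Every spelling of "no", including the negated single form.
my @no_forms  = ('\p{Alphabetic=N}', '\p{Alphabetic=No}',
                 '\p{Alpha: F}',     '\p{Alpha=False}', '\P{Alpha}');

# "a" is alphabetic, so it should match all of the yes forms;
# "1" is not, so it should match all of the no forms.
my $yes_hits = grep { "a" =~ /\A$_\z/ } @yes_forms;
my $no_hits  = grep { "1" =~ /\A$_\z/ } @no_forms;

print "yes forms matched: $yes_hits of ", scalar(@yes_forms), "\n";
print "no forms matched:  $no_hits of ",  scalar(@no_forms),  "\n";
```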

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card

  • (\d+) in the info column gives the number of Unicode code points matched by this property.

  • D means this is deprecated.

  • O means this is obsolete.

  • S means this is stabilized.

  • T means tighter (stricter) name matching applies.

  • X means use of this form is discouraged, and may not be stable.

    NAME  INFO
  X \p{Aegean_Numbers}  \p{Block=Aegean_Numbers} (64)
  T \p{Age: 1.1}  \p{Age=V1_1} (33_979)
  T \p{Age: 2.0}  \p{Age=V2_0} (144_521)
  T \p{Age: 2.1}  \p{Age=V2_1} (2)
  T \p{Age: 3.0}  \p{Age=V3_0} (10_307)
  T \p{Age: 3.1}  \p{Age=V3_1} (44_978)
  T \p{Age: 3.2}  \p{Age=V3_2} (1016)
  T \p{Age: 4.0}  \p{Age=V4_0} (1226)
  T \p{Age: 4.1}  \p{Age=V4_1} (1273)
  T \p{Age: 5.0}  \p{Age=V5_0} (1369)
  T \p{Age: 5.1}  \p{Age=V5_1} (1624)
  T \p{Age: 5.2}  \p{Age=V5_2} (6648)
  T \p{Age: 6.0}  \p{Age=V6_0} (2088)
  T \p{Age: 6.1}  \p{Age=V6_1} (732)
  T \p{Age: 6.2}  \p{Age=V6_2} (1)
  T \p{Age: 6.3}  \p{Age=V6_3} (5)
  T \p{Age: 7.0}  \p{Age=V7_0} (2834)
    \p{Age: NA}  \p{Age=Unassigned} (861_509 plus all
          above-Unicode code points)
    \p{Age: Unassigned}  Code point's usage has not been assigned
          in any Unicode release thus far. (Short:
          \p{Age=NA}) (861_509 plus all
          above-Unicode code points)
    \p{Age: V1_1}  Code point's usage introduced in version
          1.1 (33_979)
    \p{Age: V2_0}  Code point's usage was introduced in
          version 2.0; See also Property
          'Present_In' (144_521)
    \p{Age: V2_1}  Code point's usage was introduced in
          version 2.1; See also Property
          'Present_In' (2)
    \p{Age: V3_0}  Code point's usage was introduced in
          version 3.0; See also Property
          'Present_In' (10_307)
    \p{Age: V3_1}  Code point's usage was introduced in
          version 3.1; See also Property
          'Present_In' (44_978)
    \p{Age: V3_2}  Code point's usage was introduced in
          version 3.2; See also Property
          'Present_In' (1016)
    \p{Age: V4_0}  Code point's usage was introduced in
          version 4.0; See also Property
          'Present_In' (1226)
    \p{Age: V4_1}  Code point's usage was introduced in
          version 4.1; See also Property
          'Present_In' (1273)
    \p{Age: V5_0}  Code point's usage was introduced in
          version 5.0; See also Property
          'Present_In' (1369)
    \p{Age: V5_1}  Code point's usage was introduced in
          version 5.1; See also Property
          'Present_In' (1624)
    \p{Age: V5_2}  Code point's usage was introduced in
          version 5.2; See also Property
          'Present_In' (6648)
    \p{Age: V6_0}  Code point's usage was introduced in
          version 6.0; See also Property
          'Present_In' (2088)
    \p{Age: V6_1}  Code point's usage was introduced in
          version 6.1; See also Property
          'Present_In' (732)
    \p{Age: V6_2}  Code point's usage was introduced in
          version 6.2; See also Property
          'Present_In' (1)
    \p{Age: V6_3}  Code point's usage was introduced in
          version 6.3; See also Property
          'Present_In' (5)
    \p{Age: V7_0}  Code point's usage was introduced in
          version 7.0; See also Property
          'Present_In' (2834)
    \p{Aghb}  \p{Caucasian_Albanian} (= \p{Script=
          Caucasian_Albanian}) (NOT \p{Block=
          Caucasian_Albanian}) (53)
    \p{AHex}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
          (22)
    \p{AHex: *}  \p{ASCII_Hex_Digit: *}
  X \p{Alchemical}  \p{Alchemical_Symbols} (= \p{Block=
          Alchemical_Symbols}) (128)
  X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
          \p{InAlchemical}) (128)
    \p{All}  All code points, including those above
          Unicode. Same as qr/./s (1_114_112 plus
          all above-Unicode code points)
    \p{Alnum}  \p{XPosixAlnum} (104_617)
    \p{Alpha}  \p{XPosixAlpha} (= \p{Alphabetic=Y})
          (104_077)
    \p{Alpha: *}  \p{Alphabetic: *}
    \p{Alphabetic}  \p{XPosixAlpha} (= \p{Alphabetic=Y})
          (104_077)
    \p{Alphabetic: N*}  (Short: \p{Alpha=N}, \P{Alpha}) (1_010_035
          plus all above-Unicode code points)
    \p{Alphabetic: Y*}  (Short: \p{Alpha=Y}, \p{Alpha}) (104_077)
  X \p{Alphabetic_PF}  \p{Alphabetic_Presentation_Forms} (=
          \p{Block=Alphabetic_Presentation_Forms})
          (80)
  X \p{Alphabetic_Presentation_Forms}  \p{Block=
          Alphabetic_Presentation_Forms} (Short:
          \p{InAlphabeticPF}) (80)
  X \p{Ancient_Greek_Music}  \p{Ancient_Greek_Musical_Notation} (=
          \p{Block=
          Ancient_Greek_Musical_Notation}) (80)
  X \p{Ancient_Greek_Musical_Notation}  \p{Block=
          Ancient_Greek_Musical_Notation} (Short:
          \p{InAncientGreekMusic}) (80)
  X \p{Ancient_Greek_Numbers}  \p{Block=Ancient_Greek_Numbers} (80)
  X \p{Ancient_Symbols}  \p{Block=Ancient_Symbols} (64)
    \p{Any}  All Unicode code points: [\x{0000}-
          \x{10FFFF}] (1_114_112)
    \p{Arab}  \p{Arabic} (= \p{Script=Arabic}) (NOT
          \p{Block=Arabic}) (1244)
    \p{Arabic}  \p{Script=Arabic} (Short: \p{Arab}; NOT
          \p{Block=Arabic}) (1244)
  X \p{Arabic_Ext_A}  \p{Arabic_Extended_A} (= \p{Block=
          Arabic_Extended_A}) (96)
  X \p{Arabic_Extended_A}  \p{Block=Arabic_Extended_A} (Short:
          \p{InArabicExtA}) (96)
  X \p{Arabic_Math}  \p{Arabic_Mathematical_Alphabetic_Symbols}
          (= \p{Block=
          Arabic_Mathematical_Alphabetic_Symbols})
          (256)
  X \p{Arabic_Mathematical_Alphabetic_Symbols}  \p{Block=
          Arabic_Mathematical_Alphabetic_Symbols}
          (Short: \p{InArabicMath}) (256)
  X \p{Arabic_PF_A}  \p{Arabic_Presentation_Forms_A} (=
          \p{Block=Arabic_Presentation_Forms_A})
          (688)
  X \p{Arabic_PF_B}  \p{Arabic_Presentation_Forms_B} (=
          \p{Block=Arabic_Presentation_Forms_B})
          (144)
  X \p{Arabic_Presentation_Forms_A}  \p{Block=
          Arabic_Presentation_Forms_A} (Short:
          \p{InArabicPFA}) (688)
  X \p{Arabic_Presentation_Forms_B}  \p{Block=
          Arabic_Presentation_Forms_B} (Short:
          \p{InArabicPFB}) (144)
  X \p{Arabic_Sup}  \p{Arabic_Supplement} (= \p{Block=
          Arabic_Supplement}) (48)
  X \p{Arabic_Supplement}  \p{Block=Arabic_Supplement} (Short:
          \p{InArabicSup}) (48)
    \p{Armenian}  \p{Script=Armenian} (Short: \p{Armn}; NOT
          \p{Block=Armenian}) (93)
    \p{Armi}  \p{Imperial_Aramaic} (= \p{Script=
          Imperial_Aramaic}) (NOT \p{Block=
          Imperial_Aramaic}) (31)
    \p{Armn}  \p{Armenian} (= \p{Script=Armenian}) (NOT
          \p{Block=Armenian}) (93)
  X \p{Arrows}  \p{Block=Arrows} (112)
    \p{ASCII}  \p{Block=Basic_Latin} [[:ASCII:]] (128)
    \p{ASCII_Hex_Digit}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
          (22)
    \p{ASCII_Hex_Digit: N*}  (Short: \p{AHex=N}, \P{AHex}) (1_114_090
          plus all above-Unicode code points)
    \p{ASCII_Hex_Digit: Y*}  (Short: \p{AHex=Y}, \p{AHex}) (22)
    \p{Assigned}  All assigned code points (252_537)
    \p{Avestan}  \p{Script=Avestan} (Short: \p{Avst}; NOT
          \p{Block=Avestan}) (61)
    \p{Avst}  \p{Avestan} (= \p{Script=Avestan}) (NOT
          \p{Block=Avestan}) (61)
    \p{Bali}  \p{Balinese} (= \p{Script=Balinese}) (NOT
          \p{Block=Balinese}) (121)
    \p{Balinese}  \p{Script=Balinese} (Short: \p{Bali}; NOT
          \p{Block=Balinese}) (121)
    \p{Bamu}  \p{Bamum} (= \p{Script=Bamum}) (NOT
          \p{Block=Bamum}) (657)
    \p{Bamum}  \p{Script=Bamum} (Short: \p{Bamu}; NOT
          \p{Block=Bamum}) (657)
  X \p{Bamum_Sup}  \p{Bamum_Supplement} (= \p{Block=
          Bamum_Supplement}) (576)
  X \p{Bamum_Supplement}  \p{Block=Bamum_Supplement} (Short:
          \p{InBamumSup}) (576)
  X \p{Basic_Latin}  \p{ASCII} (= \p{Block=Basic_Latin}) (128)
    \p{Bass}  \p{Bassa_Vah} (= \p{Script=Bassa_Vah})
          (NOT \p{Block=Bassa_Vah}) (36)
    \p{Bassa_Vah}  \p{Script=Bassa_Vah} (Short: \p{Bass}; NOT
          \p{Block=Bassa_Vah}) (36)
    \p{Batak}  \p{Script=Batak} (Short: \p{Batk}; NOT
          \p{Block=Batak}) (56)
    \p{Batk}  \p{Batak} (= \p{Script=Batak}) (NOT
          \p{Block=Batak}) (56)
    \p{Bc: *}  \p{Bidi_Class: *}
    \p{Beng}  \p{Bengali} (= \p{Script=Bengali}) (NOT
          \p{Block=Bengali}) (93)
    \p{Bengali}  \p{Script=Bengali} (Short: \p{Beng}; NOT
          \p{Block=Bengali}) (93)
    \p{Bidi_C}  \p{Bidi_Control} (= \p{Bidi_Control=Y})
          (12)
    \p{Bidi_C: *}  \p{Bidi_Control: *}
    \p{Bidi_Class: AL}  \p{Bidi_Class=Arabic_Letter} (1436)
    \p{Bidi_Class: AN}  \p{Bidi_Class=Arabic_Number} (50)
    \p{Bidi_Class: Arabic_Letter}  (Short: \p{Bc=AL}) (1436)
    \p{Bidi_Class: Arabic_Number}  (Short: \p{Bc=AN}) (50)
    \p{Bidi_Class: B}  \p{Bidi_Class=Paragraph_Separator} (7)
    \p{Bidi_Class: BN}  \p{Bidi_Class=Boundary_Neutral} (4016)
    \p{Bidi_Class: Boundary_Neutral}  (Short: \p{Bc=BN}) (4016)
    \p{Bidi_Class: Common_Separator}  (Short: \p{Bc=CS}) (15)
    \p{Bidi_Class: CS}  \p{Bidi_Class=Common_Separator} (15)
    \p{Bidi_Class: EN}  \p{Bidi_Class=European_Number} (158)
    \p{Bidi_Class: ES}  \p{Bidi_Class=European_Separator} (12)
    \p{Bidi_Class: ET}  \p{Bidi_Class=European_Terminator} (87)
    \p{Bidi_Class: European_Number}  (Short: \p{Bc=EN}) (158)
    \p{Bidi_Class: European_Separator}  (Short: \p{Bc=ES}) (12)
    \p{Bidi_Class: European_Terminator}  (Short: \p{Bc=ET}) (87)
    \p{Bidi_Class: First_Strong_Isolate}  (Short: \p{Bc=FSI}) (1)
    \p{Bidi_Class: FSI}  \p{Bidi_Class=First_Strong_Isolate} (1)
    \p{Bidi_Class: L}  \p{Bidi_Class=Left_To_Right} (1_097_670
          plus all above-Unicode code points)
    \p{Bidi_Class: Left_To_Right}  (Short: \p{Bc=L}) (1_097_670 plus
          all above-Unicode code points)
    \p{Bidi_Class: Left_To_Right_Embedding}  (Short: \p{Bc=LRE}) (1)
    \p{Bidi_Class: Left_To_Right_Isolate}  (Short: \p{Bc=LRI}) (1)
    \p{Bidi_Class: Left_To_Right_Override}  (Short: \p{Bc=LRO}) (1)
    \p{Bidi_Class: LRE}  \p{Bidi_Class=Left_To_Right_Embedding} (1)
    \p{Bidi_Class: LRI}  \p{Bidi_Class=Left_To_Right_Isolate} (1)
    \p{Bidi_Class: LRO}  \p{Bidi_Class=Left_To_Right_Override} (1)
    \p{Bidi_Class: Nonspacing_Mark}  (Short: \p{Bc=NSM}) (1429)
    \p{Bidi_Class: NSM}  \p{Bidi_Class=Nonspacing_Mark} (1429)
    \p{Bidi_Class: ON}  \p{Bidi_Class=Other_Neutral} (5126)
    \p{Bidi_Class: Other_Neutral}  (Short: \p{Bc=ON}) (5126)
    \p{Bidi_Class: Paragraph_Separator}  (Short: \p{Bc=B}) (7)
    \p{Bidi_Class: PDF}  \p{Bidi_Class=Pop_Directional_Format} (1)
    \p{Bidi_Class: PDI}  \p{Bidi_Class=Pop_Directional_Isolate} (1)
    \p{Bidi_Class: Pop_Directional_Format}  (Short: \p{Bc=PDF}) (1)
    \p{Bidi_Class: Pop_Directional_Isolate}  (Short: \p{Bc=PDI}) (1)
    \p{Bidi_Class: R}  \p{Bidi_Class=Right_To_Left} (4077)
    \p{Bidi_Class: Right_To_Left}  (Short: \p{Bc=R}) (4077)
    \p{Bidi_Class: Right_To_Left_Embedding}  (Short: \p{Bc=RLE}) (1)
    \p{Bidi_Class: Right_To_Left_Isolate}  (Short: \p{Bc=RLI}) (1)
    \p{Bidi_Class: Right_To_Left_Override}  (Short: \p{Bc=RLO}) (1)
    \p{Bidi_Class: RLE}  \p{Bidi_Class=Right_To_Left_Embedding} (1)
    \p{Bidi_Class: RLI}  \p{Bidi_Class=Right_To_Left_Isolate} (1)
    \p{Bidi_Class: RLO}  \p{Bidi_Class=Right_To_Left_Override} (1)
    \p{Bidi_Class: S}  \p{Bidi_Class=Segment_Separator} (3)
    \p{Bidi_Class: Segment_Separator}  (Short: \p{Bc=S}) (3)
    \p{Bidi_Class: White_Space}  (Short: \p{Bc=WS}) (17)
    \p{Bidi_Class: WS}  \p{Bidi_Class=White_Space} (17)
    \p{Bidi_Control}  \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
    \p{Bidi_Control: N*}  (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
          plus all above-Unicode code points)
    \p{Bidi_Control: Y*}  (Short: \p{BidiC=Y}, \p{BidiC}) (12)
    \p{Bidi_M}  \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
          (545)
    \p{Bidi_M: *}  \p{Bidi_Mirrored: *}
    \p{Bidi_Mirrored}  \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
          (545)
    \p{Bidi_Mirrored: N*}  (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
          plus all above-Unicode code points)
    \p{Bidi_Mirrored: Y*}  (Short: \p{BidiM=Y}, \p{BidiM}) (545)
    \p{Bidi_Paired_Bracket_Type: C}  \p{Bidi_Paired_Bracket_Type=Close}
          (60)
    \p{Bidi_Paired_Bracket_Type: Close}  (Short: \p{Bpt=C}) (60)
    \p{Bidi_Paired_Bracket_Type: N}  \p{Bidi_Paired_Bracket_Type=None}
          (1_113_992 plus all above-Unicode code
          points)
    \p{Bidi_Paired_Bracket_Type: None}  (Short: \p{Bpt=N}) (1_113_992
          plus all above-Unicode code points)
    \p{Bidi_Paired_Bracket_Type: O}  \p{Bidi_Paired_Bracket_Type=Open}
          (60)
    \p{Bidi_Paired_Bracket_Type: Open}  (Short: \p{Bpt=O}) (60)
    \p{Blank}  \p{XPosixBlank} (18)
    \p{Blk: *}  \p{Block: *}
    \p{Block: Aegean_Numbers}  (Single: \p{InAegeanNumbers}) (64)
    \p{Block: Alchemical}  \p{Block=Alchemical_Symbols} (128)
    \p{Block: Alchemical_Symbols}  (Short: \p{Blk=Alchemical},
          \p{InAlchemical}) (128)
    \p{Block: Alphabetic_PF}  \p{Block=Alphabetic_Presentation_Forms}
          (80)
    \p{Block: Alphabetic_Presentation_Forms}  (Short: \p{Blk=
          AlphabeticPF}, \p{InAlphabeticPF}) (80)
    \p{Block: Ancient_Greek_Music}  \p{Block=
          Ancient_Greek_Musical_Notation} (80)
    \p{Block: Ancient_Greek_Musical_Notation}  (Short: \p{Blk=
          AncientGreekMusic},
          \p{InAncientGreekMusic}) (80)
    \p{Block: Ancient_Greek_Numbers}  (Single:
          \p{InAncientGreekNumbers}) (80)
    \p{Block: Ancient_Symbols}  (Single: \p{InAncientSymbols}) (64)
    \p{Block: Arabic}  (Single: \p{InArabic}; NOT \p{Arabic} NOR
          \p{Is_Arabic}) (256)
    \p{Block: Arabic_Ext_A}  \p{Block=Arabic_Extended_A} (96)
    \p{Block: Arabic_Extended_A}  (Short: \p{Blk=ArabicExtA},
          \p{InArabicExtA}) (96)
    \p{Block: Arabic_Math}  \p{Block=
          Arabic_Mathematical_Alphabetic_Symbols}
          (256)
    \p{Block: Arabic_Mathematical_Alphabetic_Symbols}  (Short: \p{Blk=
          ArabicMath}, \p{InArabicMath}) (256)
    \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
    \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
    \p{Block: Arabic_Presentation_Forms_A}  (Short: \p{Blk=ArabicPFA},
          \p{InArabicPFA}) (688)
    \p{Block: Arabic_Presentation_Forms_B}  (Short: \p{Blk=ArabicPFB},
          \p{InArabicPFB}) (144)
    \p{Block: Arabic_Sup}  \p{Block=Arabic_Supplement} (48)
    \p{Block: Arabic_Supplement}  (Short: \p{Blk=ArabicSup},
          \p{InArabicSup}) (48)
    \p{Block: Armenian}  (Single: \p{InArmenian}; NOT \p{Armenian}
          NOR \p{Is_Armenian}) (96)
    \p{Block: Arrows}  (Single: \p{InArrows}) (112)
    \p{Block: ASCII}  \p{Block=Basic_Latin} (128)
    \p{Block: Avestan}  (Single: \p{InAvestan}; NOT \p{Avestan}
          NOR \p{Is_Avestan}) (64)
    \p{Block: Balinese}  (Single: \p{InBalinese}; NOT \p{Balinese}
          NOR \p{Is_Balinese}) (128)
    \p{Block: Bamum}  (Single: \p{InBamum}; NOT \p{Bamum} NOR
          \p{Is_Bamum}) (96)
    \p{Block: Bamum_Sup}  \p{Block=Bamum_Supplement} (576)
    \p{Block: Bamum_Supplement}  (Short: \p{Blk=BamumSup},
          \p{InBamumSup}) (576)
    \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
    \p{Block: Bassa_Vah}  (Single: \p{InBassaVah}; NOT \p{Bassa_Vah}
          NOR \p{Is_Bassa_Vah}) (48)
    \p{Block: Batak}  (Single: \p{InBatak}; NOT \p{Batak} NOR
          \p{Is_Batak}) (64)
    \p{Block: Bengali}  (Single: \p{InBengali}; NOT \p{Bengali}
          NOR \p{Is_Bengali}) (128)
    \p{Block: Block_Elements}  (Single: \p{InBlockElements}) (32)
    \p{Block: Bopomofo}  (Single: \p{InBopomofo}; NOT \p{Bopomofo}
          NOR \p{Is_Bopomofo}) (48)
    \p{Block: Bopomofo_Ext}  \p{Block=Bopomofo_Extended} (32)
    \p{Block: Bopomofo_Extended}  (Short: \p{Blk=BopomofoExt},
          \p{InBopomofoExt}) (32)
    \p{Block: Box_Drawing}  (Single: \p{InBoxDrawing}) (128)
    \p{Block: Brahmi}  (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR
          \p{Is_Brahmi}) (128)
    \p{Block: Braille}  \p{Block=Braille_Patterns} (256)
    \p{Block: Braille_Patterns}  (Short: \p{Blk=Braille},
          \p{InBraille}) (256)
    \p{Block: Buginese}  (Single: \p{InBuginese}; NOT \p{Buginese}
          NOR \p{Is_Buginese}) (32)
    \p{Block: Buhid}  (Single: \p{InBuhid}; NOT \p{Buhid} NOR
          \p{Is_Buhid}) (32)
    \p{Block: Byzantine_Music}  \p{Block=Byzantine_Musical_Symbols}
          (256)
    \p{Block: Byzantine_Musical_Symbols}  (Short: \p{Blk=
          ByzantineMusic}, \p{InByzantineMusic})
          (256)
    \p{Block: Canadian_Syllabics}  \p{Block=
          Unified_Canadian_Aboriginal_Syllabics}
          (640)
    \p{Block: Carian}  (Single: \p{InCarian}; NOT \p{Carian} NOR
          \p{Is_Carian}) (64)
    \p{Block: Caucasian_Albanian}  (Single: \p{InCaucasianAlbanian};
          NOT \p{Caucasian_Albanian} NOR
          \p{Is_Caucasian_Albanian}) (64)
    \p{Block: Chakma}  (Single: \p{InChakma}; NOT \p{Chakma} NOR
          \p{Is_Chakma}) (80)
    \p{Block: Cham}  (Single: \p{InCham}; NOT \p{Cham} NOR
          \p{Is_Cham}) (96)
    \p{Block: Cherokee}  (Single: \p{InCherokee}; NOT \p{Cherokee}
          NOR \p{Is_Cherokee}) (96)
    \p{Block: CJK}  \p{Block=CJK_Unified_Ideographs} (20_992)
    \p{Block: CJK_Compat}  \p{Block=CJK_Compatibility} (256)
    \p{Block: CJK_Compat_Forms}  \p{Block=CJK_Compatibility_Forms} (32)
    \p{Block: CJK_Compat_Ideographs}  \p{Block=
          CJK_Compatibility_Ideographs} (512)
    \p{Block: CJK_Compat_Ideographs_Sup}  \p{Block=
          CJK_Compatibility_Ideographs_Supplement}
          (544)
    \p{Block: CJK_Compatibility}  (Short: \p{Blk=CJKCompat},
          \p{InCJKCompat}) (256)
    \p{Block: CJK_Compatibility_Forms}  (Short: \p{Blk=CJKCompatForms},
          \p{InCJKCompatForms}) (32)
    \p{Block: CJK_Compatibility_Ideographs}  (Short: \p{Blk=
          CJKCompatIdeographs},
          \p{InCJKCompatIdeographs}) (512)
    \p{Block: CJK_Compatibility_Ideographs_Supplement}  (Short: \p{Blk=
          CJKCompatIdeographsSup},
          \p{InCJKCompatIdeographsSup}) (544)
    \p{Block: CJK_Ext_A}  \p{Block=
          CJK_Unified_Ideographs_Extension_A}
          (6592)
    \p{Block: CJK_Ext_B}  \p{Block=
          CJK_Unified_Ideographs_Extension_B}
          (42_720)
    \p{Block: CJK_Ext_C}  \p{Block=
          CJK_Unified_Ideographs_Extension_C}
          (4160)
    \p{Block: CJK_Ext_D}  \p{Block=
          CJK_Unified_Ideographs_Extension_D} (224)
    \p{Block: CJK_Radicals_Sup}  \p{Block=CJK_Radicals_Supplement} (128)
    \p{Block: CJK_Radicals_Supplement}  (Short: \p{Blk=CJKRadicalsSup},
          \p{InCJKRadicalsSup}) (128)
    \p{Block: CJK_Strokes}  (Single: \p{InCJKStrokes}) (48)
    \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
    \p{Block: CJK_Symbols_And_Punctuation}  (Short: \p{Blk=CJKSymbols},
          \p{InCJKSymbols}) (64)
    \p{Block: CJK_Unified_Ideographs}  (Short: \p{Blk=CJK}, \p{InCJK})
          (20_992)
    \p{Block: CJK_Unified_Ideographs_Extension_A}  (Short: \p{Blk=
          CJKExtA}, \p{InCJKExtA}) (6592)
    \p{Block: CJK_Unified_Ideographs_Extension_B}  (Short: \p{Blk=
          CJKExtB}, \p{InCJKExtB}) (42_720)
    \p{Block: CJK_Unified_Ideographs_Extension_C}  (Short: \p{Blk=
          CJKExtC}, \p{InCJKExtC}) (4160)
    \p{Block: CJK_Unified_Ideographs_Extension_D}  (Short: \p{Blk=
          CJKExtD}, \p{InCJKExtD}) (224)
    \p{Block: Combining_Diacritical_Marks}  (Short: \p{Blk=
          Diacriticals}, \p{InDiacriticals}) (112)
    \p{Block: Combining_Diacritical_Marks_Extended}  (Short: \p{Blk=
          DiacriticalsExt}, \p{InDiacriticalsExt})
          (80)
    \p{Block: Combining_Diacritical_Marks_For_Symbols}  (Short: \p{Blk=
          DiacriticalsForSymbols},
          \p{InDiacriticalsForSymbols}) (48)
    \p{Block: Combining_Diacritical_Marks_Supplement}  (Short: \p{Blk=
          DiacriticalsSup}, \p{InDiacriticalsSup})
          (64)
    \p{Block: Combining_Half_Marks}  (Short: \p{Blk=HalfMarks},
          \p{InHalfMarks}) (16)
    \p{Block: Combining_Marks_For_Symbols}  \p{Block=
          Combining_Diacritical_Marks_For_Symbols}
          (48)
    \p{Block: Common_Indic_Number_Forms}  (Short: \p{Blk=
          IndicNumberForms},
          \p{InIndicNumberForms}) (16)
    \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
    \p{Block: Control_Pictures}  (Single: \p{InControlPictures}) (64)
    \p{Block: Coptic}  (Single: \p{InCoptic}; NOT \p{Coptic} NOR
          \p{Is_Coptic}) (128)
    \p{Block: Coptic_Epact_Numbers}  (Single: \p{InCopticEpactNumbers})
          (32)
    \p{Block: Counting_Rod}  \p{Block=Counting_Rod_Numerals} (32)
    \p{Block: Counting_Rod_Numerals}  (Short: \p{Blk=CountingRod},
          \p{InCountingRod}) (32)
    \p{Block: Cuneiform}  (Single: \p{InCuneiform}; NOT
          \p{Cuneiform} NOR \p{Is_Cuneiform})
          (1024)
    \p{Block: Cuneiform_Numbers}  \p{Block=
          Cuneiform_Numbers_And_Punctuation} (128)
    \p{Block: Cuneiform_Numbers_And_Punctuation}  (Short: \p{Blk=
          CuneiformNumbers},
          \p{InCuneiformNumbers}) (128)
    \p{Block: Currency_Symbols}  (Single: \p{InCurrencySymbols}) (48)
    \p{Block: Cypriot_Syllabary}  (Single: \p{InCypriotSyllabary}) (64)
    \p{Block: Cyrillic}  (Single: \p{InCyrillic}; NOT \p{Cyrillic}
          NOR \p{Is_Cyrillic}) (256)
    \p{Block: Cyrillic_Ext_A}  \p{Block=Cyrillic_Extended_A} (32)
    \p{Block: Cyrillic_Ext_B}  \p{Block=Cyrillic_Extended_B} (96)
    \p{Block: Cyrillic_Extended_A}  (Short: \p{Blk=CyrillicExtA},
          \p{InCyrillicExtA}) (32)
    \p{Block: Cyrillic_Extended_B}  (Short: \p{Blk=CyrillicExtB},
          \p{InCyrillicExtB}) (96)
    \p{Block: Cyrillic_Sup}  \p{Block=Cyrillic_Supplement} (48)
    \p{Block: Cyrillic_Supplement}  (Short: \p{Blk=CyrillicSup},
          \p{InCyrillicSup}) (48)
    \p{Block: Cyrillic_Supplementary}  \p{Block=Cyrillic_Supplement}
          (48)
    \p{Block: Deseret}  (Single: \p{InDeseret}) (80)
    \p{Block: Devanagari}  (Single: \p{InDevanagari}; NOT
          \p{Devanagari} NOR \p{Is_Devanagari})
          (128)
    \p{Block: Devanagari_Ext}  \p{Block=Devanagari_Extended} (32)
    \p{Block: Devanagari_Extended}  (Short: \p{Blk=DevanagariExt},
          \p{InDevanagariExt}) (32)
    \p{Block: Diacriticals}  \p{Block=Combining_Diacritical_Marks} (112)
    \p{Block: Diacriticals_Ext}  \p{Block=
          Combining_Diacritical_Marks_Extended}
          (80)
    \p{Block: Diacriticals_For_Symbols}  \p{Block=
          Combining_Diacritical_Marks_For_Symbols}
          (48)
    \p{Block: Diacriticals_Sup}  \p{Block=
          Combining_Diacritical_Marks_Supplement}
          (64)
    \p{Block: Dingbats}  (Single: \p{InDingbats}) (192)
    \p{Block: Domino}  \p{Block=Domino_Tiles} (112)
    \p{Block: Domino_Tiles}  (Short: \p{Blk=Domino}, \p{InDomino}) (112)
    \p{Block: Duployan}  (Single: \p{InDuployan}; NOT \p{Duployan}
          NOR \p{Is_Duployan}) (160)
    \p{Block: Egyptian_Hieroglyphs}  (Single:
          \p{InEgyptianHieroglyphs}; NOT
          \p{Egyptian_Hieroglyphs} NOR
          \p{Is_Egyptian_Hieroglyphs}) (1072)
    \p{Block: Elbasan}  (Single: \p{InElbasan}; NOT \p{Elbasan}
          NOR \p{Is_Elbasan}) (48)
    \p{Block: Emoticons}  (Single: \p{InEmoticons}) (80)
    \p{Block: Enclosed_Alphanum}  \p{Block=Enclosed_Alphanumerics} (160)
    \p{Block: Enclosed_Alphanum_Sup}  \p{Block=
          Enclosed_Alphanumeric_Supplement} (256)
    \p{Block: Enclosed_Alphanumeric_Supplement}  (Short: \p{Blk=
          EnclosedAlphanumSup},
          \p{InEnclosedAlphanumSup}) (256)
    \p{Block: Enclosed_Alphanumerics}  (Short: \p{Blk=
          EnclosedAlphanum},
          \p{InEnclosedAlphanum}) (160)
    \p{Block: Enclosed_CJK}  \p{Block=Enclosed_CJK_Letters_And_Months}
          (256)
    \p{Block: Enclosed_CJK_Letters_And_Months}  (Short: \p{Blk=
          EnclosedCJK}, \p{InEnclosedCJK}) (256)
    \p{Block: Enclosed_Ideographic_Sup}  \p{Block=
          Enclosed_Ideographic_Supplement} (256)
    \p{Block: Enclosed_Ideographic_Supplement}  (Short: \p{Blk=
          EnclosedIdeographicSup},
          \p{InEnclosedIdeographicSup}) (256)
    \p{Block: Ethiopic}  (Single: \p{InEthiopic}; NOT \p{Ethiopic}
          NOR \p{Is_Ethiopic}) (384)
    \p{Block: Ethiopic_Ext}  \p{Block=Ethiopic_Extended} (96)
    \p{Block: Ethiopic_Ext_A}  \p{Block=Ethiopic_Extended_A} (48)
    \p{Block: Ethiopic_Extended}  (Short: \p{Blk=EthiopicExt},
          \p{InEthiopicExt}) (96)
    \p{Block: Ethiopic_Extended_A}  (Short: \p{Blk=EthiopicExtA},
          \p{InEthiopicExtA}) (48)
    \p{Block: Ethiopic_Sup}  \p{Block=Ethiopic_Supplement} (32)
    \p{Block: Ethiopic_Supplement}  (Short: \p{Blk=EthiopicSup},
          \p{InEthiopicSup}) (32)
    \p{Block: General_Punctuation}  (Short: \p{Blk=Punctuation},
          \p{InPunctuation}; NOT \p{Punct} NOR
          \p{Is_Punctuation}) (112)
    \p{Block: Geometric_Shapes}  (Single: \p{InGeometricShapes}) (96)
    \p{Block: Geometric_Shapes_Ext}  \p{Block=
          Geometric_Shapes_Extended} (128)
    \p{Block: Geometric_Shapes_Extended}  (Short: \p{Blk=
          GeometricShapesExt},
          \p{InGeometricShapesExt}) (128)
    \p{Block: Georgian}  (Single: \p{InGeorgian}; NOT \p{Georgian}
          NOR \p{Is_Georgian}) (96)
    \p{Block: Georgian_Sup}  \p{Block=Georgian_Supplement} (48)
    \p{Block: Georgian_Supplement}  (Short: \p{Blk=GeorgianSup},
          \p{InGeorgianSup}) (48)
    \p{Block: Glagolitic}  (Single: \p{InGlagolitic}; NOT
          \p{Glagolitic} NOR \p{Is_Glagolitic})
          (96)
    \p{Block: Gothic}  (Single: \p{InGothic}; NOT \p{Gothic} NOR
          \p{Is_Gothic}) (32)
    \p{Block: Grantha}  (Single: \p{InGrantha}; NOT \p{Grantha}
          NOR \p{Is_Grantha}) (128)
    \p{Block: Greek}  \p{Block=Greek_And_Coptic} (NOT \p{Greek}
          NOR \p{Is_Greek}) (144)
    \p{Block: Greek_And_Coptic}  (Short: \p{Blk=Greek}, \p{InGreek};
          NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_Ext}  \p{Block=Greek_Extended} (256)
    \p{Block: Greek_Extended}  (Short: \p{Blk=GreekExt},
          \p{InGreekExt}) (256)
    \p{Block: Gujarati}  (Single: \p{InGujarati}; NOT \p{Gujarati}
          NOR \p{Is_Gujarati}) (128)
    \p{Block: Gurmukhi}  (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
          NOR \p{Is_Gurmukhi}) (128)
    \p{Block: Half_And_Full_Forms}  \p{Block=
          Halfwidth_And_Fullwidth_Forms} (240)
    \p{Block: Half_Marks}  \p{Block=Combining_Half_Marks} (16)
    \p{Block: Halfwidth_And_Fullwidth_Forms}  (Short: \p{Blk=
          HalfAndFullForms},
  544. \p{InHalfAndFullForms}) (240)
  545. \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
  546. NOR \p{Is_Hangul}) (11_184)
  547. \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo},
  548. \p{InCompatJamo}) (96)
  549. \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}, \p{InJamo}) (256)
  550. \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA},
  551. \p{InJamoExtA}) (32)
  552. \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB},
  553. \p{InJamoExtB}) (80)
  554. \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}, \p{InHangul};
  555. NOT \p{Hangul} NOR \p{Is_Hangul})
  556. (11_184)
  557. \p{Block: Hanunoo} (Single: \p{InHanunoo}; NOT \p{Hanunoo}
  558. NOR \p{Is_Hanunoo}) (32)
  559. \p{Block: Hebrew} (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
  560. \p{Is_Hebrew}) (112)
  561. \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
  562. HighPUSurrogates},
  563. \p{InHighPUSurrogates}) (128)
  564. \p{Block: High_PU_Surrogates} \p{Block=
  565. High_Private_Use_Surrogates} (128)
  566. \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
  567. \p{Block: Hiragana} (Single: \p{InHiragana}; NOT \p{Hiragana}
  568. NOR \p{Is_Hiragana}) (96)
  569. \p{Block: IDC} \p{Block=
  570. Ideographic_Description_Characters} (NOT
  571. \p{ID_Continue} NOR \p{Is_IDC}) (16)
  572. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC},
  573. \p{InIDC}; NOT \p{ID_Continue} NOR
  574. \p{Is_IDC}) (16)
  575. \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
  576. \p{Imperial_Aramaic} NOR
  577. \p{Is_Imperial_Aramaic}) (32)
  578. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  579. (16)
  580. \p{Block: Inscriptional_Pahlavi} (Single:
  581. \p{InInscriptionalPahlavi}; NOT
  582. \p{Inscriptional_Pahlavi} NOR
  583. \p{Is_Inscriptional_Pahlavi}) (32)
  584. \p{Block: Inscriptional_Parthian} (Single:
  585. \p{InInscriptionalParthian}; NOT
  586. \p{Inscriptional_Parthian} NOR
  587. \p{Is_Inscriptional_Parthian}) (32)
  588. \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  589. \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}, \p{InIPAExt})
  590. (96)
  591. \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  592. \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  593. \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  594. \p{Block: Javanese} (Single: \p{InJavanese}; NOT \p{Javanese}
  595. NOR \p{Is_Javanese}) (96)
  596. \p{Block: Kaithi} (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
  597. \p{Is_Kaithi}) (80)
  598. \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  599. \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}, \p{InKanaSup})
  600. (256)
  601. \p{Block: Kanbun} (Single: \p{InKanbun}) (16)
  602. \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  603. \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}, \p{InKangxi})
  604. (224)
  605. \p{Block: Kannada} (Single: \p{InKannada}; NOT \p{Kannada}
  606. NOR \p{Is_Kannada}) (128)
  607. \p{Block: Katakana} (Single: \p{InKatakana}; NOT \p{Katakana}
  608. NOR \p{Is_Katakana}) (96)
  609. \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  610. \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
  611. KatakanaExt}, \p{InKatakanaExt}) (16)
  612. \p{Block: Kayah_Li} (Single: \p{InKayahLi}; NOT \p{Kayah_Li}
  613. NOR \p{Is_Kayah_Li}) (48)
  614. \p{Block: Kharoshthi} (Single: \p{InKharoshthi}; NOT
  615. \p{Kharoshthi} NOR \p{Is_Kharoshthi})
  616. (96)
  617. \p{Block: Khmer} (Single: \p{InKhmer}; NOT \p{Khmer} NOR
  618. \p{Is_Khmer}) (128)
  619. \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
  620. \p{Block: Khojki} (Single: \p{InKhojki}; NOT \p{Khojki} NOR
  621. \p{Is_Khojki}) (80)
  622. \p{Block: Khudawadi} (Single: \p{InKhudawadi}; NOT
  623. \p{Khudawadi} NOR \p{Is_Khudawadi}) (80)
  624. \p{Block: Lao} (Single: \p{InLao}; NOT \p{Lao} NOR
  625. \p{Is_Lao}) (128)
  626. \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  627. \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  628. \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
  629. \p{InLatin1}) (128)
  630. \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  631. \p{Block: Latin_Ext_Additional} \p{Block=
  632. Latin_Extended_Additional} (256)
  633. \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  634. \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  635. \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  636. \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
  637. \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA},
  638. \p{InLatinExtA}) (128)
  639. \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
  640. LatinExtAdditional},
  641. \p{InLatinExtAdditional}) (256)
  642. \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB},
  643. \p{InLatinExtB}) (208)
  644. \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC},
  645. \p{InLatinExtC}) (32)
  646. \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD},
  647. \p{InLatinExtD}) (224)
  648. \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE},
  649. \p{InLatinExtE}) (64)
  650. \p{Block: Lepcha} (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
  651. \p{Is_Lepcha}) (80)
  652. \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
  653. (80)
  654. \p{Block: Limbu} (Single: \p{InLimbu}; NOT \p{Limbu} NOR
  655. \p{Is_Limbu}) (80)
  656. \p{Block: Linear_A} (Single: \p{InLinearA}; NOT \p{Linear_A}
  657. NOR \p{Is_Linear_A}) (384)
  658. \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
  659. (128)
  660. \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
  661. (128)
  662. \p{Block: Lisu} (Single: \p{InLisu}) (48)
  663. \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
  664. \p{Block: Lycian} (Single: \p{InLycian}; NOT \p{Lycian} NOR
  665. \p{Is_Lycian}) (32)
  666. \p{Block: Lydian} (Single: \p{InLydian}; NOT \p{Lydian} NOR
  667. \p{Is_Lydian}) (32)
  668. \p{Block: Mahajani} (Single: \p{InMahajani}; NOT \p{Mahajani}
  669. NOR \p{Is_Mahajani}) (48)
  670. \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  671. \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}, \p{InMahjong})
  672. (48)
  673. \p{Block: Malayalam} (Single: \p{InMalayalam}; NOT
  674. \p{Malayalam} NOR \p{Is_Malayalam}) (128)
  675. \p{Block: Mandaic} (Single: \p{InMandaic}; NOT \p{Mandaic}
  676. NOR \p{Is_Mandaic}) (32)
  677. \p{Block: Manichaean} (Single: \p{InManichaean}; NOT
  678. \p{Manichaean} NOR \p{Is_Manichaean})
  679. (64)
  680. \p{Block: Math_Alphanum} \p{Block=
  681. Mathematical_Alphanumeric_Symbols} (1024)
  682. \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  683. \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
  684. MathAlphanum}, \p{InMathAlphanum}) (1024)
  685. \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators},
  686. \p{InMathOperators}) (256)
  687. \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
  688. \p{Meetei_Mayek} NOR
  689. \p{Is_Meetei_Mayek}) (64)
  690. \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  691. \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt},
  692. \p{InMeeteiMayekExt}) (32)
  693. \p{Block: Mende_Kikakui} (Single: \p{InMendeKikakui}; NOT
  694. \p{Mende_Kikakui} NOR
  695. \p{Is_Mende_Kikakui}) (224)
  696. \p{Block: Meroitic_Cursive} (Single: \p{InMeroiticCursive}; NOT
  697. \p{Meroitic_Cursive} NOR
  698. \p{Is_Meroitic_Cursive}) (96)
  699. \p{Block: Meroitic_Hieroglyphs} (Single:
  700. \p{InMeroiticHieroglyphs}) (32)
  701. \p{Block: Miao} (Single: \p{InMiao}; NOT \p{Miao} NOR
  702. \p{Is_Miao}) (160)
  703. \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
  704. (256)
  705. \p{Block: Misc_Math_Symbols_A} \p{Block=
  706. Miscellaneous_Mathematical_Symbols_A}
  707. (48)
  708. \p{Block: Misc_Math_Symbols_B} \p{Block=
  709. Miscellaneous_Mathematical_Symbols_B}
  710. (128)
  711. \p{Block: Misc_Pictographs} \p{Block=
  712. Miscellaneous_Symbols_And_Pictographs}
  713. (768)
  714. \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  715. \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  716. \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
  717. MiscMathSymbolsA},
  718. \p{InMiscMathSymbolsA}) (48)
  719. \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
  720. MiscMathSymbolsB},
  721. \p{InMiscMathSymbolsB}) (128)
  722. \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols},
  723. \p{InMiscSymbols}) (256)
  724. \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
  725. MiscArrows}, \p{InMiscArrows}) (256)
  726. \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
  727. MiscPictographs}, \p{InMiscPictographs})
  728. (768)
  729. \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical},
  730. \p{InMiscTechnical}) (256)
  731. \p{Block: Modi} (Single: \p{InModi}; NOT \p{Modi} NOR
  732. \p{Is_Modi}) (96)
  733. \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  734. \p{Block: Modifier_Tone_Letters} (Single:
  735. \p{InModifierToneLetters}) (32)
  736. \p{Block: Mongolian} (Single: \p{InMongolian}; NOT
  737. \p{Mongolian} NOR \p{Is_Mongolian}) (176)
  738. \p{Block: Mro} (Single: \p{InMro}; NOT \p{Mro} NOR
  739. \p{Is_Mro}) (48)
  740. \p{Block: Music} \p{Block=Musical_Symbols} (256)
  741. \p{Block: Musical_Symbols} (Short: \p{Blk=Music}, \p{InMusic})
  742. (256)
  743. \p{Block: Myanmar} (Single: \p{InMyanmar}; NOT \p{Myanmar}
  744. NOR \p{Is_Myanmar}) (160)
  745. \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  746. \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
  747. \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA},
  748. \p{InMyanmarExtA}) (32)
  749. \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB},
  750. \p{InMyanmarExtB}) (32)
  751. \p{Block: Nabataean} (Single: \p{InNabataean}; NOT
  752. \p{Nabataean} NOR \p{Is_Nabataean}) (48)
  753. \p{Block: NB} \p{Block=No_Block} (857_776 plus all
  754. above-Unicode code points)
  755. \p{Block: New_Tai_Lue} (Single: \p{InNewTaiLue}; NOT
  756. \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
  757. (96)
  758. \p{Block: NKo} (Single: \p{InNKo}; NOT \p{Nko} NOR
  759. \p{Is_NKo}) (64)
  760. \p{Block: No_Block} (Short: \p{Blk=NB}, \p{InNB}) (857_776
  761. plus all above-Unicode code points)
  762. \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
  763. \p{Block: OCR} \p{Block=Optical_Character_Recognition}
  764. (32)
  765. \p{Block: Ogham} (Single: \p{InOgham}; NOT \p{Ogham} NOR
  766. \p{Is_Ogham}) (32)
  767. \p{Block: Ol_Chiki} (Single: \p{InOlChiki}) (48)
  768. \p{Block: Old_Italic} (Single: \p{InOldItalic}; NOT
  769. \p{Old_Italic} NOR \p{Is_Old_Italic})
  770. (48)
  771. \p{Block: Old_North_Arabian} (Single: \p{InOldNorthArabian}) (32)
  772. \p{Block: Old_Permic} (Single: \p{InOldPermic}; NOT
  773. \p{Old_Permic} NOR \p{Is_Old_Permic})
  774. (48)
  775. \p{Block: Old_Persian} (Single: \p{InOldPersian}; NOT
  776. \p{Old_Persian} NOR \p{Is_Old_Persian})
  777. (64)
  778. \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
  779. \p{Block: Old_Turkic} (Single: \p{InOldTurkic}; NOT
  780. \p{Old_Turkic} NOR \p{Is_Old_Turkic})
  781. (80)
  782. \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR},
  783. \p{InOCR}) (32)
  784. \p{Block: Oriya} (Single: \p{InOriya}; NOT \p{Oriya} NOR
  785. \p{Is_Oriya}) (128)
  786. \p{Block: Ornamental_Dingbats} (Single: \p{InOrnamentalDingbats})
  787. (48)
  788. \p{Block: Osmanya} (Single: \p{InOsmanya}; NOT \p{Osmanya}
  789. NOR \p{Is_Osmanya}) (48)
  790. \p{Block: Pahawh_Hmong} (Single: \p{InPahawhHmong}; NOT
  791. \p{Pahawh_Hmong} NOR
  792. \p{Is_Pahawh_Hmong}) (144)
  793. \p{Block: Palmyrene} (Single: \p{InPalmyrene}) (32)
  794. \p{Block: Pau_Cin_Hau} (Single: \p{InPauCinHau}; NOT
  795. \p{Pau_Cin_Hau} NOR \p{Is_Pau_Cin_Hau})
  796. (64)
  797. \p{Block: Phags_Pa} (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
  798. NOR \p{Is_Phags_Pa}) (64)
  799. \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  800. \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}, \p{InPhaistos})
  801. (48)
  802. \p{Block: Phoenician} (Single: \p{InPhoenician}; NOT
  803. \p{Phoenician} NOR \p{Is_Phoenician})
  804. (32)
  805. \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  806. \p{Block: Phonetic_Ext_Sup} \p{Block=
  807. Phonetic_Extensions_Supplement} (64)
  808. \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt},
  809. \p{InPhoneticExt}) (128)
  810. \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
  811. PhoneticExtSup}, \p{InPhoneticExtSup})
  812. (64)
  813. \p{Block: Playing_Cards} (Single: \p{InPlayingCards}) (96)
  814. \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
  815. \p{Private_Use} NOR \p{Is_Private_Use})
  816. (6400)
  817. \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}, \p{InPUA}; NOT
  818. \p{Private_Use} NOR \p{Is_Private_Use})
  819. (6400)
  820. \p{Block: Psalter_Pahlavi} (Single: \p{InPsalterPahlavi}; NOT
  821. \p{Psalter_Pahlavi} NOR
  822. \p{Is_Psalter_Pahlavi}) (48)
  823. \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
  824. \p{Private_Use} NOR \p{Is_Private_Use})
  825. (6400)
  826. \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
  827. \p{Punct} NOR \p{Is_Punctuation}) (112)
  828. \p{Block: Rejang} (Single: \p{InRejang}; NOT \p{Rejang} NOR
  829. \p{Is_Rejang}) (48)
  830. \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  831. \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}, \p{InRumi})
  832. (32)
  833. \p{Block: Runic} (Single: \p{InRunic}; NOT \p{Runic} NOR
  834. \p{Is_Runic}) (96)
  835. \p{Block: Samaritan} (Single: \p{InSamaritan}; NOT
  836. \p{Samaritan} NOR \p{Is_Samaritan}) (64)
  837. \p{Block: Saurashtra} (Single: \p{InSaurashtra}; NOT
  838. \p{Saurashtra} NOR \p{Is_Saurashtra})
  839. (96)
  840. \p{Block: Sharada} (Single: \p{InSharada}; NOT \p{Sharada}
  841. NOR \p{Is_Sharada}) (96)
  842. \p{Block: Shavian} (Single: \p{InShavian}) (48)
  843. \p{Block: Shorthand_Format_Controls} (Single:
  844. \p{InShorthandFormatControls}) (16)
  845. \p{Block: Siddham} (Single: \p{InSiddham}; NOT \p{Siddham}
  846. NOR \p{Is_Siddham}) (128)
  847. \p{Block: Sinhala} (Single: \p{InSinhala}; NOT \p{Sinhala}
  848. NOR \p{Is_Sinhala}) (128)
  849. \p{Block: Sinhala_Archaic_Numbers} (Single:
  850. \p{InSinhalaArchaicNumbers}) (32)
  851. \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms},
  852. \p{InSmallForms}) (32)
  853. \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  854. \p{Block: Sora_Sompeng} (Single: \p{InSoraSompeng}; NOT
  855. \p{Sora_Sompeng} NOR
  856. \p{Is_Sora_Sompeng}) (48)
  857. \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
  858. ModifierLetters}, \p{InModifierLetters})
  859. (80)
  860. \p{Block: Specials} (Single: \p{InSpecials}) (16)
  861. \p{Block: Sundanese} (Single: \p{InSundanese}; NOT
  862. \p{Sundanese} NOR \p{Is_Sundanese}) (64)
  863. \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  864. \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup},
  865. \p{InSundaneseSup}) (16)
  866. \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  867. \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  868. \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
  869. \p{Block: Sup_Math_Operators} \p{Block=
  870. Supplemental_Mathematical_Operators}
  871. (256)
  872. \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
  873. (65_536)
  874. \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
  875. (65_536)
  876. \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  877. \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  878. \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
  879. SuperAndSub}, \p{InSuperAndSub}) (48)
  880. \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA},
  881. \p{InSupArrowsA}) (16)
  882. \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB},
  883. \p{InSupArrowsB}) (128)
  884. \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC},
  885. \p{InSupArrowsC}) (256)
  886. \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
  887. SupMathOperators},
  888. \p{InSupMathOperators}) (256)
  889. \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
  890. SupPunctuation}, \p{InSupPunctuation})
  891. (128)
  892. \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
  893. SupPUAA}, \p{InSupPUAA}) (65_536)
  894. \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
  895. SupPUAB}, \p{InSupPUAB}) (65_536)
  896. \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
  897. \p{Syloti_Nagri} NOR
  898. \p{Is_Syloti_Nagri}) (48)
  899. \p{Block: Syriac} (Single: \p{InSyriac}; NOT \p{Syriac} NOR
  900. \p{Is_Syriac}) (80)
  901. \p{Block: Tagalog} (Single: \p{InTagalog}; NOT \p{Tagalog}
  902. NOR \p{Is_Tagalog}) (32)
  903. \p{Block: Tagbanwa} (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
  904. NOR \p{Is_Tagbanwa}) (32)
  905. \p{Block: Tags} (Single: \p{InTags}) (128)
  906. \p{Block: Tai_Le} (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
  907. \p{Is_Tai_Le}) (48)
  908. \p{Block: Tai_Tham} (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
  909. NOR \p{Is_Tai_Tham}) (144)
  910. \p{Block: Tai_Viet} (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
  911. NOR \p{Is_Tai_Viet}) (96)
  912. \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  913. \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing},
  914. \p{InTaiXuanJing}) (96)
  915. \p{Block: Takri} (Single: \p{InTakri}; NOT \p{Takri} NOR
  916. \p{Is_Takri}) (80)
  917. \p{Block: Tamil} (Single: \p{InTamil}; NOT \p{Tamil} NOR
  918. \p{Is_Tamil}) (128)
  919. \p{Block: Telugu} (Single: \p{InTelugu}; NOT \p{Telugu} NOR
  920. \p{Is_Telugu}) (128)
  921. \p{Block: Thaana} (Single: \p{InThaana}; NOT \p{Thaana} NOR
  922. \p{Is_Thaana}) (64)
  923. \p{Block: Thai} (Single: \p{InThai}; NOT \p{Thai} NOR
  924. \p{Is_Thai}) (128)
  925. \p{Block: Tibetan} (Single: \p{InTibetan}; NOT \p{Tibetan}
  926. NOR \p{Is_Tibetan}) (256)
  927. \p{Block: Tifinagh} (Single: \p{InTifinagh}; NOT \p{Tifinagh}
  928. NOR \p{Is_Tifinagh}) (80)
  929. \p{Block: Tirhuta} (Single: \p{InTirhuta}; NOT \p{Tirhuta}
  930. NOR \p{Is_Tirhuta}) (96)
  931. \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
  932. (128)
  933. \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
  934. TransportAndMap}, \p{InTransportAndMap})
  935. (128)
  936. \p{Block: UCAS} \p{Block=
  937. Unified_Canadian_Aboriginal_Syllabics}
  938. (640)
  939. \p{Block: UCAS_Ext} \p{Block=
  940. Unified_Canadian_Aboriginal_Syllabics_Extended}
  941. (80)
  942. \p{Block: Ugaritic} (Single: \p{InUgaritic}; NOT \p{Ugaritic}
  943. NOR \p{Is_Ugaritic}) (32)
  944. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  945. UCAS}, \p{InUCAS}) (640)
  946. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
  947. \p{Blk=UCASExt}, \p{InUCASExt}) (80)
  948. \p{Block: Vai} (Single: \p{InVai}; NOT \p{Vai} NOR
  949. \p{Is_Vai}) (320)
  950. \p{Block: Variation_Selectors} (Short: \p{Blk=VS}, \p{InVS}; NOT
  951. \p{Variation_Selector} NOR \p{Is_VS})
  952. (16)
  953. \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup},
  954. \p{InVSSup}) (240)
  955. \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  956. \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt},
  957. \p{InVedicExt}) (48)
  958. \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
  959. \p{Block: VS} \p{Block=Variation_Selectors} (NOT
  960. \p{Variation_Selector} NOR \p{Is_VS})
  961. (16)
  962. \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
  963. (240)
  964. \p{Block: Warang_Citi} (Single: \p{InWarangCiti}; NOT
  965. \p{Warang_Citi} NOR \p{Is_Warang_Citi})
  966. (96)
  967. \p{Block: Yi_Radicals} (Single: \p{InYiRadicals}) (64)
  968. \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
  969. \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  970. \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing},
  971. \p{InYijing}) (64)
  972. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  973. \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
  974. \p{Block=Bopomofo}) (70)
  975. \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
  976. \p{Block=Bopomofo}) (70)
  977. X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
  978. Bopomofo_Extended}) (32)
  979. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
  980. \p{InBopomofoExt}) (32)
  981. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  982. \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  983. \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
  984. \p{Block=Brahmi}) (109)
  985. \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
  986. \p{Block=Brahmi}) (109)
  987. \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
  988. \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
  989. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
  990. \p{InBraille}) (256)
  991. \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
  992. \p{Block=Buginese}) (30)
  993. \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
  994. \p{Block=Buginese}) (30)
  995. \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
  996. \p{Block=Buhid}) (20)
  997. \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
  998. \p{Block=Buhid}) (20)
  999. X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
  1000. Byzantine_Musical_Symbols}) (256)
  1001. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  1002. (Short: \p{InByzantineMusic}) (256)
  1003. \p{C} \pC \p{Other} (= \p{General_Category=Other})
  1004. (1_001_306 plus all above-Unicode code
  1005. points)
  1006. \p{Cakm} \p{Chakma} (= \p{Script=Chakma}) (NOT
  1007. \p{Block=Chakma}) (67)
  1008. \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
  1009. \p{Cans}) (710)
  1010. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  1011. (= \p{Block=
  1012. Unified_Canadian_Aboriginal_Syllabics})
  1013. (640)
  1014. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  1015. Not_Reordered} (1_113_367 plus all
  1016. above-Unicode code points)
  1017. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  1018. Overlay} (32)
  1019. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  1020. Nukta} (19)
  1021. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  1022. Kana_Voicing} (2)
  1023. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  1024. Virama} (44)
  1025. T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
  1026. CCC10} (1)
  1027. T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
  1028. CCC11} (1)
  1029. T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
  1030. CCC12} (1)
  1031. T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
  1032. CCC13} (1)
  1033. T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
  1034. CCC14} (1)
  1035. T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
  1036. CCC15} (1)
  1037. T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
  1038. CCC16} (1)
  1039. T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
  1040. CCC17} (1)
  1041. T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
  1042. CCC18} (2)
  1043. T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
  1044. CCC19} (2)
  1045. T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
  1046. CCC20} (1)
  1047. T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
  1048. CCC21} (1)
  1049. T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
  1050. CCC22} (1)
  1051. T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
  1052. CCC23} (1)
  1053. T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
  1054. CCC24} (1)
  1055. T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
  1056. CCC25} (1)
  1057. T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
  1058. CCC26} (1)
  1059. T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
  1060. CCC27} (2)
  1061. T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
  1062. CCC28} (2)
  1063. T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
  1064. CCC29} (2)
  1065. T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
  1066. CCC30} (2)
  1067. T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
  1068. CCC31} (2)
  1069. T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
  1070. CCC32} (2)
  1071. T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
  1072. CCC33} (1)
  1073. T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
  1074. CCC34} (1)
  1075. T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
  1076. CCC35} (1)
  1077. T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
  1078. CCC36} (1)
  1079. T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
  1080. CCC84} (1)
  1081. T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
  1082. CCC91} (1)
  1083. T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
  1084. CCC103} (2)
  1085. T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
  1086. CCC107} (4)
  1087. T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
  1088. CCC118} (2)
  1089. T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
  1090. CCC122} (4)
  1091. T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
  1092. CCC129} (1)
  1093. T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
  1094. CCC130} (6)
  1095. T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
  1096. CCC132} (1)
  1097. T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
  1098. CCC133} (0)
  1099. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  1100. Attached_Below_Left} (0)
  1101. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  1102. Attached_Below} (5)
  1103. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  1104. Attached_Above} (1)
  1105. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  1106. Attached_Above_Right} (9)
  1107. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  1108. Below_Left} (1)
  1109. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  1110. Below} (152)
  1111. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  1112. Below_Right} (4)
  1113. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  1114. Left} (2)
  1115. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  1116. Right} (1)
  1117. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  1118. Above_Left} (3)
  1119. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  1120. Above} (399)
  1121. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  1122. Above_Right} (4)
  1123. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  1124. Double_Below} (4)
  1125. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  1126. Double_Above} (5)
  1127. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  1128. Iota_Subscript} (1)
  1129. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  1130. Above} (399)
  1131. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (399)
  1132. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  1133. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  1134. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  1135. Above_Left} (3)
  1136. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  1137. Above_Right} (4)
  1138. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  1139. Attached_Above} (1)
  1140. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  1141. Attached_Above_Right} (9)
  1142. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  1143. Attached_Below} (5)
  1144. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  1145. Attached_Below_Left} (0)
  1146. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  1147. (1)
  1148. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  1149. \p{Ccc=ATAR}) (9)
  1150. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  1151. (5)
  1152. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  1153. ATBL}) (0)
  1154. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  1155. Below} (152)
  1156. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (152)
  1157. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  1158. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  1159. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  1160. Below_Left} (1)
  1161. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  1162. Below_Right} (4)
  1163. \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
  1164. \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
  1165. \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
  1166. \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
  1167. \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
  1168. \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
  1169. \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
  1170. \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
  1171. \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
  1172. \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
  1173. \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
  1174. \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  1175. \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
  1176. \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
  1177. \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
  1178. \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
  1179. \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
  1180. \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
  1181. \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
  1182. \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
  1183. \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
  1184. \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
  1185. \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
  1186. \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
  1187. \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
  1188. \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
  1189. \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
  1190. \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
  1191. \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
  1192. \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
  1193. \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
  1194. \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
  1195. \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
  1196. \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
  1197. \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
  1198. \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
  1199. \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
  1200. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  1201. Double_Above} (5)
  1202. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  1203. Double_Below} (4)
  1204. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  1205. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  1206. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  1207. (1)
  1208. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  1209. Iota_Subscript} (1)
  1210. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  1211. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  1212. Kana_Voicing} (2)
  1213. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  1214. Left} (2)
  1215. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  1216. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  1217. Nukta} (19)
  1218. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  1219. (1_113_367 plus all above-Unicode code
  1220. points)
  1221. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  1222. Not_Reordered} (1_113_367 plus all
  1223. above-Unicode code points)
  1224. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (19)
  1225. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  1226. Overlay} (32)
  1227. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32)
  1228. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  1229. Right} (1)
  1230. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  1231. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (44)
  1232. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  1233. Virama} (44)
  1234. \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
  1235. Canadian_Aboriginal}) (710)
  1236. \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
  1237. \p{Block=Carian}) (49)
  1238. \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
  1239. \p{Block=Carian}) (49)
  1240. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (1961)
  1241. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_112_151 plus
  1242. all above-Unicode code points)
  1243. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (1961)
  1244. \p{Cased} \p{Cased=Y} (3671)
  1245. \p{Cased: N*} (Single: \P{Cased}) (1_110_441 plus all
  1246. above-Unicode code points)
  1247. \p{Cased: Y*} (Single: \p{Cased}) (3671)
  1248. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1249. \p{LC}) (3362)
  1250. \p{Category: *} \p{General_Category: *}
  1251. \p{Caucasian_Albanian} \p{Script=Caucasian_Albanian} (Short:
  1252. \p{Aghb}; NOT \p{Block=
  1253. Caucasian_Albanian}) (53)
  1254. \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
  1255. Control}) (65)
  1256. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1257. \p{CE} \p{Composition_Exclusion} (=
  1258. \p{Composition_Exclusion=Y}) (81)
  1259. \p{CE: *} \p{Composition_Exclusion: *}
  1260. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1261. (150)
  1262. \p{Chakma} \p{Script=Chakma} (Short: \p{Cakm}; NOT
  1263. \p{Block=Chakma}) (67)
  1264. \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
  1265. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1266. \p{CWCF}) (1156)
  1267. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1268. (1_112_956 plus all above-Unicode code
  1269. points)
  1270. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1271. (1156)
  1272. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1273. \p{CWCM}) (2236)
  1274. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1275. (1_111_876 plus all above-Unicode code
  1276. points)
  1277. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1278. (2236)
  1279. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1280. \p{CWL}) (1092)
  1281. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1282. (1_113_020 plus all above-Unicode code
  1283. points)
  1284. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1092)
  1285. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1286. Y} (Short: \p{CWKCF}) (10_005)
  1287. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1288. \P{CWKCF}) (1_104_107 plus all above-
  1289. Unicode code points)
  1290. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1291. \p{CWKCF}) (10_005)
  1292. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1293. \p{CWT}) (1148)
  1294. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1295. (1_112_964 plus all above-Unicode code
  1296. points)
  1297. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1148)
  1298. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1299. \p{CWU}) (1175)
  1300. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1301. (1_112_937 plus all above-Unicode code
  1302. points)
  1303. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1175)
  1304. \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
  1305. \p{Block=Cherokee}) (85)
  1306. \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
  1307. \p{Block=Cherokee}) (85)
  1308. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1309. Y}) (1961)
  1310. \p{CI: *} \p{Case_Ignorable: *}
  1311. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1312. CJK_Unified_Ideographs}) (20_992)
  1313. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1314. CJK_Compatibility}) (256)
  1315. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1316. CJK_Compatibility_Forms}) (32)
  1317. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1318. \p{Block=CJK_Compatibility_Ideographs})
  1319. (512)
  1320. X \p{CJK_Compat_Ideographs_Sup}
  1321. \p{CJK_Compatibility_Ideographs_Supplement}
  1322. (= \p{Block=
  1323. CJK_Compatibility_Ideographs_Supplement})
  1324. (544)
  1325. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1326. \p{InCJKCompat}) (256)
  1327. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1328. (Short: \p{InCJKCompatForms}) (32)
  1329. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1330. CJK_Compatibility_Ideographs} (Short:
  1331. \p{InCJKCompatIdeographs}) (512)
  1332. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1333. CJK_Compatibility_Ideographs_Supplement}
  1334. (Short: \p{InCJKCompatIdeographsSup})
  1335. (544)
  1336. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1337. \p{Block=
  1338. CJK_Unified_Ideographs_Extension_A})
  1339. (6592)
  1340. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1341. \p{Block=
  1342. CJK_Unified_Ideographs_Extension_B})
  1343. (42_720)
  1344. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1345. \p{Block=
  1346. CJK_Unified_Ideographs_Extension_C})
  1347. (4160)
  1348. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1349. \p{Block=
  1350. CJK_Unified_Ideographs_Extension_D})
  1351. (224)
  1352. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1353. CJK_Radicals_Supplement}) (128)
  1354. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1355. (Short: \p{InCJKRadicalsSup}) (128)
  1356. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1357. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1358. \p{Block=CJK_Symbols_And_Punctuation})
  1359. (64)
  1360. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1361. CJK_Symbols_And_Punctuation} (Short:
  1362. \p{InCJKSymbols}) (64)
  1363. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1364. (Short: \p{InCJK}) (20_992)
  1365. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1366. CJK_Unified_Ideographs_Extension_A}
  1367. (Short: \p{InCJKExtA}) (6592)
  1368. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1369. CJK_Unified_Ideographs_Extension_B}
  1370. (Short: \p{InCJKExtB}) (42_720)
  1371. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1372. CJK_Unified_Ideographs_Extension_C}
  1373. (Short: \p{InCJKExtC}) (4160)
  1374. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1375. CJK_Unified_Ideographs_Extension_D}
  1376. (Short: \p{InCJKExtD}) (224)
  1377. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1378. (Short: \p{Pe}) (73)
  1379. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1380. Unassigned}) (861_575 plus all above-
  1381. Unicode code points)
  1382. \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
  1383. Control}) (65)
  1384. \p{Co} \p{Private_Use} (= \p{General_Category=
  1385. Private_Use}) (NOT \p{Private_Use_Area})
  1386. (137_468)
  1387. X \p{Combining_Diacritical_Marks} \p{Block=
  1388. Combining_Diacritical_Marks} (Short:
  1389. \p{InDiacriticals}) (112)
  1390. X \p{Combining_Diacritical_Marks_Extended} \p{Block=
  1391. Combining_Diacritical_Marks_Extended}
  1392. (Short: \p{InDiacriticalsExt}) (80)
  1393. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1394. Combining_Diacritical_Marks_For_Symbols}
  1395. (Short: \p{InDiacriticalsForSymbols})
  1396. (48)
  1397. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1398. Combining_Diacritical_Marks_Supplement}
  1399. (Short: \p{InDiacriticalsSup}) (64)
  1400. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1401. \p{InHalfMarks}) (16)
  1402. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1403. (1830)
  1404. X \p{Combining_Marks_For_Symbols}
  1405. \p{Combining_Diacritical_Marks_For_Symbols}
  1406. (= \p{Block=
  1407. Combining_Diacritical_Marks_For_Symbols})
  1408. (48)
  1409. \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (7129)
  1410. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1411. (Short: \p{InIndicNumberForms}) (16)
  1412. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1413. \p{Full_Composition_Exclusion=Y}) (1120)
  1414. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1415. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1416. Hangul_Compatibility_Jamo}) (96)
  1417. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1418. \p{CE}) (81)
  1419. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
  1420. plus all above-Unicode code points)
  1421. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  1422. \p{Connector_Punctuation} \p{General_Category=
  1423. Connector_Punctuation} (Short: \p{Pc})
  1424. (10)
  1425. \p{Control} \p{XPosixCntrl} (= \p{General_Category=
  1426. Control}) (65)
  1427. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1428. \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
  1429. \p{Block=Coptic}) (137)
  1430. \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
  1431. \p{Block=Coptic}) (137)
  1432. X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
  1433. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1434. Counting_Rod_Numerals}) (32)
  1435. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1436. \p{InCountingRod}) (32)
  1437. \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
  1438. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1439. Surrogate}) (2048)
  1440. \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
  1441. \p{Block=Cuneiform}) (1037)
  1442. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1443. \p{Block=
  1444. Cuneiform_Numbers_And_Punctuation}) (128)
  1445. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1446. Cuneiform_Numbers_And_Punctuation}
  1447. (Short: \p{InCuneiformNumbers}) (128)
  1448. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1449. (Short: \p{Sc}) (52)
  1450. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1451. \p{CWCF} \p{Changes_When_Casefolded} (=
  1452. \p{Changes_When_Casefolded=Y}) (1156)
  1453. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1454. \p{CWCM} \p{Changes_When_Casemapped} (=
  1455. \p{Changes_When_Casemapped=Y}) (2236)
  1456. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1457. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1458. \p{Changes_When_NFKC_Casefolded=Y})
  1459. (10_005)
  1460. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1461. \p{CWL} \p{Changes_When_Lowercased} (=
  1462. \p{Changes_When_Lowercased=Y}) (1092)
  1463. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1464. \p{CWT} \p{Changes_When_Titlecased} (=
  1465. \p{Changes_When_Titlecased=Y}) (1148)
  1466. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1467. \p{CWU} \p{Changes_When_Uppercased} (=
  1468. \p{Changes_When_Uppercased=Y}) (1175)
  1469. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1470. \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
  1471. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1472. \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
  1473. \p{Block=Cyrillic}) (431)
  1474. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1475. Cyrillic_Extended_A}) (32)
  1476. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1477. Cyrillic_Extended_B}) (96)
  1478. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1479. \p{InCyrillicExtA}) (32)
  1480. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1481. \p{InCyrillicExtB}) (96)
  1482. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1483. Cyrillic_Supplement}) (48)
  1484. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1485. \p{InCyrillicSup}) (48)
  1486. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1487. Cyrillic_Supplement}) (48)
  1488. \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
  1489. \p{Block=Cyrillic}) (431)
  1490. \p{Dash} \p{Dash=Y} (28)
  1491. \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
  1492. above-Unicode code points)
  1493. \p{Dash: Y*} (Single: \p{Dash}) (28)
  1494. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1495. (Short: \p{Pd}) (24)
  1496. \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
  1497. Decimal_Number}) (540)
  1498. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1499. (13_232)
  1500. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232)
  1501. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  1502. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1503. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  1504. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1505. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1506. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  1507. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
  1508. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1509. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  1510. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1511. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  1512. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1513. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  1514. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1515. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  1516. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1517. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  1518. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1519. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  1520. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1521. Non_Canonical} (Perl extension) (3661)
  1522. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1523. decompositions (Short: \p{Dt=NonCanon})
  1524. (Perl extension) (3661)
  1525. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_219 plus
  1526. all above-Unicode code points)
  1527. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  1528. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1529. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
  1530. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
  1531. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  1532. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
  1533. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152)
  1534. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1535. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  1536. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  1537. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1538. Y} (Short: \p{DI}) (4173)
  1539. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1540. (1_109_939 plus all above-Unicode code
  1541. points)
  1542. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1543. (4173)
  1544. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (111)
  1545. \p{Dep: *} \p{Deprecated: *}
  1546. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (111)
  1547. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_001
  1548. plus all above-Unicode code points)
  1549. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (111)
  1550. \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
  1551. \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
  1552. (NOT \p{Block=Devanagari}) (152)
  1553. \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
  1554. NOT \p{Block=Devanagari}) (152)
  1555. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  1556. Devanagari_Extended}) (32)
  1557. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  1558. \p{InDevanagariExt}) (32)
  1559. \p{DI} \p{Default_Ignorable_Code_Point} (=
  1560. \p{Default_Ignorable_Code_Point=Y})
  1561. (4173)
  1562. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  1563. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (766)
  1564. \p{Dia: *} \p{Diacritic: *}
  1565. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (766)
  1566. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_346
  1567. plus all above-Unicode code points)
  1568. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (766)
  1569. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  1570. \p{Block=Combining_Diacritical_Marks})
  1571. (112)
  1572. X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
  1573. (= \p{Block=
  1574. Combining_Diacritical_Marks_Extended})
  1575. (80)
  1576. X \p{Diacriticals_For_Symbols}
  1577. \p{Combining_Diacritical_Marks_For_Symbols}
  1578. (= \p{Block=
  1579. Combining_Diacritical_Marks_For_Symbols})
  1580. (48)
  1581. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  1582. (= \p{Block=
  1583. Combining_Diacritical_Marks_Supplement})
  1584. (64)
  1585. \p{Digit} \p{XPosixDigit} (= \p{General_Category=
  1586. Decimal_Number}) (540)
  1587. X \p{Dingbats} \p{Block=Dingbats} (192)
  1588. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  1589. Domino_Tiles}) (112)
  1590. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  1591. \p{InDomino}) (112)
  1592. \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
  1593. \p{Dt: *} \p{Decomposition_Type: *}
  1594. \p{Dupl} \p{Duployan} (= \p{Script=Duployan}) (NOT
  1595. \p{Block=Duployan}) (143)
  1596. \p{Duployan} \p{Script=Duployan} (Short: \p{Dupl}; NOT
  1597. \p{Block=Duployan}) (143)
  1598. \p{Ea: *} \p{East_Asian_Width: *}
  1599. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
  1600. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
  1601. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1602. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1603. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1604. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1605. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_894 plus
  1606. all above-Unicode code points)
  1607. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1608. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1609. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_894 plus all
  1610. above-Unicode code points)
  1611. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_134)
  1612. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_134)
  1613. \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
  1614. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1615. Egyptian_Hieroglyphs}) (1071)
  1616. \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
  1617. \p{Egyp}; NOT \p{Block=
  1618. Egyptian_Hieroglyphs}) (1071)
  1619. \p{Elba} \p{Elbasan} (= \p{Script=Elbasan}) (NOT
  1620. \p{Block=Elbasan}) (40)
  1621. \p{Elbasan} \p{Script=Elbasan} (Short: \p{Elba}; NOT
  1622. \p{Block=Elbasan}) (40)
  1623. X \p{Emoticons} \p{Block=Emoticons} (80)
  1624. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  1625. Enclosed_Alphanumerics}) (160)
  1626. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  1627. \p{Block=
  1628. Enclosed_Alphanumeric_Supplement}) (256)
  1629. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1630. Enclosed_Alphanumeric_Supplement}
  1631. (Short: \p{InEnclosedAlphanumSup}) (256)
  1632. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  1633. (Short: \p{InEnclosedAlphanum}) (160)
  1634. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  1635. \p{Block=
  1636. Enclosed_CJK_Letters_And_Months}) (256)
  1637. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1638. Enclosed_CJK_Letters_And_Months} (Short:
  1639. \p{InEnclosedCJK}) (256)
  1640. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  1641. (= \p{Block=
  1642. Enclosed_Ideographic_Supplement}) (256)
  1643. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1644. Enclosed_Ideographic_Supplement} (Short:
  1645. \p{InEnclosedIdeographicSup}) (256)
  1646. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1647. (Short: \p{Me}) (13)
  1648. \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
  1649. \p{Block=Ethiopic}) (495)
  1650. \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
  1651. \p{Block=Ethiopic}) (495)
  1652. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  1653. Ethiopic_Extended}) (96)
  1654. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  1655. Ethiopic_Extended_A}) (48)
  1656. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  1657. \p{InEthiopicExt}) (96)
  1658. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  1659. \p{InEthiopicExtA}) (48)
  1660. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  1661. Ethiopic_Supplement}) (32)
  1662. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  1663. \p{InEthiopicSup}) (32)
  1664. \p{Ext} \p{Extender} (= \p{Extender=Y}) (38)
  1665. \p{Ext: *} \p{Extender: *}
  1666. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (38)
  1667. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_074
  1668. plus all above-Unicode code points)
  1669. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (38)
  1670. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1671. (Short: \p{Pf}) (10)
  1672. \p{Format} \p{General_Category=Format} (Short:
  1673. \p{Cf}) (150)
  1674. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1675. (Short: \p{CompEx}) (1120)
  1676. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1677. \P{CompEx}) (1_112_992 plus all above-
  1678. Unicode code points)
  1679. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1680. \p{CompEx}) (1120)
  1681. \p{Gc: *} \p{General_Category: *}
  1682. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1683. \p{General_Category: C} \p{General_Category=Other} (1_001_306 plus
  1684. all above-Unicode code points)
  1685. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1686. \p{Gc=LC}, \p{LC}) (3362)
  1687. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1688. \p{General_Category: Cf} \p{General_Category=Format} (150)
  1689. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1690. (73)
  1691. \p{General_Category: Cn} \p{General_Category=Unassigned} (861_575
  1692. plus all above-Unicode code points)
  1693. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1694. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1695. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  1696. (1830)
  1697. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1698. \p{Pc}) (10)
  1699. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1700. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1701. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1702. (52)
  1703. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1704. (24)
  1705. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1706. (540)
  1707. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1708. (540)
  1709. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1710. (13)
  1711. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1712. (10)
  1713. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (150)
  1714. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1715. \p{Pi}) (12)
  1716. \p{General_Category: L} \p{General_Category=Letter} (102_725)
  1717. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3362)
  1718. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  1719. the trailing '_' matters in spite of
  1720. loose matching rules. (3362)
  1721. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3362)
  1722. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (102_725)
  1723. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1724. (236)
  1725. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1726. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1727. (/i= General_Category=Cased_Letter)
  1728. (1841)
  1729. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (248)
  1730. \p{General_Category: Lo} \p{General_Category=Other_Letter} (99_115)
  1731. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1732. /i= General_Category=Cased_Letter) (1841)
  1733. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1734. (/i= General_Category=Cased_Letter) (31)
  1735. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1736. (/i= General_Category=Cased_Letter)
  1737. (1490)
  1738. \p{General_Category: M} \p{General_Category=Mark} (1830)
  1739. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1830)
  1740. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
  1741. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (399)
  1742. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
  1743. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1744. (1418)
  1745. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1746. (248)
  1747. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1748. (116)
  1749. \p{General_Category: N} \p{General_Category=Number} (1346)
  1750. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (540)
  1751. \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
  1752. \p{General_Category: No} \p{General_Category=Other_Number} (570)
  \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
        (1418)
  \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1346)
  \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
        (75)
  \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_001_306
        plus all above-Unicode code points)
  \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
        (99_115)
  \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (570)
  \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
        (484)
  \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
        (5082)
  \p{General_Category: P} \p{General_Category=Punctuation} (688)
  \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
        \p{Zp}) (1)
  \p{General_Category: Pc} \p{General_Category=
        Connector_Punctuation} (10)
  \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
  \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
        (73)
  \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
        (10)
  \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
        (12)
  \p{General_Category: Po} \p{General_Category=Other_Punctuation}
        (484)
  \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
        (137_468)
  \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
  \p{General_Category: Punct} \p{General_Category=Punctuation} (688)
  \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (688)
  \p{General_Category: S} \p{General_Category=Symbol} (6198)
  \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (52)
  \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
  \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (116)
  \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  \p{General_Category: So} \p{General_Category=Other_Symbol} (5082)
  \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
        (17)
  \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (399)
  \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6198)
  \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
        /i= General_Category=Cased_Letter) (31)
  \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
        (861_575 plus all above-Unicode code
        points)
  \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
        /i= General_Category=Cased_Letter) (1490)
  \p{General_Category: Z} \p{General_Category=Separator} (19)
  \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
        (1)
  \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
        \p{InPunctuation}) (112)
X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
        \p{Block=Geometric_Shapes_Extended})
        (128)
X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
        (Short: \p{InGeometricShapesExt}) (128)
  \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
        \p{Block=Georgian}) (127)
  \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
        \p{Block=Georgian}) (127)
X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
        Georgian_Supplement}) (48)
X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
        \p{InGeorgianSup}) (48)
  \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
        (NOT \p{Block=Glagolitic}) (94)
  \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
        NOT \p{Block=Glagolitic}) (94)
  \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
        \p{Block=Gothic}) (27)
  \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
        \p{Block=Gothic}) (27)
  \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
        (111_345)
  \p{Gr_Base: *} \p{Grapheme_Base: *}
  \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
        Y}) (1461)
  \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  \p{Gran} \p{Grantha} (= \p{Script=Grantha}) (NOT
        \p{Block=Grantha}) (83)
  \p{Grantha} \p{Script=Grantha} (Short: \p{Gran}; NOT
        \p{Block=Grantha}) (83)
  \p{Graph} \p{XPosixGraph} (250_405)
  \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
        (111_345)
  \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase})
        (1_002_767 plus all above-Unicode code
        points)
  \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (111_345)
  \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
        (6030)
  \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6030)
  \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
        (1461)
  \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1461)
  \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_733
        plus all above-Unicode code points)
  \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
        (0)
  \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
  \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
        (26)
  \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
        Regional_Indicator} (26)
  \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
        SpacingMark} (331)
  \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (331)
  \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
        (1_094_733 plus all above-Unicode code
        points)
  \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
        (1461)
  \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_651
        plus all above-Unicode code points)
  \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1461)
  \p{Greek} \p{Script=Greek} (Short: \p{Grek}; NOT
        \p{Greek_And_Coptic}) (516)
X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
        \p{InGreek}) (144)
X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
        Greek_Extended}) (256)
X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
        \p{InGreekExt}) (256)
  \p{Grek} \p{Greek} (= \p{Script=Greek}) (NOT
        \p{Greek_And_Coptic}) (516)
  \p{Gujarati} \p{Script=Gujarati} (Short: \p{Gujr}; NOT
        \p{Block=Gujarati}) (84)
  \p{Gujr} \p{Gujarati} (= \p{Script=Gujarati}) (NOT
        \p{Block=Gujarati}) (84)
  \p{Gurmukhi} \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
        \p{Block=Gurmukhi}) (79)
  \p{Guru} \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
        \p{Block=Gurmukhi}) (79)
X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
        \p{Block=Halfwidth_And_Fullwidth_Forms})
        (240)
X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
        Combining_Half_Marks}) (16)
X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
        Halfwidth_And_Fullwidth_Forms} (Short:
        \p{InHalfAndFullForms}) (240)
  \p{Han} \p{Script=Han} (75_963)
  \p{Hang} \p{Hangul} (= \p{Script=Hangul}) (NOT
        \p{Hangul_Syllables}) (11_739)
  \p{Hangul} \p{Script=Hangul} (Short: \p{Hang}; NOT
        \p{Hangul_Syllables}) (11_739)
X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
        (Short: \p{InCompatJamo}) (96)
X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
        (256)
X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
        (Short: \p{InJamoExtA}) (32)
X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
        (Short: \p{InJamoExtB}) (80)
  \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
        (125)
  \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
  \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
        (399)
  \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
  \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
        LVT_Syllable} (10_773)
  \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
        (10_773)
  \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
        Not_Applicable} (1_102_583 plus all
        above-Unicode code points)
  \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
        (1_102_583 plus all above-Unicode code
        points)
  \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
        (137)
  \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
  \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
        (95)
  \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
        \p{InHangul}) (11_184)
  \p{Hani} \p{Han} (= \p{Script=Han}) (75_963)
  \p{Hano} \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
        \p{Block=Hanunoo}) (21)
  \p{Hanunoo} \p{Script=Hanunoo} (Short: \p{Hano}; NOT
        \p{Block=Hanunoo}) (21)
  \p{Hebr} \p{Hebrew} (= \p{Script=Hebrew}) (NOT
        \p{Block=Hebrew}) (133)
  \p{Hebrew} \p{Script=Hebrew} (Short: \p{Hebr}; NOT
        \p{Block=Hebrew}) (133)
  \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex: *} \p{Hex_Digit: *}
  \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
        plus all above-Unicode code points)
  \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
X \p{High_Private_Use_Surrogates} \p{Block=
        High_Private_Use_Surrogates} (Short:
        \p{InHighPUSurrogates}) (128)
X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
        \p{Block=High_Private_Use_Surrogates})
        (128)
X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  \p{Hira} \p{Hiragana} (= \p{Script=Hiragana}) (NOT
        \p{Block=Hiragana}) (91)
  \p{Hiragana} \p{Script=Hiragana} (Short: \p{Hira}; NOT
        \p{Block=Hiragana}) (91)
  \p{Hmng} \p{Pahawh_Hmong} (= \p{Script=
        Pahawh_Hmong}) (NOT \p{Block=
        Pahawh_Hmong}) (127)
  \p{HorizSpace} \p{XPosixBlank} (18)
  \p{Hst: *} \p{Hangul_Syllable_Type: *}
D \p{Hyphen} \p{Hyphen=Y} (11)
D \p{Hyphen: N*} Supplanted by Line_Break property values;
        see www.unicode.org/reports/tr14
        (Single: \P{Hyphen}) (1_114_101 plus all
        above-Unicode code points)
D \p{Hyphen: Y*} Supplanted by Line_Break property values;
        see www.unicode.org/reports/tr14
        (Single: \p{Hyphen}) (11)
  \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
        \p{Ideographic_Description_Characters})
        (105_343)
  \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (1_008_769
        plus all above-Unicode code points)
  \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (105_343)
  \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (102_964)
  \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (1_011_148
        plus all above-Unicode code points)
  \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (102_964)
  \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
        \p{Ideographic_Description_Characters})
        (105_343)
  \p{IDC: *} \p{ID_Continue: *}
  \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
        (75_633)
  \p{Ideo: *} \p{Ideographic: *}
  \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
        (75_633)
  \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_038_479
        plus all above-Unicode code points)
  \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (75_633)
X \p{Ideographic_Description_Characters} \p{Block=
        Ideographic_Description_Characters}
        (Short: \p{InIDC}) (16)
  \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (102_964)
  \p{IDS: *} \p{ID_Start: *}
  \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
        \p{IDSB}) (10)
  \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
        (1_114_102 plus all above-Unicode code
        points)
  \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
        \p{IDST}) (2)
  \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
        (1_114_110 plus all above-Unicode code
        points)
  \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
  \p{IDSB} \p{IDS_Binary_Operator} (=
        \p{IDS_Binary_Operator=Y}) (10)
  \p{IDSB: *} \p{IDS_Binary_Operator: *}
  \p{IDST} \p{IDS_Trinary_Operator} (=
        \p{IDS_Trinary_Operator=Y}) (2)
  \p{IDST: *} \p{IDS_Trinary_Operator: *}
  \p{Imperial_Aramaic} \p{Script=Imperial_Aramaic} (Short:
        \p{Armi}; NOT \p{Block=
        Imperial_Aramaic}) (31)
  \p{In: *} \p{Present_In: *} (Perl extension)
  \p{In_*} \p{Block: *}
X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
        Common_Indic_Number_Forms}) (16)
  \p{Inherited} \p{Script=Inherited} (Short: \p{Zinh})
        (563)
  \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
        (Short: \p{Pi}) (12)
  \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
        \p{Phli}; NOT \p{Block=
        Inscriptional_Pahlavi}) (27)
  \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
        (Short: \p{Prti}; NOT \p{Block=
        Inscriptional_Parthian}) (30)
X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
        IPA_Extensions}) (96)
X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
        \p{InIPAExt}) (96)
  \p{Is_*} \p{*} (Any exceptions are individually
        noted beginning with the word NOT.) If
        an entry has flag(s) at its beginning,
        like "D", the "Is_" form has the same
        flag(s)
  \p{Ital} \p{Old_Italic} (= \p{Script=Old_Italic})
        (NOT \p{Block=Old_Italic}) (36)
X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
        (256)
X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
        Hangul_Jamo_Extended_A}) (32)
X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
        Hangul_Jamo_Extended_B}) (80)
  \p{Java} \p{Javanese} (= \p{Script=Javanese}) (NOT
        \p{Block=Javanese}) (90)
  \p{Javanese} \p{Script=Javanese} (Short: \p{Java}; NOT
        \p{Block=Javanese}) (90)
  \p{Jg: *} \p{Joining_Group: *}
  \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  \p{Join_C: *} \p{Join_Control: *}
  \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
        plus all above-Unicode code points)
  \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (7)
  \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
  \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
  \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (21)
  \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
  \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
        BurushaskiYehBarree}) (2)
  \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15)
  \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
  \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
  \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
  \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
  \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
  \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
  \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14)
  \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
  \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
  \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
        HamzaOnHehGoal}) (1)
  \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
  \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
  \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
  \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
  \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (5)
  \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
  \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
  \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
  \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
  \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
  \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
        ManichaeanAleph}) (1)
  \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
        (2)
  \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
        (2)
  \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
        ManichaeanDaleth}) (1)
  \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
        ManichaeanDhamedh}) (1)
  \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
        (1)
  \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
        ManichaeanGimel}) (2)
  \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
        (1)
  \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
        ManichaeanHundred}) (1)
  \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
        (3)
  \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
        ManichaeanLamedh}) (1)
  \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) (1)
  \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) (1)
  \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) (1)
  \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2)
  \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
        (3)
  \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
        (1)
  \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
        ManichaeanSadhe}) (1)
  \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
        ManichaeanSamekh}) (1)
  \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) (1)
  \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) (1)
  \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
        (1)
  \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
        ManichaeanThamedh}) (1)
  \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
        ManichaeanTwenty}) (1)
  \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) (1)
  \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
        (1)
  \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
        ManichaeanZayin}) (2)
  \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
  \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
  \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
        (1_113_828 plus all above-Unicode code
        points)
  \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
  \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
  \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
  \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
  \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
  \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
  \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (18)
  \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
  \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
  \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6)
  \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
  \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
  \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
  \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
  \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1)
  \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
  \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
  \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
  \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
  \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
  \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
        Hamza_On_Heh_Goal} (1)
  \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
  \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
  \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (10)
  \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
  \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
  \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
  \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
  \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
  \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
  \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
  \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (424)
  \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (424)
  \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
  \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (3)
  \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3)
  \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_003 plus
        all above-Unicode code points)
  \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (111)
  \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (111)
  \p{Joining_Type: T} \p{Joining_Type=Transparent} (1567)
  \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1567)
  \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_112_003
        plus all above-Unicode code points)
  \p{Jt: *} \p{Joining_Type: *}
  \p{Kaithi} \p{Script=Kaithi} (Short: \p{Kthi}; NOT
        \p{Block=Kaithi}) (66)
  \p{Kali} \p{Kayah_Li} (= \p{Script=Kayah_Li}) (NOT
        \p{Block=Kayah_Li}) (47)
  \p{Kana} \p{Katakana} (= \p{Script=Katakana}) (NOT
        \p{Block=Katakana}) (300)
X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
        Kana_Supplement}) (256)
X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
        \p{InKanaSup}) (256)
X \p{Kanbun} \p{Block=Kanbun} (16)
X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
        Kangxi_Radicals}) (224)
X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
        \p{InKangxi}) (224)
  \p{Kannada} \p{Script=Kannada} (Short: \p{Knda}; NOT
        \p{Block=Kannada}) (87)
  \p{Katakana} \p{Script=Katakana} (Short: \p{Kana}; NOT
        \p{Block=Katakana}) (300)
X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
        \p{Block=Katakana_Phonetic_Extensions})
        (16)
X \p{Katakana_Phonetic_Extensions} \p{Block=
        Katakana_Phonetic_Extensions} (Short:
        \p{InKatakanaExt}) (16)
  \p{Kayah_Li} \p{Script=Kayah_Li} (Short: \p{Kali}; NOT
        \p{Block=Kayah_Li}) (47)
  \p{Khar} \p{Kharoshthi} (= \p{Script=Kharoshthi})
        (NOT \p{Block=Kharoshthi}) (65)
  \p{Kharoshthi} \p{Script=Kharoshthi} (Short: \p{Khar};
        NOT \p{Block=Kharoshthi}) (65)
  \p{Khmer} \p{Script=Khmer} (Short: \p{Khmr}; NOT
        \p{Block=Khmer}) (146)
X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  \p{Khmr} \p{Khmer} (= \p{Script=Khmer}) (NOT
        \p{Block=Khmer}) (146)
  \p{Khoj} \p{Khojki} (= \p{Script=Khojki}) (NOT
        \p{Block=Khojki}) (61)
  \p{Khojki} \p{Script=Khojki} (Short: \p{Khoj}; NOT
        \p{Block=Khojki}) (61)
  \p{Khudawadi} \p{Script=Khudawadi} (Short: \p{Sind}; NOT
        \p{Block=Khudawadi}) (69)
  \p{Knda} \p{Kannada} (= \p{Script=Kannada}) (NOT
        \p{Block=Kannada}) (87)
  \p{Kthi} \p{Kaithi} (= \p{Script=Kaithi}) (NOT
        \p{Block=Kaithi}) (66)
  \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
        (102_725)
X \p{L&} \p{Cased_Letter} (= \p{General_Category=
        Cased_Letter}) (3362)
X \p{L_} \p{Cased_Letter} (= \p{General_Category=
        Cased_Letter}) Note the trailing '_'
        matters in spite of loose matching
        rules. (3362)
  \p{Lana} \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
        \p{Block=Tai_Tham}) (127)
  \p{Lao} \p{Script=Lao} (NOT \p{Block=Lao}) (67)
  \p{Laoo} \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
        Lao}) (67)
  \p{Latin} \p{Script=Latin} (Short: \p{Latn}) (1338)
X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
        Latin_1_Supplement}) (128)
X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
        Latin_1_Supplement}) (128)
X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
        \p{InLatin1}) (128)
X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
        Latin_Extended_A}) (128)
X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
        \p{Block=Latin_Extended_Additional})
        (256)
X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
        Latin_Extended_B}) (208)
X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
        Latin_Extended_C}) (32)
X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
        Latin_Extended_D}) (224)
X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
        Latin_Extended_E}) (64)
X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
        \p{InLatinExtA}) (128)
X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
        (Short: \p{InLatinExtAdditional}) (256)
X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
        \p{InLatinExtB}) (208)
X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
        \p{InLatinExtC}) (32)
X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
        \p{InLatinExtD}) (224)
X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
        \p{InLatinExtE}) (64)
  \p{Latn} \p{Latin} (= \p{Script=Latin}) (1338)
  \p{Lb: *} \p{Line_Break: *}
  \p{LC} \p{Cased_Letter} (= \p{General_Category=
        Cased_Letter}) (3362)
  \p{Lepc} \p{Lepcha} (= \p{Script=Lepcha}) (NOT
        \p{Block=Lepcha}) (74)
  \p{Lepcha} \p{Script=Lepcha} (Short: \p{Lepc}; NOT
        \p{Block=Lepcha}) (74)
  \p{Letter} \p{General_Category=Letter} (Short: \p{L})
        (102_725)
  \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
        \p{Nl}) (236)
X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  \p{Limb} \p{Limbu} (= \p{Script=Limbu}) (NOT
        \p{Block=Limbu}) (68)
  \p{Limbu} \p{Script=Limbu} (Short: \p{Limb}; NOT
        \p{Block=Limbu}) (68)
  \p{Lina} \p{Linear_A} (= \p{Script=Linear_A}) (NOT
        \p{Block=Linear_A}) (341)
  \p{Linb} \p{Linear_B} (= \p{Script=Linear_B}) (211)
  \p{Line_Break: AI} \p{Line_Break=Ambiguous} (689)
  \p{Line_Break: AL} \p{Line_Break=Alphabetic} (17_608)
  \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (17_608)
  \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (689)
  \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
  \p{Line_Break: BA} \p{Line_Break=Break_After} (181)
  \p{Line_Break: BB} \p{Line_Break=Break_Before} (21)
  \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
  \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (181)
  \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (21)
  \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
  \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
  \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
  \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
  \p{Line_Break: CJ} \p{Line_Break=
        Conditional_Japanese_Starter} (51)
  \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (89)
  \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
  \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (89)
  \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (1820)
  \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1820)
  \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (690)
  \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
        (51)
  \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
  \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
  \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
  \p{Line_Break: EX} \p{Line_Break=Exclamation} (36)
  \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (36)
  \p{Line_Break: GL} \p{Line_Break=Glue} (18)
  \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
  \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
  \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
  \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
  \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
  \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
  \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
  \p{Line_Break: ID} \p{Line_Break=Ideographic} (162_936)
  \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (162_936)
  \p{Line_Break: IN} \p{Line_Break=Inseparable} (5)
  \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
  \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (5)
  \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (5)
  \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
  \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
  \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
  \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
  \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
  \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
  \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
  \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
  \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
  \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (29)
  \p{Line_Break: NS} \p{Line_Break=Nonstarter} (29)
  \p{Line_Break: NU} \p{Line_Break=Numeric} (532)
  \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (532)
  \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (84)
  \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (84)
  \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (29)
  \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (29)
  \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (66)
  \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (66)
  \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
  \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39)
  \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
  \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
  \p{Line_Break: SA} \p{Line_Break=Complex_Context} (690)
D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
  \p{Line_Break: SP} \p{Line_Break=Space} (1)
  \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
        should never appear in well-formed text,
        and therefore shouldn't be the basis for
        line breaking (Short: \p{Lb=SG}) (2048)
  \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
  \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (915_480 plus all
        above-Unicode code points)
  \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
  \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
  \p{Line_Break: XX} \p{Line_Break=Unknown} (915_480 plus all
        above-Unicode code points)
  \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
  \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
  \p{Line_Separator} \p{General_Category=Line_Separator}
        (Short: \p{Zl}) (1)
  \p{Linear_A} \p{Script=Linear_A} (Short: \p{Lina}; NOT
        \p{Block=Linear_A}) (341)
  \p{Linear_B} \p{Script=Linear_B} (Short: \p{Linb}) (211)
X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
  \p{Lisu} \p{Script=Lisu} (48)
  \p{Ll} \p{Lowercase_Letter} (=
        \p{General_Category=Lowercase_Letter})
        (/i= General_Category=Cased_Letter)
        (1841)
  \p{Lm} \p{Modifier_Letter} (=
        \p{General_Category=Modifier_Letter})
        (248)
  \p{Lo} \p{Other_Letter} (= \p{General_Category=
        Other_Letter}) (99_115)
  \p{LOE} \p{Logical_Order_Exception} (=
        \p{Logical_Order_Exception=Y}) (15)
  \p{LOE: *} \p{Logical_Order_Exception: *}
  \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
        \p{LOE}) (15)
  \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
        (1_114_097 plus all above-Unicode code
        points)
  \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
  \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
        Cased=Yes) (2030)
  \p{Lower: *} \p{Lowercase: *}
  \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
        Cased=Yes) (2030)
  \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
        No) (1_112_082 plus all above-Unicode
        code points)
  \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
        Yes) (2030)
  \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
        (Short: \p{Ll}; /i= General_Category=
        Cased_Letter) (1841)
  \p{Lt} \p{Titlecase_Letter} (=
        \p{General_Category=Titlecase_Letter})
        (/i= General_Category=Cased_Letter) (31)
  \p{Lu} \p{Uppercase_Letter} (=
        \p{General_Category=Uppercase_Letter})
        (/i= General_Category=Cased_Letter)
        (1490)
  \p{Lyci} \p{Lycian} (= \p{Script=Lycian}) (NOT
        \p{Block=Lycian}) (29)
  \p{Lycian} \p{Script=Lycian} (Short: \p{Lyci}; NOT
        \p{Block=Lycian}) (29)
  \p{Lydi} \p{Lydian} (= \p{Script=Lydian}) (NOT
        \p{Block=Lydian}) (27)
  \p{Lydian} \p{Script=Lydian} (Short: \p{Lydi}; NOT
        \p{Block=Lydian}) (27)
  2451. \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
  2452. (1830)
  2453. \p{Mahajani} \p{Script=Mahajani} (Short: \p{Mahj}; NOT
  2454. \p{Block=Mahajani}) (39)
  2455. \p{Mahj} \p{Mahajani} (= \p{Script=Mahajani}) (NOT
  2456. \p{Block=Mahajani}) (39)
  2457. X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
  2458. Mahjong_Tiles}) (48)
  2459. X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
  2460. \p{InMahjong}) (48)
  2461. \p{Malayalam} \p{Script=Malayalam} (Short: \p{Mlym}; NOT
  2462. \p{Block=Malayalam}) (99)
  2463. \p{Mand} \p{Mandaic} (= \p{Script=Mandaic}) (NOT
  2464. \p{Block=Mandaic}) (29)
  2465. \p{Mandaic} \p{Script=Mandaic} (Short: \p{Mand}; NOT
  2466. \p{Block=Mandaic}) (29)
  2467. \p{Mani} \p{Manichaean} (= \p{Script=Manichaean})
  2468. (NOT \p{Block=Manichaean}) (51)
  2469. \p{Manichaean} \p{Script=Manichaean} (Short: \p{Mani};
  2470. NOT \p{Block=Manichaean}) (51)
  2471. \p{Mark} \p{General_Category=Mark} (Short: \p{M})
  2472. (1830)
  2473. \p{Math} \p{Math=Y} (2310)
  2474. \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
  2475. above-Unicode code points)
  2476. \p{Math: Y*} (Single: \p{Math}) (2310)
  2477. X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
  2478. \p{Block=
  2479. Mathematical_Alphanumeric_Symbols})
  2480. (1024)
  2481. X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
  2482. Mathematical_Operators}) (256)
  2483. \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
  2484. \p{Sm}) (948)
  2485. X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
  2486. Mathematical_Alphanumeric_Symbols}
  2487. (Short: \p{InMathAlphanum}) (1024)
  2488. X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
  2489. (Short: \p{InMathOperators}) (256)
  2490. \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
  2491. Spacing_Mark}) (399)
  2492. \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
  2493. Enclosing_Mark}) (13)
  2494. \p{Meetei_Mayek} \p{Script=Meetei_Mayek} (Short: \p{Mtei};
  2495. NOT \p{Block=Meetei_Mayek}) (79)
  2496. X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
  2497. Meetei_Mayek_Extensions}) (32)
  2498. X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
  2499. (Short: \p{InMeeteiMayekExt}) (32)
  2500. \p{Mend} \p{Mende_Kikakui} (= \p{Script=
  2501. Mende_Kikakui}) (NOT \p{Block=
  2502. Mende_Kikakui}) (213)
  2503. \p{Mende_Kikakui} \p{Script=Mende_Kikakui} (Short: \p{Mend};
  2504. NOT \p{Block=Mende_Kikakui}) (213)
  2505. \p{Merc} \p{Meroitic_Cursive} (= \p{Script=
  2506. Meroitic_Cursive}) (NOT \p{Block=
  2507. Meroitic_Cursive}) (26)
  2508. \p{Mero} \p{Meroitic_Hieroglyphs} (= \p{Script=
  2509. Meroitic_Hieroglyphs}) (32)
  2510. \p{Meroitic_Cursive} \p{Script=Meroitic_Cursive} (Short:
  2511. \p{Merc}; NOT \p{Block=
  2512. Meroitic_Cursive}) (26)
  2513. \p{Meroitic_Hieroglyphs} \p{Script=Meroitic_Hieroglyphs} (Short:
  2514. \p{Mero}) (32)
  2515. \p{Miao} \p{Script=Miao} (NOT \p{Block=Miao}) (133)
  2516. X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
  2517. \p{Block=
  2518. Miscellaneous_Symbols_And_Arrows}) (256)
  2519. X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
  2520. (= \p{Block=
  2521. Miscellaneous_Mathematical_Symbols_A})
  2522. (48)
  2523. X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
  2524. (= \p{Block=
  2525. Miscellaneous_Mathematical_Symbols_B})
  2526. (128)
  2527. X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
  2528. (= \p{Block=
  2529. Miscellaneous_Symbols_And_Pictographs})
  2530. (768)
  2531. X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
  2532. Miscellaneous_Symbols}) (256)
  2533. X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
  2534. Miscellaneous_Technical}) (256)
  2535. X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
  2536. Miscellaneous_Mathematical_Symbols_A}
  2537. (Short: \p{InMiscMathSymbolsA}) (48)
  2538. X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
  2539. Miscellaneous_Mathematical_Symbols_B}
  2540. (Short: \p{InMiscMathSymbolsB}) (128)
  2541. X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
  2542. \p{InMiscSymbols}) (256)
  2543. X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
  2544. Miscellaneous_Symbols_And_Arrows}
  2545. (Short: \p{InMiscArrows}) (256)
  2546. X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
  2547. Miscellaneous_Symbols_And_Pictographs}
  2548. (Short: \p{InMiscPictographs}) (768)
  2549. X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
  2550. (Short: \p{InMiscTechnical}) (256)
  2551. \p{Mlym} \p{Malayalam} (= \p{Script=Malayalam})
  2552. (NOT \p{Block=Malayalam}) (99)
  2553. \p{Mn} \p{Nonspacing_Mark} (=
  2554. \p{General_Category=Nonspacing_Mark})
  2555. (1418)
  2556. \p{Modi} \p{Script=Modi} (NOT \p{Block=Modi}) (79)
  2557. \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
  2558. (Short: \p{Lm}) (248)
  2559. X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
  2560. Spacing_Modifier_Letters}) (80)
  2561. \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
  2562. (Short: \p{Sk}) (116)
  2563. X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  2564. \p{Mong} \p{Mongolian} (= \p{Script=Mongolian})
  2565. (NOT \p{Block=Mongolian}) (153)
  2566. \p{Mongolian} \p{Script=Mongolian} (Short: \p{Mong}; NOT
  2567. \p{Block=Mongolian}) (153)
  2568. \p{Mro} \p{Script=Mro} (NOT \p{Block=Mro}) (43)
  2569. \p{Mroo} \p{Mro} (= \p{Script=Mro}) (NOT \p{Block=
  2570. Mro}) (43)
  2571. \p{Mtei} \p{Meetei_Mayek} (= \p{Script=
  2572. Meetei_Mayek}) (NOT \p{Block=
  2573. Meetei_Mayek}) (79)
  2574. X \p{Music} \p{Musical_Symbols} (= \p{Block=
  2575. Musical_Symbols}) (256)
  2576. X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
  2577. \p{InMusic}) (256)
  2578. \p{Myanmar} \p{Script=Myanmar} (Short: \p{Mymr}; NOT
  2579. \p{Block=Myanmar}) (223)
  2580. X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
  2581. Myanmar_Extended_A}) (32)
  2582. X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
  2583. Myanmar_Extended_B}) (32)
  2584. X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
  2585. \p{InMyanmarExtA}) (32)
  2586. X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
  2587. \p{InMyanmarExtB}) (32)
  2588. \p{Mymr} \p{Myanmar} (= \p{Script=Myanmar}) (NOT
  2589. \p{Block=Myanmar}) (223)
  2590. \p{N} \pN \p{Number} (= \p{General_Category=Number})
  2591. (1346)
  2592. \p{Nabataean} \p{Script=Nabataean} (Short: \p{Nbat}; NOT
  2593. \p{Block=Nabataean}) (40)
  2594. \p{Narb} \p{Old_North_Arabian} (= \p{Script=
  2595. Old_North_Arabian}) (32)
  2596. X \p{NB} \p{No_Block} (= \p{Block=No_Block})
  2597. (857_776 plus all above-Unicode code
  2598. points)
  2599. \p{Nbat} \p{Nabataean} (= \p{Script=Nabataean})
  2600. (NOT \p{Block=Nabataean}) (40)
  2601. \p{NChar} \p{Noncharacter_Code_Point} (=
  2602. \p{Noncharacter_Code_Point=Y}) (66)
  2603. \p{NChar: *} \p{Noncharacter_Code_Point: *}
  2604. \p{Nd} \p{XPosixDigit} (= \p{General_Category=
  2605. Decimal_Number}) (540)
  2606. \p{New_Tai_Lue} \p{Script=New_Tai_Lue} (Short: \p{Talu};
  2607. NOT \p{Block=New_Tai_Lue}) (83)
  2608. \p{NFC_QC: *} \p{NFC_Quick_Check: *}
  2609. \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
  2610. \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110)
  2611. \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
  2612. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2613. (1120)
  2614. \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
  2615. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2616. (1120)
  2617. \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
  2618. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2619. (1_112_882 plus all above-Unicode code
  2620. points)
  2621. \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
  2622. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2623. (1_112_882 plus all above-Unicode code
  2624. points)
  2625. \p{NFD_QC: *} \p{NFD_Quick_Check: *}
  2626. \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
  2627. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2628. (13_232)
  2629. \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
  2630. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2631. (13_232)
  2632. \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
  2633. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2634. (1_100_880 plus all above-Unicode code
  2635. points)
  2636. \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
  2637. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2638. (1_100_880 plus all above-Unicode code
  2639. points)
  2640. \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
  2641. \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
  2642. \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110)
  2643. \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
  2644. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2645. (4793)
  2646. \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
  2647. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2648. (4793)
  2649. \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
  2650. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2651. (1_109_209 plus all above-Unicode code
  2652. points)
  2653. \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
  2654. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2655. (1_109_209 plus all above-Unicode code
  2656. points)
  2657. \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
  2658. \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
  2659. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2660. (16_893)
  2661. \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
  2662. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2663. (16_893)
  2664. \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
  2665. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2666. (1_097_219 plus all above-Unicode code
  2667. points)
  2668. \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
  2669. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2670. (1_097_219 plus all above-Unicode code
  2671. points)
  2672. \p{Nko} \p{Script=Nko} (NOT \p{Block=NKo}) (59)
  2673. \p{Nkoo} \p{Nko} (= \p{Script=Nko}) (NOT \p{Block=
  2674. NKo}) (59)
  2675. \p{Nl} \p{Letter_Number} (= \p{General_Category=
  2676. Letter_Number}) (236)
  2677. \p{No} \p{Other_Number} (= \p{General_Category=
  2678. Other_Number}) (570)
  2679. X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
  2680. (857_776 plus all above-Unicode code
  2681. points)
  2682. \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
  2683. \p{NChar}) (66)
  2684. \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
  2685. (1_114_046 plus all above-Unicode code
  2686. points)
  2687. \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
  2688. (66)
  2689. \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
  2690. (Short: \p{Mn}) (1418)
  2691. \p{Nt: *} \p{Numeric_Type: *}
  2692. \p{Number} \p{General_Category=Number} (Short: \p{N})
  2693. (1346)
  2694. X \p{Number_Forms} \p{Block=Number_Forms} (64)
  2695. \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (540)
  2696. \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (540)
  2697. \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
  2698. \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
  2699. \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_685 plus all
  2700. above-Unicode code points)
  2701. \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (759)
  2702. \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (759)
  2703. T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
  2704. T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (70)
  2705. T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
  2706. T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
  2707. T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
  2708. T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (5)
  2709. T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
  2710. T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (2)
  2711. T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
  2712. T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (1)
  2713. T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (11)
  2714. T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (5)
  2715. T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
  2716. T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
  2717. T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (11)
  2718. T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
  2719. T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
  2720. T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (6)
  2721. T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (6)
  2722. T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
  2723. T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (2)
  2724. T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
  2725. T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (113)
  2726. T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
  2727. T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (115)
  2728. T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
  2729. T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (117)
  2730. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
  2731. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (109)
  2732. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
  2733. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (105)
  2734. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
  2735. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (94)
  2736. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
  2737. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (93)
  2738. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
  2739. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (89)
  2740. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
  2741. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (93)
  2742. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (49)
  2743. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
  2744. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
  2745. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
  2746. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
  2747. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
  2748. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
  2749. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
  2750. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
  2751. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
  2752. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (27)
  2753. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
  2754. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
  2755. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
  2756. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
  2757. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
  2758. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
  2759. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
  2760. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
  2761. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
  2762. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (14)
  2763. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
  2764. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
  2765. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
  2766. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
  2767. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
  2768. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
  2769. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
  2770. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
  2771. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
  2772. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (14)
  2773. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
  2774. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
  2775. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
  2776. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
  2777. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
  2778. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
  2779. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
  2780. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
  2781. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
  2782. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (24)
  2783. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (9)
  2784. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (9)
  2785. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (9)
  2786. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (9)
  2787. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (26)
  2788. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (3)
  2789. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (4)
  2790. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (3)
  2791. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (13)
  2792. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (3)
  2793. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (3)
  2794. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (3)
  2795. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (4)
  2796. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (18)
  2797. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
  2798. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
  2799. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
  2800. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
  2801. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
  2802. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
  2803. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
  2804. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
  2805. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (8)
  2806. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
  2807. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
  2808. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
  2809. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
  2810. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
  2811. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
  2812. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
  2813. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
  2814. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
  2815. T \p{Numeric_Value: 216000} (= 2.16e+05) (Short: \p{Nv=216000}) (1)
  2816. T \p{Numeric_Value: 432000} (= 4.32e+05) (Short: \p{Nv=432000}) (1)
  2817. T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1)
  2818. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  2819. (3)
  2820. T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
  2821. 10000000000}) (1)
  2822. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
  2823. 1000000000000}) (2)
  2824. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_685 plus all
  2825. above-Unicode code points)
  2826. \p{Nv: *} \p{Numeric_Value: *}
  2827. X \p{OCR} \p{Optical_Character_Recognition} (=
  2828. \p{Block=Optical_Character_Recognition})
  2829. (32)
  2830. \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
  2831. \p{Block=Ogham}) (29)
  2832. \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
  2833. \p{Block=Ogham}) (29)
  2834. \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
  2835. \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
  2836. \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
  2837. NOT \p{Block=Old_Italic}) (36)
  2838. \p{Old_North_Arabian} \p{Script=Old_North_Arabian} (Short:
  2839. \p{Narb}) (32)
  2840. \p{Old_Permic} \p{Script=Old_Permic} (Short: \p{Perm};
  2841. NOT \p{Block=Old_Permic}) (43)
  2842. \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
  2843. NOT \p{Block=Old_Persian}) (50)
  2844. \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
  2845. \p{Sarb}) (32)
  2846. \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
  2847. NOT \p{Block=Old_Turkic}) (73)
  2848. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  2849. (Short: \p{Ps}) (75)
  2850. X \p{Optical_Character_Recognition} \p{Block=
  2851. Optical_Character_Recognition} (Short:
  2852. \p{InOCR}) (32)
  2853. \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
  2854. \p{Block=Oriya}) (90)
  2855. \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
  2856. (NOT \p{Block=Old_Turkic}) (73)
  2857. X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
  2858. \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
  2859. \p{Block=Oriya}) (90)
  2860. \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
  2861. \p{Block=Osmanya}) (40)
  2862. \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
  2863. \p{Block=Osmanya}) (40)
  2864. \p{Other} \p{General_Category=Other} (Short: \p{C})
  2865. (1_001_306 plus all above-Unicode code
  2866. points)
  2867. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  2868. \p{Lo}) (99_115)
  2869. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  2870. \p{No}) (570)
  2871. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  2872. (Short: \p{Po}) (484)
  2873. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  2874. \p{So}) (5082)
  2875. \p{P} \pP \p{Punct} (= \p{General_Category=
  2876. Punctuation}) (NOT
  2877. \p{General_Punctuation}) (688)
  2878. \p{Pahawh_Hmong} \p{Script=Pahawh_Hmong} (Short: \p{Hmng};
  2879. NOT \p{Block=Pahawh_Hmong}) (127)
  2880. \p{Palm} \p{Palmyrene} (= \p{Script=Palmyrene}) (32)
  2881. \p{Palmyrene} \p{Script=Palmyrene} (Short: \p{Palm}) (32)
  2882. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  2883. (Short: \p{Zp}) (1)
  2884. \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
  2885. Y}) (2760)
  2886. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  2887. \p{Pat_WS} \p{Pattern_White_Space} (=
  2888. \p{Pattern_White_Space=Y}) (11)
  2889. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  2890. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  2891. (2760)
  2892. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  2893. (1_111_352 plus all above-Unicode code
  2894. points)
  2895. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
  2896. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  2897. \p{PatWS}) (11)
  2898. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  2899. (1_114_101 plus all above-Unicode code
  2900. points)
  2901. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
  2902. \p{Pau_Cin_Hau} \p{Script=Pau_Cin_Hau} (Short: \p{Pauc};
  2903. NOT \p{Block=Pau_Cin_Hau}) (57)
  2904. \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script=Pau_Cin_Hau})
  2905. (NOT \p{Block=Pau_Cin_Hau}) (57)
  2906. \p{Pc} \p{Connector_Punctuation} (=
  2907. \p{General_Category=
  2908. Connector_Punctuation}) (10)
  2909. \p{Pd} \p{Dash_Punctuation} (=
  2910. \p{General_Category=Dash_Punctuation})
  2911. (24)
  2912. \p{Pe} \p{Close_Punctuation} (=
  2913. \p{General_Category=Close_Punctuation})
  2914. (73)
  2915. \p{PerlSpace} \p{PosixSpace} (6)
  2916. \p{PerlWord} \p{PosixWord} (63)
  2917. \p{Perm} \p{Old_Permic} (= \p{Script=Old_Permic})
  2918. (NOT \p{Block=Old_Permic}) (43)
  2919. \p{Pf} \p{Final_Punctuation} (=
  2920. \p{General_Category=Final_Punctuation})
  2921. (10)
  2922. \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
  2923. \p{Block=Phags_Pa}) (56)
  2924. \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
  2925. \p{Block=Phags_Pa}) (56)
  2926. X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
  2927. Phaistos_Disc}) (48)
  2928. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
  2929. \p{InPhaistos}) (48)
  2930. \p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script=
  2931. Inscriptional_Pahlavi}) (NOT \p{Block=
  2932. Inscriptional_Pahlavi}) (27)
  2933. \p{Phlp} \p{Psalter_Pahlavi} (= \p{Script=
  2934. Psalter_Pahlavi}) (NOT \p{Block=
  2935. Psalter_Pahlavi}) (29)
  2936. \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
  2937. (NOT \p{Block=Phoenician}) (29)
  2938. \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
  2939. NOT \p{Block=Phoenician}) (29)
  2940. X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
  2941. Phonetic_Extensions}) (128)
  2942. X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
  2943. \p{Block=
  2944. Phonetic_Extensions_Supplement}) (64)
  2945. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
  2946. \p{InPhoneticExt}) (128)
  2947. X \p{Phonetic_Extensions_Supplement} \p{Block=
  2948. Phonetic_Extensions_Supplement} (Short:
  2949. \p{InPhoneticExtSup}) (64)
  2950. \p{Pi} \p{Initial_Punctuation} (=
  2951. \p{General_Category=
  2952. Initial_Punctuation}) (12)
  2953. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  2954. \p{Plrd} \p{Miao} (= \p{Script=Miao}) (NOT
  2955. \p{Block=Miao}) (133)
  2956. \p{Po} \p{Other_Punctuation} (=
  2957. \p{General_Category=Other_Punctuation})
  2958. (484)
  2959. \p{PosixAlnum} [A-Za-z0-9] (62)
  2960. \p{PosixAlpha} [A-Za-z] (52)
  2961. \p{PosixBlank} \t and ' ' (2)
  2962. \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
  2963. ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
  2964. FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
  2965. NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
  2966. GS, RS, US, and DEL (33)
  2967. \p{PosixDigit} [0-9] (10)
  2968. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
  2969. (94)
  2970. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  2971. \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  2972. (95)
  2973. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  2974. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  2975. vertical tab) (Short: \p{PerlSpace}) (6)
  2976. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  2977. \p{PosixWord} \w, restricted to ASCII = [A-Za-z0-9_]
  2978. (Short: \p{PerlWord}) (63)
  2979. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
  2980. \p{AHex}) (22)
  2981. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  2982. extension) (33_979)
  2983. T \p{Present_In: 2.0} Code point's usage introduced in version
  2984. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  2985. extension) (178_500)
  2986. T \p{Present_In: 2.1} Code point's usage introduced in version
  2987. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  2988. extension) (178_502)
  2989. T \p{Present_In: 3.0} Code point's usage introduced in version
  2990. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  2991. extension) (188_809)
  2992. T \p{Present_In: 3.1} Code point's usage introduced in version
  2993. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  2994. extension) (233_787)
  2995. T \p{Present_In: 3.2} Code point's usage introduced in version
  2996. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  2997. extension) (234_803)
  2998. T \p{Present_In: 4.0} Code point's usage introduced in version
  2999. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  3000. extension) (236_029)
  3001. T \p{Present_In: 4.1} Code point's usage introduced in version
  3002. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  3003. extension) (237_302)
  3004. T \p{Present_In: 5.0} Code point's usage introduced in version
  3005. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  3006. extension) (238_671)
  3007. T \p{Present_In: 5.1} Code point's usage introduced in version
  3008. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  3009. extension) (240_295)
  3010. T \p{Present_In: 5.2} Code point's usage introduced in version
  3011. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  3012. extension) (246_943)
  3013. T \p{Present_In: 6.0} Code point's usage introduced in version
  3014. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  3015. extension) (249_031)
  3016. T \p{Present_In: 6.1} Code point's usage introduced in version
  3017. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  3018. extension) (249_763)
  3019. T \p{Present_In: 6.2} Code point's usage introduced in version
  3020. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  3021. extension) (249_764)
  3022. T \p{Present_In: 6.3} Code point's usage introduced in version
  3023. 6.3 or earlier (Short: \p{In=6.3}) (Perl
  3024. extension) (249_769)
  3025. T \p{Present_In: 7.0} Code point's usage introduced in version
  3026. 7.0 or earlier (Short: \p{In=7.0}) (Perl
  3027. extension) (252_603)
  3028. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  3029. Unassigned}) (Perl extension) (861_509
  3030. plus all above-Unicode code points)
  3031. \p{Print} \p{XPosixPrint} (250_422)
  3032. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  3033. \p{Co}; NOT \p{Private_Use_Area})
  3034. (137_468)
  3035. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  3036. \p{InPUA}) (6400)
  3037. \p{Prti} \p{Inscriptional_Parthian} (= \p{Script=
  3038. Inscriptional_Parthian}) (NOT \p{Block=
  3039. Inscriptional_Parthian}) (30)
  3040. \p{Ps} \p{Open_Punctuation} (=
  3041. \p{General_Category=Open_Punctuation})
  3042. (75)
  3043. \p{Psalter_Pahlavi} \p{Script=Psalter_Pahlavi} (Short:
  3044. \p{Phlp}; NOT \p{Block=Psalter_Pahlavi})
  3045. (29)
  X \p{PUA}  \p{Private_Use_Area} (= \p{Block=Private_Use_Area}) (6400)
    \p{Punct}  \p{General_Category=Punctuation} (Short: \p{P}; NOT \p{General_Punctuation}) (688)
    \p{Punctuation}  \p{Punct} (= \p{General_Category=Punctuation}) (NOT \p{General_Punctuation}) (688)
    \p{Qaac}  \p{Coptic} (= \p{Script=Coptic}) (NOT \p{Block=Coptic}) (137)
    \p{Qaai}  \p{Inherited} (= \p{Script=Inherited}) (563)
    \p{QMark}  \p{Quotation_Mark} (= \p{Quotation_Mark=Y}) (30)
    \p{QMark: *}  \p{Quotation_Mark: *}
    \p{Quotation_Mark}  \p{Quotation_Mark=Y} (Short: \p{QMark}) (30)
    \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082 plus all above-Unicode code points)
    \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30)
    \p{Radical}  \p{Radical=Y} (329)
    \p{Radical: N*}  (Single: \P{Radical}) (1_113_783 plus all above-Unicode code points)
    \p{Radical: Y*}  (Single: \p{Radical}) (329)
    \p{Rejang}  \p{Script=Rejang} (Short: \p{Rjng}; NOT \p{Block=Rejang}) (37)
    \p{Rjng}  \p{Rejang} (= \p{Script=Rejang}) (NOT \p{Block=Rejang}) (37)
  X \p{Rumi}  \p{Rumi_Numeral_Symbols} (= \p{Block=Rumi_Numeral_Symbols}) (32)
  X \p{Rumi_Numeral_Symbols}  \p{Block=Rumi_Numeral_Symbols} (Short: \p{InRumi}) (32)
    \p{Runic}  \p{Script=Runic} (Short: \p{Runr}; NOT \p{Block=Runic}) (86)
    \p{Runr}  \p{Runic} (= \p{Script=Runic}) (NOT \p{Block=Runic}) (86)
    \p{S}  \pS  \p{Symbol} (= \p{General_Category=Symbol}) (6198)
    \p{Samaritan}  \p{Script=Samaritan} (Short: \p{Samr}; NOT \p{Block=Samaritan}) (61)
    \p{Samr}  \p{Samaritan} (= \p{Script=Samaritan}) (NOT \p{Block=Samaritan}) (61)
    \p{Sarb}  \p{Old_South_Arabian} (= \p{Script=Old_South_Arabian}) (32)
    \p{Saur}  \p{Saurashtra} (= \p{Script=Saurashtra}) (NOT \p{Block=Saurashtra}) (81)
    \p{Saurashtra}  \p{Script=Saurashtra} (Short: \p{Saur}; NOT \p{Block=Saurashtra}) (81)
    \p{SB: *}  \p{Sentence_Break: *}
    \p{Sc}  \p{Currency_Symbol} (= \p{General_Category=Currency_Symbol}) (52)
    \p{Sc: *}  \p{Script: *}
    \p{Script: Aghb}  \p{Script=Caucasian_Albanian} (53)
    \p{Script: Arab}  \p{Script=Arabic} (1244)
    \p{Script: Arabic}  (Short: \p{Sc=Arab}, \p{Arab}) (1244)
    \p{Script: Armenian}  (Short: \p{Sc=Armn}, \p{Armn}) (93)
    \p{Script: Armi}  \p{Script=Imperial_Aramaic} (31)
    \p{Script: Armn}  \p{Script=Armenian} (93)
    \p{Script: Avestan}  (Short: \p{Sc=Avst}, \p{Avst}) (61)
    \p{Script: Avst}  \p{Script=Avestan} (61)
    \p{Script: Bali}  \p{Script=Balinese} (121)
    \p{Script: Balinese}  (Short: \p{Sc=Bali}, \p{Bali}) (121)
    \p{Script: Bamu}  \p{Script=Bamum} (657)
    \p{Script: Bamum}  (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
    \p{Script: Bass}  \p{Script=Bassa_Vah} (36)
    \p{Script: Bassa_Vah}  (Short: \p{Sc=Bass}, \p{Bass}) (36)
    \p{Script: Batak}  (Short: \p{Sc=Batk}, \p{Batk}) (56)
    \p{Script: Batk}  \p{Script=Batak} (56)
    \p{Script: Beng}  \p{Script=Bengali} (93)
    \p{Script: Bengali}  (Short: \p{Sc=Beng}, \p{Beng}) (93)
    \p{Script: Bopo}  \p{Script=Bopomofo} (70)
    \p{Script: Bopomofo}  (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
    \p{Script: Brah}  \p{Script=Brahmi} (109)
    \p{Script: Brahmi}  (Short: \p{Sc=Brah}, \p{Brah}) (109)
    \p{Script: Brai}  \p{Script=Braille} (256)
    \p{Script: Braille}  (Short: \p{Sc=Brai}, \p{Brai}) (256)
    \p{Script: Bugi}  \p{Script=Buginese} (30)
    \p{Script: Buginese}  (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
    \p{Script: Buhd}  \p{Script=Buhid} (20)
    \p{Script: Buhid}  (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
    \p{Script: Cakm}  \p{Script=Chakma} (67)
    \p{Script: Canadian_Aboriginal}  (Short: \p{Sc=Cans}, \p{Cans}) (710)
    \p{Script: Cans}  \p{Script=Canadian_Aboriginal} (710)
    \p{Script: Cari}  \p{Script=Carian} (49)
    \p{Script: Carian}  (Short: \p{Sc=Cari}, \p{Cari}) (49)
    \p{Script: Caucasian_Albanian}  (Short: \p{Sc=Aghb}, \p{Aghb}) (53)
    \p{Script: Chakma}  (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
    \p{Script: Cham}  (Short: \p{Sc=Cham}, \p{Cham}) (83)
    \p{Script: Cher}  \p{Script=Cherokee} (85)
    \p{Script: Cherokee}  (Short: \p{Sc=Cher}, \p{Cher}) (85)
    \p{Script: Common}  (Short: \p{Sc=Zyyy}, \p{Zyyy}) (7129)
    \p{Script: Copt}  \p{Script=Coptic} (137)
    \p{Script: Coptic}  (Short: \p{Sc=Copt}, \p{Copt}) (137)
    \p{Script: Cprt}  \p{Script=Cypriot} (55)
    \p{Script: Cuneiform}  (Short: \p{Sc=Xsux}, \p{Xsux}) (1037)
    \p{Script: Cypriot}  (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
    \p{Script: Cyrillic}  (Short: \p{Sc=Cyrl}, \p{Cyrl}) (431)
    \p{Script: Cyrl}  \p{Script=Cyrillic} (431)
    \p{Script: Deseret}  (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
    \p{Script: Deva}  \p{Script=Devanagari} (152)
    \p{Script: Devanagari}  (Short: \p{Sc=Deva}, \p{Deva}) (152)
    \p{Script: Dsrt}  \p{Script=Deseret} (80)
    \p{Script: Dupl}  \p{Script=Duployan} (143)
    \p{Script: Duployan}  (Short: \p{Sc=Dupl}, \p{Dupl}) (143)
    \p{Script: Egyp}  \p{Script=Egyptian_Hieroglyphs} (1071)
    \p{Script: Egyptian_Hieroglyphs}  (Short: \p{Sc=Egyp}, \p{Egyp}) (1071)
    \p{Script: Elba}  \p{Script=Elbasan} (40)
    \p{Script: Elbasan}  (Short: \p{Sc=Elba}, \p{Elba}) (40)
    \p{Script: Ethi}  \p{Script=Ethiopic} (495)
    \p{Script: Ethiopic}  (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
    \p{Script: Geor}  \p{Script=Georgian} (127)
    \p{Script: Georgian}  (Short: \p{Sc=Geor}, \p{Geor}) (127)
    \p{Script: Glag}  \p{Script=Glagolitic} (94)
    \p{Script: Glagolitic}  (Short: \p{Sc=Glag}, \p{Glag}) (94)
    \p{Script: Goth}  \p{Script=Gothic} (27)
    \p{Script: Gothic}  (Short: \p{Sc=Goth}, \p{Goth}) (27)
    \p{Script: Gran}  \p{Script=Grantha} (83)
    \p{Script: Grantha}  (Short: \p{Sc=Gran}, \p{Gran}) (83)
    \p{Script: Greek}  (Short: \p{Sc=Grek}, \p{Grek}) (516)
    \p{Script: Grek}  \p{Script=Greek} (516)
    \p{Script: Gujarati}  (Short: \p{Sc=Gujr}, \p{Gujr}) (84)
    \p{Script: Gujr}  \p{Script=Gujarati} (84)
    \p{Script: Gurmukhi}  (Short: \p{Sc=Guru}, \p{Guru}) (79)
    \p{Script: Guru}  \p{Script=Gurmukhi} (79)
    \p{Script: Han}  (Short: \p{Sc=Han}, \p{Han}) (75_963)
    \p{Script: Hang}  \p{Script=Hangul} (11_739)
    \p{Script: Hangul}  (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
    \p{Script: Hani}  \p{Script=Han} (75_963)
    \p{Script: Hano}  \p{Script=Hanunoo} (21)
    \p{Script: Hanunoo}  (Short: \p{Sc=Hano}, \p{Hano}) (21)
    \p{Script: Hebr}  \p{Script=Hebrew} (133)
    \p{Script: Hebrew}  (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
    \p{Script: Hira}  \p{Script=Hiragana} (91)
    \p{Script: Hiragana}  (Short: \p{Sc=Hira}, \p{Hira}) (91)
    \p{Script: Hmng}  \p{Script=Pahawh_Hmong} (127)
    \p{Script: Imperial_Aramaic}  (Short: \p{Sc=Armi}, \p{Armi}) (31)
    \p{Script: Inherited}  (Short: \p{Sc=Zinh}, \p{Zinh}) (563)
    \p{Script: Inscriptional_Pahlavi}  (Short: \p{Sc=Phli}, \p{Phli}) (27)
    \p{Script: Inscriptional_Parthian}  (Short: \p{Sc=Prti}, \p{Prti}) (30)
    \p{Script: Ital}  \p{Script=Old_Italic} (36)
    \p{Script: Java}  \p{Script=Javanese} (90)
    \p{Script: Javanese}  (Short: \p{Sc=Java}, \p{Java}) (90)
    \p{Script: Kaithi}  (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
    \p{Script: Kali}  \p{Script=Kayah_Li} (47)
    \p{Script: Kana}  \p{Script=Katakana} (300)
    \p{Script: Kannada}  (Short: \p{Sc=Knda}, \p{Knda}) (87)
    \p{Script: Katakana}  (Short: \p{Sc=Kana}, \p{Kana}) (300)
    \p{Script: Kayah_Li}  (Short: \p{Sc=Kali}, \p{Kali}) (47)
    \p{Script: Khar}  \p{Script=Kharoshthi} (65)
    \p{Script: Kharoshthi}  (Short: \p{Sc=Khar}, \p{Khar}) (65)
    \p{Script: Khmer}  (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
    \p{Script: Khmr}  \p{Script=Khmer} (146)
    \p{Script: Khoj}  \p{Script=Khojki} (61)
    \p{Script: Khojki}  (Short: \p{Sc=Khoj}, \p{Khoj}) (61)
    \p{Script: Khudawadi}  (Short: \p{Sc=Sind}, \p{Sind}) (69)
    \p{Script: Knda}  \p{Script=Kannada} (87)
    \p{Script: Kthi}  \p{Script=Kaithi} (66)
    \p{Script: Lana}  \p{Script=Tai_Tham} (127)
    \p{Script: Lao}  (Short: \p{Sc=Lao}, \p{Lao}) (67)
    \p{Script: Laoo}  \p{Script=Lao} (67)
    \p{Script: Latin}  (Short: \p{Sc=Latn}, \p{Latn}) (1338)
    \p{Script: Latn}  \p{Script=Latin} (1338)
    \p{Script: Lepc}  \p{Script=Lepcha} (74)
    \p{Script: Lepcha}  (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
    \p{Script: Limb}  \p{Script=Limbu} (68)
    \p{Script: Limbu}  (Short: \p{Sc=Limb}, \p{Limb}) (68)
    \p{Script: Lina}  \p{Script=Linear_A} (341)
    \p{Script: Linb}  \p{Script=Linear_B} (211)
    \p{Script: Linear_A}  (Short: \p{Sc=Lina}, \p{Lina}) (341)
    \p{Script: Linear_B}  (Short: \p{Sc=Linb}, \p{Linb}) (211)
    \p{Script: Lisu}  (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
    \p{Script: Lyci}  \p{Script=Lycian} (29)
    \p{Script: Lycian}  (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
    \p{Script: Lydi}  \p{Script=Lydian} (27)
    \p{Script: Lydian}  (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
    \p{Script: Mahajani}  (Short: \p{Sc=Mahj}, \p{Mahj}) (39)
    \p{Script: Mahj}  \p{Script=Mahajani} (39)
    \p{Script: Malayalam}  (Short: \p{Sc=Mlym}, \p{Mlym}) (99)
    \p{Script: Mand}  \p{Script=Mandaic} (29)
    \p{Script: Mandaic}  (Short: \p{Sc=Mand}, \p{Mand}) (29)
    \p{Script: Mani}  \p{Script=Manichaean} (51)
    \p{Script: Manichaean}  (Short: \p{Sc=Mani}, \p{Mani}) (51)
    \p{Script: Meetei_Mayek}  (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
    \p{Script: Mend}  \p{Script=Mende_Kikakui} (213)
    \p{Script: Mende_Kikakui}  (Short: \p{Sc=Mend}, \p{Mend}) (213)
    \p{Script: Merc}  \p{Script=Meroitic_Cursive} (26)
    \p{Script: Mero}  \p{Script=Meroitic_Hieroglyphs} (32)
    \p{Script: Meroitic_Cursive}  (Short: \p{Sc=Merc}, \p{Merc}) (26)
    \p{Script: Meroitic_Hieroglyphs}  (Short: \p{Sc=Mero}, \p{Mero}) (32)
    \p{Script: Miao}  (Short: \p{Sc=Miao}, \p{Miao}) (133)
    \p{Script: Mlym}  \p{Script=Malayalam} (99)
    \p{Script: Modi}  (Short: \p{Sc=Modi}, \p{Modi}) (79)
    \p{Script: Mong}  \p{Script=Mongolian} (153)
    \p{Script: Mongolian}  (Short: \p{Sc=Mong}, \p{Mong}) (153)
    \p{Script: Mro}  (Short: \p{Sc=Mro}, \p{Mro}) (43)
    \p{Script: Mroo}  \p{Script=Mro} (43)
    \p{Script: Mtei}  \p{Script=Meetei_Mayek} (79)
    \p{Script: Myanmar}  (Short: \p{Sc=Mymr}, \p{Mymr}) (223)
    \p{Script: Mymr}  \p{Script=Myanmar} (223)
    \p{Script: Nabataean}  (Short: \p{Sc=Nbat}, \p{Nbat}) (40)
    \p{Script: Narb}  \p{Script=Old_North_Arabian} (32)
    \p{Script: Nbat}  \p{Script=Nabataean} (40)
    \p{Script: New_Tai_Lue}  (Short: \p{Sc=Talu}, \p{Talu}) (83)
    \p{Script: Nko}  (Short: \p{Sc=Nko}, \p{Nko}) (59)
    \p{Script: Nkoo}  \p{Script=Nko} (59)
    \p{Script: Ogam}  \p{Script=Ogham} (29)
    \p{Script: Ogham}  (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
    \p{Script: Ol_Chiki}  (Short: \p{Sc=Olck}, \p{Olck}) (48)
    \p{Script: Olck}  \p{Script=Ol_Chiki} (48)
    \p{Script: Old_Italic}  (Short: \p{Sc=Ital}, \p{Ital}) (36)
    \p{Script: Old_North_Arabian}  (Short: \p{Sc=Narb}, \p{Narb}) (32)
    \p{Script: Old_Permic}  (Short: \p{Sc=Perm}, \p{Perm}) (43)
    \p{Script: Old_Persian}  (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
    \p{Script: Old_South_Arabian}  (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
    \p{Script: Old_Turkic}  (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
    \p{Script: Oriya}  (Short: \p{Sc=Orya}, \p{Orya}) (90)
    \p{Script: Orkh}  \p{Script=Old_Turkic} (73)
    \p{Script: Orya}  \p{Script=Oriya} (90)
    \p{Script: Osma}  \p{Script=Osmanya} (40)
    \p{Script: Osmanya}  (Short: \p{Sc=Osma}, \p{Osma}) (40)
    \p{Script: Pahawh_Hmong}  (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
    \p{Script: Palm}  \p{Script=Palmyrene} (32)
    \p{Script: Palmyrene}  (Short: \p{Sc=Palm}, \p{Palm}) (32)
    \p{Script: Pau_Cin_Hau}  (Short: \p{Sc=Pauc}, \p{Pauc}) (57)
    \p{Script: Pauc}  \p{Script=Pau_Cin_Hau} (57)
    \p{Script: Perm}  \p{Script=Old_Permic} (43)
    \p{Script: Phag}  \p{Script=Phags_Pa} (56)
    \p{Script: Phags_Pa}  (Short: \p{Sc=Phag}, \p{Phag}) (56)
    \p{Script: Phli}  \p{Script=Inscriptional_Pahlavi} (27)
    \p{Script: Phlp}  \p{Script=Psalter_Pahlavi} (29)
    \p{Script: Phnx}  \p{Script=Phoenician} (29)
    \p{Script: Phoenician}  (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
    \p{Script: Plrd}  \p{Script=Miao} (133)
    \p{Script: Prti}  \p{Script=Inscriptional_Parthian} (30)
    \p{Script: Psalter_Pahlavi}  (Short: \p{Sc=Phlp}, \p{Phlp}) (29)
    \p{Script: Qaac}  \p{Script=Coptic} (137)
    \p{Script: Qaai}  \p{Script=Inherited} (563)
    \p{Script: Rejang}  (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
    \p{Script: Rjng}  \p{Script=Rejang} (37)
    \p{Script: Runic}  (Short: \p{Sc=Runr}, \p{Runr}) (86)
    \p{Script: Runr}  \p{Script=Runic} (86)
    \p{Script: Samaritan}  (Short: \p{Sc=Samr}, \p{Samr}) (61)
    \p{Script: Samr}  \p{Script=Samaritan} (61)
    \p{Script: Sarb}  \p{Script=Old_South_Arabian} (32)
    \p{Script: Saur}  \p{Script=Saurashtra} (81)
    \p{Script: Saurashtra}  (Short: \p{Sc=Saur}, \p{Saur}) (81)
    \p{Script: Sharada}  (Short: \p{Sc=Shrd}, \p{Shrd}) (85)
    \p{Script: Shavian}  (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
    \p{Script: Shaw}  \p{Script=Shavian} (48)
    \p{Script: Shrd}  \p{Script=Sharada} (85)
    \p{Script: Sidd}  \p{Script=Siddham} (72)
    \p{Script: Siddham}  (Short: \p{Sc=Sidd}, \p{Sidd}) (72)
    \p{Script: Sind}  \p{Script=Khudawadi} (69)
    \p{Script: Sinh}  \p{Script=Sinhala} (110)
    \p{Script: Sinhala}  (Short: \p{Sc=Sinh}, \p{Sinh}) (110)
    \p{Script: Sora}  \p{Script=Sora_Sompeng} (35)
    \p{Script: Sora_Sompeng}  (Short: \p{Sc=Sora}, \p{Sora}) (35)
    \p{Script: Sund}  \p{Script=Sundanese} (72)
    \p{Script: Sundanese}  (Short: \p{Sc=Sund}, \p{Sund}) (72)
    \p{Script: Sylo}  \p{Script=Syloti_Nagri} (44)
    \p{Script: Syloti_Nagri}  (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
    \p{Script: Syrc}  \p{Script=Syriac} (77)
    \p{Script: Syriac}  (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
    \p{Script: Tagalog}  (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
    \p{Script: Tagb}  \p{Script=Tagbanwa} (18)
    \p{Script: Tagbanwa}  (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
    \p{Script: Tai_Le}  (Short: \p{Sc=Tale}, \p{Tale}) (35)
    \p{Script: Tai_Tham}  (Short: \p{Sc=Lana}, \p{Lana}) (127)
    \p{Script: Tai_Viet}  (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
    \p{Script: Takr}  \p{Script=Takri} (66)
    \p{Script: Takri}  (Short: \p{Sc=Takr}, \p{Takr}) (66)
    \p{Script: Tale}  \p{Script=Tai_Le} (35)
    \p{Script: Talu}  \p{Script=New_Tai_Lue} (83)
    \p{Script: Tamil}  (Short: \p{Sc=Taml}, \p{Taml}) (72)
    \p{Script: Taml}  \p{Script=Tamil} (72)
    \p{Script: Tavt}  \p{Script=Tai_Viet} (72)
    \p{Script: Telu}  \p{Script=Telugu} (95)
    \p{Script: Telugu}  (Short: \p{Sc=Telu}, \p{Telu}) (95)
    \p{Script: Tfng}  \p{Script=Tifinagh} (59)
    \p{Script: Tglg}  \p{Script=Tagalog} (20)
    \p{Script: Thaa}  \p{Script=Thaana} (50)
    \p{Script: Thaana}  (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
    \p{Script: Thai}  (Short: \p{Sc=Thai}, \p{Thai}) (86)
    \p{Script: Tibetan}  (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
    \p{Script: Tibt}  \p{Script=Tibetan} (207)
    \p{Script: Tifinagh}  (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
    \p{Script: Tirh}  \p{Script=Tirhuta} (82)
    \p{Script: Tirhuta}  (Short: \p{Sc=Tirh}, \p{Tirh}) (82)
    \p{Script: Ugar}  \p{Script=Ugaritic} (31)
    \p{Script: Ugaritic}  (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
    \p{Script: Unknown}  (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_001_091 plus all above-Unicode code points)
    \p{Script: Vai}  (Short: \p{Sc=Vai}, \p{Vai}) (300)
    \p{Script: Vaii}  \p{Script=Vai} (300)
    \p{Script: Wara}  \p{Script=Warang_Citi} (84)
    \p{Script: Warang_Citi}  (Short: \p{Sc=Wara}, \p{Wara}) (84)
    \p{Script: Xpeo}  \p{Script=Old_Persian} (50)
    \p{Script: Xsux}  \p{Script=Cuneiform} (1037)
    \p{Script: Yi}  (Short: \p{Sc=Yi}, \p{Yi}) (1220)
    \p{Script: Yiii}  \p{Script=Yi} (1220)
    \p{Script: Zinh}  \p{Script=Inherited} (563)
    \p{Script: Zyyy}  \p{Script=Common} (7129)
    \p{Script: Zzzz}  \p{Script=Unknown} (1_001_091 plus all above-Unicode code points)
    \p{Script_Extensions: Aghb}  \p{Script_Extensions=Caucasian_Albanian} (53)
    \p{Script_Extensions: Arab}  \p{Script_Extensions=Arabic} (1298)
    \p{Script_Extensions: Arabic}  (Short: \p{Scx=Arab}) (1298)
    \p{Script_Extensions: Armenian}  (Short: \p{Scx=Armn}) (94)
    \p{Script_Extensions: Armi}  \p{Script_Extensions=Imperial_Aramaic} (31)
    \p{Script_Extensions: Armn}  \p{Script_Extensions=Armenian} (94)
    \p{Script_Extensions: Avestan}  (Short: \p{Scx=Avst}) (61)
    \p{Script_Extensions: Avst}  \p{Script_Extensions=Avestan} (61)
    \p{Script_Extensions: Bali}  \p{Script_Extensions=Balinese} (121)
    \p{Script_Extensions: Balinese}  (Short: \p{Scx=Bali}) (121)
    \p{Script_Extensions: Bamu}  \p{Script_Extensions=Bamum} (657)
    \p{Script_Extensions: Bamum}  (Short: \p{Scx=Bamu}) (657)
    \p{Script_Extensions: Bass}  \p{Script_Extensions=Bassa_Vah} (36)
    \p{Script_Extensions: Bassa_Vah}  (Short: \p{Scx=Bass}) (36)
    \p{Script_Extensions: Batak}  (Short: \p{Scx=Batk}) (56)
    \p{Script_Extensions: Batk}  \p{Script_Extensions=Batak} (56)
    \p{Script_Extensions: Beng}  \p{Script_Extensions=Bengali} (95)
    \p{Script_Extensions: Bengali}  (Short: \p{Scx=Beng}) (95)
    \p{Script_Extensions: Bopo}  \p{Script_Extensions=Bopomofo} (306)
    \p{Script_Extensions: Bopomofo}  (Short: \p{Scx=Bopo}) (306)
    \p{Script_Extensions: Brah}  \p{Script_Extensions=Brahmi} (109)
    \p{Script_Extensions: Brahmi}  (Short: \p{Scx=Brah}) (109)
    \p{Script_Extensions: Brai}  \p{Script_Extensions=Braille} (256)
    \p{Script_Extensions: Braille}  (Short: \p{Scx=Brai}) (256)
    \p{Script_Extensions: Bugi}  \p{Script_Extensions=Buginese} (31)
    \p{Script_Extensions: Buginese}  (Short: \p{Scx=Bugi}) (31)
    \p{Script_Extensions: Buhd}  \p{Script_Extensions=Buhid} (22)
    \p{Script_Extensions: Buhid}  (Short: \p{Scx=Buhd}) (22)
    \p{Script_Extensions: Cakm}  \p{Script_Extensions=Chakma} (87)
    \p{Script_Extensions: Canadian_Aboriginal}  (Short: \p{Scx=Cans}) (710)
    \p{Script_Extensions: Cans}  \p{Script_Extensions=Canadian_Aboriginal} (710)
    \p{Script_Extensions: Cari}  \p{Script_Extensions=Carian} (49)
    \p{Script_Extensions: Carian}  (Short: \p{Scx=Cari}) (49)
    \p{Script_Extensions: Caucasian_Albanian}  (Short: \p{Scx=Aghb}) (53)
    \p{Script_Extensions: Chakma}  (Short: \p{Scx=Cakm}) (87)
    \p{Script_Extensions: Cham}  (Short: \p{Scx=Cham}) (83)
    \p{Script_Extensions: Cher}  \p{Script_Extensions=Cherokee} (85)
    \p{Script_Extensions: Cherokee}  (Short: \p{Scx=Cher}) (85)
    \p{Script_Extensions: Common}  (Short: \p{Scx=Zyyy}) (6741)
    \p{Script_Extensions: Copt}  \p{Script_Extensions=Coptic} (165)
    \p{Script_Extensions: Coptic}  (Short: \p{Scx=Copt}) (165)
    \p{Script_Extensions: Cprt}  \p{Script_Extensions=Cypriot} (112)
    \p{Script_Extensions: Cuneiform}  (Short: \p{Scx=Xsux}) (1037)
    \p{Script_Extensions: Cypriot}  (Short: \p{Scx=Cprt}) (112)
    \p{Script_Extensions: Cyrillic}  (Short: \p{Scx=Cyrl}) (433)
    \p{Script_Extensions: Cyrl}  \p{Script_Extensions=Cyrillic} (433)
    \p{Script_Extensions: Deseret}  (Short: \p{Scx=Dsrt}) (80)
    \p{Script_Extensions: Deva}  \p{Script_Extensions=Devanagari} (196)
    \p{Script_Extensions: Devanagari}  (Short: \p{Scx=Deva}) (196)
    \p{Script_Extensions: Dsrt}  \p{Script_Extensions=Deseret} (80)
    \p{Script_Extensions: Dupl}  \p{Script_Extensions=Duployan} (147)
    \p{Script_Extensions: Duployan}  (Short: \p{Scx=Dupl}) (147)
    \p{Script_Extensions: Egyp}  \p{Script_Extensions=Egyptian_Hieroglyphs} (1071)
    \p{Script_Extensions: Egyptian_Hieroglyphs}  (Short: \p{Scx=Egyp}) (1071)
    \p{Script_Extensions: Elba}  \p{Script_Extensions=Elbasan} (40)
    \p{Script_Extensions: Elbasan}  (Short: \p{Scx=Elba}) (40)
    \p{Script_Extensions: Ethi}  \p{Script_Extensions=Ethiopic} (495)
    \p{Script_Extensions: Ethiopic}  (Short: \p{Scx=Ethi}) (495)
    \p{Script_Extensions: Geor}  \p{Script_Extensions=Georgian} (128)
    \p{Script_Extensions: Georgian}  (Short: \p{Scx=Geor}) (128)
    \p{Script_Extensions: Glag}  \p{Script_Extensions=Glagolitic} (94)
    \p{Script_Extensions: Glagolitic}  (Short: \p{Scx=Glag}) (94)
    \p{Script_Extensions: Goth}  \p{Script_Extensions=Gothic} (27)
    \p{Script_Extensions: Gothic}  (Short: \p{Scx=Goth}) (27)
    \p{Script_Extensions: Gran}  \p{Script_Extensions=Grantha} (85)
    \p{Script_Extensions: Grantha}  (Short: \p{Scx=Gran}) (85)
    \p{Script_Extensions: Greek}  (Short: \p{Scx=Grek}) (520)
    \p{Script_Extensions: Grek}  \p{Script_Extensions=Greek} (520)
    \p{Script_Extensions: Gujarati}  (Short: \p{Scx=Gujr}) (96)
    \p{Script_Extensions: Gujr}  \p{Script_Extensions=Gujarati} (96)
    \p{Script_Extensions: Gurmukhi}  (Short: \p{Scx=Guru}) (91)
    \p{Script_Extensions: Guru}  \p{Script_Extensions=Gurmukhi} (91)
    \p{Script_Extensions: Han}  (Short: \p{Scx=Han}) (76_218)
    \p{Script_Extensions: Hang}  \p{Script_Extensions=Hangul} (11_971)
    \p{Script_Extensions: Hangul}  (Short: \p{Scx=Hang}) (11_971)
    \p{Script_Extensions: Hani}  \p{Script_Extensions=Han} (76_218)
    \p{Script_Extensions: Hano}  \p{Script_Extensions=Hanunoo} (23)
    \p{Script_Extensions: Hanunoo}  (Short: \p{Scx=Hano}) (23)
    \p{Script_Extensions: Hebr}  \p{Script_Extensions=Hebrew} (133)
    \p{Script_Extensions: Hebrew}  (Short: \p{Scx=Hebr}) (133)
    \p{Script_Extensions: Hira}  \p{Script_Extensions=Hiragana} (356)
    \p{Script_Extensions: Hiragana}  (Short: \p{Scx=Hira}) (356)
    \p{Script_Extensions: Hmng}  \p{Script_Extensions=Pahawh_Hmong} (127)
    \p{Script_Extensions: Imperial_Aramaic}  (Short: \p{Scx=Armi}) (31)
    \p{Script_Extensions: Inherited}  (Short: \p{Scx=Zinh}) (496)
    \p{Script_Extensions: Inscriptional_Pahlavi}  (Short: \p{Scx=Phli}) (27)
    \p{Script_Extensions: Inscriptional_Parthian}  (Short: \p{Scx=Prti}) (30)
    \p{Script_Extensions: Ital}  \p{Script_Extensions=Old_Italic} (36)
    \p{Script_Extensions: Java}  \p{Script_Extensions=Javanese} (91)
    \p{Script_Extensions: Javanese}  (Short: \p{Scx=Java}) (91)
    \p{Script_Extensions: Kaithi}  (Short: \p{Scx=Kthi}) (86)
    \p{Script_Extensions: Kali}  \p{Script_Extensions=Kayah_Li} (48)
    \p{Script_Extensions: Kana}  \p{Script_Extensions=Katakana} (565)
    \p{Script_Extensions: Kannada}  (Short: \p{Scx=Knda}) (89)
    \p{Script_Extensions: Katakana}  (Short: \p{Scx=Kana}) (565)
    \p{Script_Extensions: Kayah_Li}  (Short: \p{Scx=Kali}) (48)
    \p{Script_Extensions: Khar}  \p{Script_Extensions=Kharoshthi} (65)
    \p{Script_Extensions: Kharoshthi}  (Short: \p{Scx=Khar}) (65)
    \p{Script_Extensions: Khmer}  (Short: \p{Scx=Khmr}) (146)
    \p{Script_Extensions: Khmr}  \p{Script_Extensions=Khmer} (146)
    \p{Script_Extensions: Khoj}  \p{Script_Extensions=Khojki} (71)
    \p{Script_Extensions: Khojki}  (Short: \p{Scx=Khoj}) (71)
    \p{Script_Extensions: Khudawadi}  (Short: \p{Scx=Sind}) (81)
    \p{Script_Extensions: Knda}  \p{Script_Extensions=Kannada} (89)
    \p{Script_Extensions: Kthi}  \p{Script_Extensions=Kaithi} (86)
    \p{Script_Extensions: Lana}  \p{Script_Extensions=Tai_Tham} (127)
    \p{Script_Extensions: Lao}  (Short: \p{Scx=Lao}) (67)
    \p{Script_Extensions: Laoo}  \p{Script_Extensions=Lao} (67)
    \p{Script_Extensions: Latin}  (Short: \p{Scx=Latn}) (1356)
    \p{Script_Extensions: Latn}  \p{Script_Extensions=Latin} (1356)
    \p{Script_Extensions: Lepc}  \p{Script_Extensions=Lepcha} (74)
    \p{Script_Extensions: Lepcha}  (Short: \p{Scx=Lepc}) (74)
    \p{Script_Extensions: Limb}  \p{Script_Extensions=Limbu} (69)
    \p{Script_Extensions: Limbu}  (Short: \p{Scx=Limb}) (69)
    \p{Script_Extensions: Lina}  \p{Script_Extensions=Linear_A} (341)
    \p{Script_Extensions: Linb}  \p{Script_Extensions=Linear_B} (268)
    \p{Script_Extensions: Linear_A}  (Short: \p{Scx=Lina}) (341)
    \p{Script_Extensions: Linear_B}  (Short: \p{Scx=Linb}) (268)
    \p{Script_Extensions: Lisu}  (Short: \p{Scx=Lisu}) (48)
    \p{Script_Extensions: Lyci}  \p{Script_Extensions=Lycian} (29)
    \p{Script_Extensions: Lycian}  (Short: \p{Scx=Lyci}) (29)
    \p{Script_Extensions: Lydi}  \p{Script_Extensions=Lydian} (27)
    \p{Script_Extensions: Lydian}  (Short: \p{Scx=Lydi}) (27)
    \p{Script_Extensions: Mahajani}  (Short: \p{Scx=Mahj}) (61)
    \p{Script_Extensions: Mahj}  \p{Script_Extensions=Mahajani} (61)
    \p{Script_Extensions: Malayalam}  (Short: \p{Scx=Mlym}) (101)
    \p{Script_Extensions: Mand}  \p{Script_Extensions=Mandaic} (30)
    \p{Script_Extensions: Mandaic}  (Short: \p{Scx=Mand}) (30)
    \p{Script_Extensions: Mani}  \p{Script_Extensions=Manichaean} (52)
    \p{Script_Extensions: Manichaean}  (Short: \p{Scx=Mani}) (52)
    \p{Script_Extensions: Meetei_Mayek}  (Short: \p{Scx=Mtei}) (79)
    \p{Script_Extensions: Mend}  \p{Script_Extensions=Mende_Kikakui} (213)
    \p{Script_Extensions: Mende_Kikakui}  (Short: \p{Scx=Mend}) (213)
    \p{Script_Extensions: Merc}  \p{Script_Extensions=Meroitic_Cursive} (26)
    \p{Script_Extensions: Mero}  \p{Script_Extensions=Meroitic_Hieroglyphs} (32)
    \p{Script_Extensions: Meroitic_Cursive}  (Short: \p{Scx=Merc}) (26)
    \p{Script_Extensions: Meroitic_Hieroglyphs}  (Short: \p{Scx=Mero}) (32)
    \p{Script_Extensions: Miao}  (Short: \p{Scx=Miao}) (133)
    \p{Script_Extensions: Mlym}  \p{Script_Extensions=Malayalam} (101)
    \p{Script_Extensions: Modi}  (Short: \p{Scx=Modi}) (89)
    \p{Script_Extensions: Mong}  \p{Script_Extensions=Mongolian} (156)
    \p{Script_Extensions: Mongolian}  (Short: \p{Scx=Mong}) (156)
    \p{Script_Extensions: Mro}  (Short: \p{Scx=Mro}) (43)
    \p{Script_Extensions: Mroo}  \p{Script_Extensions=Mro} (43)
    \p{Script_Extensions: Mtei}  \p{Script_Extensions=Meetei_Mayek} (79)
    \p{Script_Extensions: Myanmar}  (Short: \p{Scx=Mymr}) (224)
    \p{Script_Extensions: Mymr}  \p{Script_Extensions=Myanmar} (224)
    \p{Script_Extensions: Nabataean}  (Short: \p{Scx=Nbat}) (40)
    \p{Script_Extensions: Narb}  \p{Script_Extensions=Old_North_Arabian} (32)
    \p{Script_Extensions: Nbat}  \p{Script_Extensions=Nabataean} (40)
    \p{Script_Extensions: New_Tai_Lue}  (Short: \p{Scx=Talu}) (83)
    \p{Script_Extensions: Nko}  (Short: \p{Scx=Nko}) (59)
    \p{Script_Extensions: Nkoo}  \p{Script_Extensions=Nko} (59)
    \p{Script_Extensions: Ogam}  \p{Script_Extensions=Ogham} (29)
    \p{Script_Extensions: Ogham}  (Short: \p{Scx=Ogam}) (29)
    \p{Script_Extensions: Ol_Chiki}  (Short: \p{Scx=Olck}) (48)
    \p{Script_Extensions: Olck}  \p{Script_Extensions=Ol_Chiki} (48)
    \p{Script_Extensions: Old_Italic}  (Short: \p{Scx=Ital}) (36)
    \p{Script_Extensions: Old_North_Arabian}  (Short: \p{Scx=Narb}) (32)
    \p{Script_Extensions: Old_Permic}  (Short: \p{Scx=Perm}) (43)
    \p{Script_Extensions: Old_Persian}  (Short: \p{Scx=Xpeo}) (50)
    \p{Script_Extensions: Old_South_Arabian}  (Short: \p{Scx=Sarb}) (32)
    \p{Script_Extensions: Old_Turkic}  (Short: \p{Scx=Orkh}) (73)
    \p{Script_Extensions: Oriya}  (Short: \p{Scx=Orya}) (92)
    \p{Script_Extensions: Orkh}  \p{Script_Extensions=Old_Turkic} (73)
    \p{Script_Extensions: Orya}  \p{Script_Extensions=Oriya} (92)
    \p{Script_Extensions: Osma}  \p{Script_Extensions=Osmanya} (40)
    \p{Script_Extensions: Osmanya}  (Short: \p{Scx=Osma}) (40)
    \p{Script_Extensions: Pahawh_Hmong}  (Short: \p{Scx=Hmng}) (127)
    \p{Script_Extensions: Palm}  \p{Script_Extensions=Palmyrene} (32)
    \p{Script_Extensions: Palmyrene}  (Short: \p{Scx=Palm}) (32)
    \p{Script_Extensions: Pau_Cin_Hau}  (Short: \p{Scx=Pauc}) (57)
    \p{Script_Extensions: Pauc}  \p{Script_Extensions=Pau_Cin_Hau} (57)
    \p{Script_Extensions: Perm}  \p{Script_Extensions=Old_Permic} (43)
    \p{Script_Extensions: Phag}  \p{Script_Extensions=Phags_Pa} (59)
    \p{Script_Extensions: Phags_Pa}  (Short: \p{Scx=Phag}) (59)
    \p{Script_Extensions: Phli}  \p{Script_Extensions=Inscriptional_Pahlavi} (27)
    \p{Script_Extensions: Phlp}  \p{Script_Extensions=Psalter_Pahlavi} (30)
    \p{Script_Extensions: Phnx}  \p{Script_Extensions=Phoenician} (29)
    \p{Script_Extensions: Phoenician}  (Short: \p{Scx=Phnx}) (29)
    \p{Script_Extensions: Plrd}  \p{Script_Extensions=Miao} (133)
    \p{Script_Extensions: Prti}  \p{Script_Extensions=Inscriptional_Parthian} (30)
    \p{Script_Extensions: Psalter_Pahlavi}  (Short: \p{Scx=Phlp}) (30)
    \p{Script_Extensions: Qaac}  \p{Script_Extensions=Coptic} (165)
    \p{Script_Extensions: Qaai}  \p{Script_Extensions=Inherited} (496)
    \p{Script_Extensions: Rejang}  (Short: \p{Scx=Rjng}) (37)
    \p{Script_Extensions: Rjng}  \p{Script_Extensions=Rejang} (37)
    \p{Script_Extensions: Runic}  (Short: \p{Scx=Runr}) (86)
    \p{Script_Extensions: Runr}  \p{Script_Extensions=Runic} (86)
    \p{Script_Extensions: Samaritan}  (Short: \p{Scx=Samr}) (61)
    \p{Script_Extensions: Samr}  \p{Script_Extensions=Samaritan} (61)
    \p{Script_Extensions: Sarb}  \p{Script_Extensions=Old_South_Arabian} (32)
    \p{Script_Extensions: Saur}  \p{Script_Extensions=Saurashtra} (81)
    \p{Script_Extensions: Saurashtra}  (Short: \p{Scx=Saur}) (81)
    \p{Script_Extensions: Sharada}  (Short: \p{Scx=Shrd}) (85)
    \p{Script_Extensions: Shavian}  (Short: \p{Scx=Shaw}) (48)
    \p{Script_Extensions: Shaw}  \p{Script_Extensions=Shavian} (48)
    \p{Script_Extensions: Shrd}  \p{Script_Extensions=Sharada} (85)
    \p{Script_Extensions: Sidd}  \p{Script_Extensions=Siddham} (72)
    \p{Script_Extensions: Siddham}  (Short: \p{Scx=Sidd}) (72)
    \p{Script_Extensions: Sind}  \p{Script_Extensions=Khudawadi} (81)
    \p{Script_Extensions: Sinh}  \p{Script_Extensions=Sinhala} (112)
    \p{Script_Extensions: Sinhala}  (Short: \p{Scx=Sinh}) (112)
    \p{Script_Extensions: Sora}  \p{Script_Extensions=Sora_Sompeng} (35)
    \p{Script_Extensions: Sora_Sompeng}  (Short: \p{Scx=Sora}) (35)
    \p{Script_Extensions: Sund}  \p{Script_Extensions=Sundanese} (72)
    \p{Script_Extensions: Sundanese}  (Short: \p{Scx=Sund}) (72)
    \p{Script_Extensions: Sylo}  \p{Script_Extensions=Syloti_Nagri} (56)
    \p{Script_Extensions: Syloti_Nagri}  (Short: \p{Scx=Sylo}) (56)
    \p{Script_Extensions: Syrc}  \p{Script_Extensions=Syriac} (93)
    \p{Script_Extensions: Syriac}  (Short: \p{Scx=Syrc}) (93)
    \p{Script_Extensions: Tagalog}  (Short: \p{Scx=Tglg}) (22)
    \p{Script_Extensions: Tagb}  \p{Script_Extensions=Tagbanwa} (20)
    \p{Script_Extensions: Tagbanwa}  (Short: \p{Scx=Tagb}) (20)
    \p{Script_Extensions: Tai_Le}  (Short: \p{Scx=Tale}) (45)
    \p{Script_Extensions: Tai_Tham}  (Short: \p{Scx=Lana}) (127)
    \p{Script_Extensions: Tai_Viet}  (Short: \p{Scx=Tavt}) (72)
    \p{Script_Extensions: Takr}  \p{Script_Extensions=Takri} (78)
    \p{Script_Extensions: Takri}  (Short: \p{Scx=Takr}) (78)
    \p{Script_Extensions: Tale}  \p{Script_Extensions=Tai_Le} (45)
    \p{Script_Extensions: Talu}  \p{Script_Extensions=New_Tai_Lue} (83)
    \p{Script_Extensions: Tamil}  (Short: \p{Scx=Taml}) (74)
    \p{Script_Extensions: Taml}  \p{Script_Extensions=Tamil} (74)
    \p{Script_Extensions: Tavt}  \p{Script_Extensions=Tai_Viet} (72)
    \p{Script_Extensions: Telu}  \p{Script_Extensions=Telugu} (97)
    \p{Script_Extensions: Telugu}  (Short: \p{Scx=Telu}) (97)
    \p{Script_Extensions: Tfng}  \p{Script_Extensions=Tifinagh} (59)
    \p{Script_Extensions: Tglg}  \p{Script_Extensions=Tagalog} (22)
    \p{Script_Extensions: Thaa}  \p{Script_Extensions=Thaana} (65)
    \p{Script_Extensions: Thaana}  (Short: \p{Scx=Thaa}) (65)
    \p{Script_Extensions: Thai}  (Short: \p{Scx=Thai}) (86)
    \p{Script_Extensions: Tibetan}  (Short: \p{Scx=Tibt}) (207)
    \p{Script_Extensions: Tibt}  \p{Script_Extensions=Tibetan} (207)
    \p{Script_Extensions: Tifinagh}  (Short: \p{Scx=Tfng}) (59)
    \p{Script_Extensions: Tirh}  \p{Script_Extensions=Tirhuta} (94)
    \p{Script_Extensions: Tirhuta}  (Short: \p{Scx=Tirh}) (94)
    \p{Script_Extensions: Ugar}  \p{Script_Extensions=Ugaritic} (31)
    \p{Script_Extensions: Ugaritic}  (Short: \p{Scx=Ugar}) (31)
    \p{Script_Extensions: Unknown}  (Short: \p{Scx=Zzzz}) (1_001_091 plus all above-Unicode code points)
    \p{Script_Extensions: Vai}  (Short: \p{Scx=Vai}) (300)
    \p{Script_Extensions: Vaii}  \p{Script_Extensions=Vai} (300)
    \p{Script_Extensions: Wara}  \p{Script_Extensions=Warang_Citi} (84)
    \p{Script_Extensions: Warang_Citi}  (Short: \p{Scx=Wara}) (84)
    \p{Script_Extensions: Xpeo}  \p{Script_Extensions=Old_Persian} (50)
    \p{Script_Extensions: Xsux}  \p{Script_Extensions=Cuneiform} (1037)
    \p{Script_Extensions: Yi}  (Short: \p{Scx=Yi}) (1246)
    \p{Script_Extensions: Yiii}  \p{Script_Extensions=Yi} (1246)
    \p{Script_Extensions: Zinh}  \p{Script_Extensions=Inherited} (496)
    \p{Script_Extensions: Zyyy}  \p{Script_Extensions=Common} (6741)
    \p{Script_Extensions: Zzzz}  \p{Script_Extensions=Unknown} (1_001_091 plus all above-Unicode code points)
    \p{Scx: *}  \p{Script_Extensions: *}
  3628. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  3629. \p{SD: *} \p{Soft_Dotted: *}
  3630. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  3631. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  3632. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
  3633. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187)
  3634. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  3635. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1834)
  3636. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1834)
  3637. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (148)
  3638. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (148)
  3639. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (99_420)
  3640. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  3641. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2029)
  3642. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2029)
  3643. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (532)
  3644. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (532)
  3645. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (99_420)
  3646. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_008_170 plus all
  3647. above-Unicode code points)
  3648. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  3649. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  3650. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  3651. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  3652. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20)
  3653. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (96)
  3654. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (96)
  3655. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1641)
  3656. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1641)
  3657. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_008_170 plus
  3658. all above-Unicode code points)
  3659. \p{Separator} \p{General_Category=Separator} (Short:
  3660. \p{Z}) (19)
  3661. \p{Sharada} \p{Script=Sharada} (Short: \p{Shrd}; NOT
  3662. \p{Block=Sharada}) (85)
  3663. \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
  3664. \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
  3665. X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
  3666. (16)
  3667. \p{Shrd} \p{Sharada} (= \p{Script=Sharada}) (NOT
  3668. \p{Block=Sharada}) (85)
  3669. \p{Sidd} \p{Siddham} (= \p{Script=Siddham}) (NOT
  3670. \p{Block=Siddham}) (72)
  3671. \p{Siddham} \p{Script=Siddham} (Short: \p{Sidd}; NOT
  3672. \p{Block=Siddham}) (72)
  3673. \p{Sind} \p{Khudawadi} (= \p{Script=Khudawadi})
  3674. (NOT \p{Block=Khudawadi}) (69)
  3675. \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
  3676. \p{Block=Sinhala}) (110)
  3677. \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
  3678. \p{Block=Sinhala}) (110)
  3679. X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
  3680. \p{Sk} \p{Modifier_Symbol} (=
  3681. \p{General_Category=Modifier_Symbol})
  3682. (116)
  3683. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  3684. Math_Symbol}) (948)
  3685. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  3686. \p{InSmallForms}) (32)
  3687. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  3688. Small_Form_Variants}) (32)
  3689. \p{So} \p{Other_Symbol} (= \p{General_Category=
  3690. Other_Symbol}) (5082)
  3691. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  3692. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
  3693. all above-Unicode code points)
  3694. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  3695. \p{Sora} \p{Sora_Sompeng} (= \p{Script=
  3696. Sora_Sompeng}) (NOT \p{Block=
  3697. Sora_Sompeng}) (35)
  3698. \p{Sora_Sompeng} \p{Script=Sora_Sompeng} (Short: \p{Sora};
  3699. NOT \p{Block=Sora_Sompeng}) (35)
  3700. \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
  3701. \p{Space: *} \p{White_Space: *}
  3702. \p{Space_Separator} \p{General_Category=Space_Separator}
  3703. (Short: \p{Zs}) (17)
  3704. \p{SpacePerl} \p{XPosixSpace} (25)
  3705. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  3706. \p{Mc}) (399)
  3707. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  3708. (Short: \p{InModifierLetters}) (80)
  3709. X \p{Specials} \p{Block=Specials} (16)
  3710. \p{STerm} \p{STerm=Y} (99)
  3711. \p{STerm: N*} (Single: \P{STerm}) (1_114_013 plus all
  3712. above-Unicode code points)
  3713. \p{STerm: Y*} (Single: \p{STerm}) (99)
  3714. \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
  3715. (NOT \p{Block=Sundanese}) (72)
  3716. \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
  3717. \p{Block=Sundanese}) (72)
  3718. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  3719. Sundanese_Supplement}) (16)
  3720. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  3721. \p{InSundaneseSup}) (16)
  3722. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  3723. Supplemental_Arrows_A}) (16)
  3724. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  3725. Supplemental_Arrows_B}) (128)
  3726. X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
  3727. Supplemental_Arrows_C}) (256)
  3728. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  3729. \p{Block=
  3730. Supplemental_Mathematical_Operators})
  3731. (256)
  3732. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  3733. \p{Block=
  3734. Supplementary_Private_Use_Area_A})
  3735. (65_536)
  3736. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  3737. \p{Block=
  3738. Supplementary_Private_Use_Area_B})
  3739. (65_536)
  3740. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  3741. Supplemental_Punctuation}) (128)
  3742. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  3743. \p{Block=Superscripts_And_Subscripts})
  3744. (48)
  3745. X \p{Superscripts_And_Subscripts} \p{Block=
  3746. Superscripts_And_Subscripts} (Short:
  3747. \p{InSuperAndSub}) (48)
  3748. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  3749. \p{InSupArrowsA}) (16)
  3750. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  3751. \p{InSupArrowsB}) (128)
  3752. X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
  3753. \p{InSupArrowsC}) (256)
  3754. X \p{Supplemental_Mathematical_Operators} \p{Block=
  3755. Supplemental_Mathematical_Operators}
  3756. (Short: \p{InSupMathOperators}) (256)
  3757. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  3758. (Short: \p{InSupPunctuation}) (128)
  3759. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  3760. Supplementary_Private_Use_Area_A}
  3761. (Short: \p{InSupPUAA}) (65_536)
  3762. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  3763. Supplementary_Private_Use_Area_B}
  3764. (Short: \p{InSupPUAB}) (65_536)
  3765. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  3766. \p{Cs}) (2048)
  3767. \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
  3768. Syloti_Nagri}) (NOT \p{Block=
  3769. Syloti_Nagri}) (44)
  3770. \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
  3771. NOT \p{Block=Syloti_Nagri}) (44)
  3772. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  3773. (6198)
  3774. \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
  3775. \p{Block=Syriac}) (77)
  3776. \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
  3777. \p{Block=Syriac}) (77)
  3778. \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
  3779. \p{Block=Tagalog}) (20)
  3780. \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
  3781. \p{Block=Tagbanwa}) (18)
  3782. \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
  3783. \p{Block=Tagbanwa}) (18)
  3784. X \p{Tags} \p{Block=Tags} (128)
  3785. \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
  3786. \p{Block=Tai_Le}) (35)
  3787. \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
  3788. \p{Block=Tai_Tham}) (127)
  3789. \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
  3790. \p{Block=Tai_Viet}) (72)
  3791. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  3792. Tai_Xuan_Jing_Symbols}) (96)
  3793. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  3794. \p{InTaiXuanJing}) (96)
  3795. \p{Takr} \p{Takri} (= \p{Script=Takri}) (NOT
  3796. \p{Block=Takri}) (66)
  3797. \p{Takri} \p{Script=Takri} (Short: \p{Takr}; NOT
  3798. \p{Block=Takri}) (66)
  3799. \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
  3800. \p{Block=Tai_Le}) (35)
  3801. \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
  3802. (NOT \p{Block=New_Tai_Lue}) (83)
  3803. \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
  3804. \p{Block=Tamil}) (72)
  3805. \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
  3806. \p{Block=Tamil}) (72)
  3807. \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
  3808. \p{Block=Tai_Viet}) (72)
  3809. \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
  3810. \p{Block=Telugu}) (95)
  3811. \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
  3812. \p{Block=Telugu}) (95)
  3813. \p{Term} \p{Terminal_Punctuation} (=
  3814. \p{Terminal_Punctuation=Y}) (214)
  3815. \p{Term: *} \p{Terminal_Punctuation: *}
  3816. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  3817. \p{Term}) (214)
  3818. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  3819. (1_113_898 plus all above-Unicode code
  3820. points)
  3821. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (214)
  3822. \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
  3823. \p{Block=Tifinagh}) (59)
  3824. \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
  3825. \p{Block=Tagalog}) (20)
  3826. \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
  3827. \p{Block=Thaana}) (50)
  3828. \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
  3829. \p{Block=Thaana}) (50)
  3830. \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
  3831. \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
  3832. \p{Block=Tibetan}) (207)
  3833. \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
  3834. \p{Block=Tibetan}) (207)
  3835. \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
  3836. \p{Block=Tifinagh}) (59)
  3837. \p{Tirh} \p{Tirhuta} (= \p{Script=Tirhuta}) (NOT
  3838. \p{Block=Tirhuta}) (82)
  3839. \p{Tirhuta} \p{Script=Tirhuta} (Short: \p{Tirh}; NOT
  3840. \p{Block=Tirhuta}) (82)
  3841. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  3842. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  3843. Cased=Yes) (31)
  3844. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  3845. (Short: \p{Lt}; /i= General_Category=
  3846. Cased_Letter) (31)
  3847. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  3848. Transport_And_Map_Symbols}) (128)
  3849. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  3850. (Short: \p{InTransportAndMap}) (128)
  3851. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  3852. (= \p{Block=
  3853. Unified_Canadian_Aboriginal_Syllabics})
  3854. (640)
  3855. X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
  3856. Extended} (= \p{Block=
  3857. Unified_Canadian_Aboriginal_Syllabics_-
  3858. Extended}) (80)
  3859. \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
  3860. \p{Block=Ugaritic}) (31)
  3861. \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
  3862. \p{Block=Ugaritic}) (31)
  3863. \p{UIdeo} \p{Unified_Ideograph} (=
  3864. \p{Unified_Ideograph=Y}) (74_617)
  3865. \p{UIdeo: *} \p{Unified_Ideograph: *}
  3866. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  3867. \p{Cn}) (861_575 plus all above-Unicode
  3868. code points)
  3869. \p{Unicode} \p{Any} (1_114_112)
  3870. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  3871. Unified_Canadian_Aboriginal_Syllabics}
  3872. (Short: \p{InUCAS}) (640)
  3873. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  3874. Unified_Canadian_Aboriginal_Syllabics_-
  3875. Extended} (Short: \p{InUCASExt}) (80)
  3876. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  3877. (74_617)
  3878. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  3879. (1_039_495 plus all above-Unicode code
  3880. points)
  3881. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_617)
  3882. \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
  3883. (1_001_091 plus all above-Unicode code
  3884. points)
  3885. \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  3886. Cased=Yes) (1610)
  3887. \p{Upper: *} \p{Uppercase: *}
  3888. \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  3889. Cased=Yes) (1610)
  3890. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  3891. No) (1_112_502 plus all above-Unicode
  3892. code points)
  3893. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  3894. Yes) (1610)
  3895. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  3896. (Short: \p{Lu}; /i= General_Category=
  3897. Cased_Letter) (1490)
  3898. \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
  3899. \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
  3900. Vai}) (300)
  3901. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  3902. NOT \p{Variation_Selectors}) (259)
  3903. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
  3904. plus all above-Unicode code points)
  3905. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  3906. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  3907. \p{InVS}) (16)
  3908. X \p{Variation_Selectors_Supplement} \p{Block=
  3909. Variation_Selectors_Supplement} (Short:
  3910. \p{InVSSup}) (240)
  3911. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  3912. Vedic_Extensions}) (48)
  3913. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  3914. \p{InVedicExt}) (48)
  3915. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  3916. \p{VertSpace} \v (7)
  3917. \p{VS} \p{Variation_Selector} (=
  3918. \p{Variation_Selector=Y}) (NOT
  3919. \p{Variation_Selectors}) (259)
  3920. \p{VS: *} \p{Variation_Selector: *}
  3921. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  3922. \p{Block=
  3923. Variation_Selectors_Supplement}) (240)
  3924. \p{Wara} \p{Warang_Citi} (= \p{Script=Warang_Citi})
  3925. (NOT \p{Block=Warang_Citi}) (84)
  3926. \p{Warang_Citi} \p{Script=Warang_Citi} (Short: \p{Wara};
  3927. NOT \p{Block=Warang_Citi}) (84)
  3928. \p{WB: *} \p{Word_Break: *}
  3929. \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
  3930. \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
  3931. plus all above-Unicode code points)
  3932. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25)
  3933. \p{Word} \p{XPosixWord} (105_473)
  3934. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (26_647)
  3935. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  3936. \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
  3937. \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
  3938. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
  3939. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1834)
  3940. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
  3941. \p{Word_Break: FO} \p{Word_Break=Format} (147)
  3942. \p{Word_Break: Format} (Short: \p{WB=FO}) (147)
  3943. \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
  3944. \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
  3945. \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  3946. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  3947. \p{Word_Break: LE} \p{Word_Break=ALetter} (26_647)
  3948. \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  3949. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
  3950. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
  3951. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  3952. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
  3953. \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
  3954. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  3955. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  3956. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  3957. \p{Word_Break: NU} \p{Word_Break=Numeric} (531)
  3958. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (531)
  3959. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_084_493 plus all
  3960. above-Unicode code points)
  3961. \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
  3962. \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  3963. \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
  3964. \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
  3965. \p{Word_Break: XX} \p{Word_Break=Other} (1_084_493 plus all
  3966. above-Unicode code points)
  3967. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
  3968. \p{WSpace: *} \p{White_Space: *}
  3969. \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  3970. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  3971. (105_324)
  3972. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_008_788
  3973. plus all above-Unicode code points)
  3974. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (105_324)
  3975. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (102_941)
  3976. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_011_171
  3977. plus all above-Unicode code points)
  3978. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (102_941)
  3979. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  3980. (105_324)
  3981. \p{XIDC: *} \p{XID_Continue: *}
  3982. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (102_941)
  3983. \p{XIDS: *} \p{XID_Start: *}
  3984. \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
  3985. (NOT \p{Block=Old_Persian}) (50)
  3986. \p{XPerlSpace} \p{XPosixSpace} (25)
  3987. \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
  3988. \p{Alnum}) (104_617)
  3989. \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
  3990. (104_077)
  3991. \p{XPosixBlank} \h, Horizontal white space (Short:
  3992. \p{Blank}) (18)
  3993. \p{XPosixCntrl} \p{General_Category=Control} Control
  3994. characters (Short: \p{Cc}) (65)
  3995. \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
  3996. + all other decimal digits (Short:
  3997. \p{Nd}) (540)
  3998. \p{XPosixGraph} Characters that are graphical (Short:
  3999. \p{Graph}) (250_405)
  4000. \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
  4001. Cased=Yes) (2030)
  4002. \p{XPosixPrint} Characters that are graphical plus space
  4003. characters (but no controls) (Short:
  4004. \p{Print}) (250_422)
  4005. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (697)
  4006. \p{XPosixSpace} \s including beyond ASCII and vertical tab
  4007. (Short: \p{SpacePerl}) (25)
  4008. \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
  4009. Cased=Yes) (1610)
  4010. \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
  4011. \pM + \p{Pc} (Short: \p{Word}) (105_473)
  4012. \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  4013. \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
  4014. (NOT \p{Block=Cuneiform}) (1037)
  4015. \p{Yi} \p{Script=Yi} (1220)
  4016. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  4017. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  4018. \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
  4019. X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
  4020. Yijing_Hexagram_Symbols}) (64)
  4021. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
  4022. (Short: \p{InYijing}) (64)
  4023. \p{Z} \pZ \p{Separator} (= \p{General_Category=
  4024. Separator}) (19)
  4025. \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
  4026. (563)
  4027. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  4028. Line_Separator}) (1)
  4029. \p{Zp} \p{Paragraph_Separator} (=
  4030. \p{General_Category=
  4031. Paragraph_Separator}) (1)
  4032. \p{Zs} \p{Space_Separator} (=
  4033. \p{General_Category=Space_Separator})
  4034. (17)
  4035. \p{Zyyy} \p{Common} (= \p{Script=Common}) (7129)
  4036. \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
  4037. (1_001_091 plus all above-Unicode code
  4038. points)
  4039. TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
  4040. stable) (= \p{Soft_Dotted=Y}) (46)
  4041. TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
  4042. stable) (= \p{Case_Ignorable=Y}) (1961)
  4043. TX\p{_CombAbove} (For internal use by Perl, not necessarily
  4044. stable) (= \p{Canonical_Combining_Class=
  4045. Above}) (399)
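
The forms in the table above are used inside regular expressions. The following is a minimal sketch, not taken from the original document: it shows the Script/Block distinction flagged by the "NOT \p{Block=...}" annotations, the \P{} complement, and the "/i=" annotation on \p{Upper}. The two sample code points (U+0E01 and U+0E3F) are chosen here purely for illustration.

```perl
use strict;
use warnings;

# U+0E01 THAI CHARACTER KO KAI is in both the Thai script and the Thai block.
my $ko_kai = "\x{0E01}";
# U+0E3F THAI CURRENCY SYMBOL BAHT is in the Thai block, but its Script is Common.
my $baht   = "\x{0E3F}";

my %result = (
    ko_kai_script => ($ko_kai =~ /\p{Script=Thai}/ ? 1 : 0),
    baht_block    => ($baht   =~ /\p{Block=Thai}/  ? 1 : 0),
    baht_script   => ($baht   =~ /\p{Script=Thai}/ ? 1 : 0),
    # \P{...} (capital P) is the complement of \p{...}.
    a_not_space   => ("a" =~ /\P{Space}/ ? 1 : 0),
    # Without /i a lowercase letter is not \p{Upper}; with /i the table's
    # "/i= Cased=Yes" annotation applies, so any cased letter matches.
    a_upper       => ("a" =~ /\p{Upper}/  ? 1 : 0),
    a_upper_i     => ("a" =~ /\p{Upper}/i ? 1 : 0),
);

printf "%-14s %d\n", $_, $result{$_} for sort keys %result;
```

Spelling out Script= or Block= explicitly, as done here, avoids depending on which of the two a bare single form resolves to.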

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This generally happens either because they are obsolete, or because they exist for symmetry with other forms but no language that uses them has yet been encoded. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
  • \p{Grapheme_Cluster_Break=Prepend}

Properties accessible through Unicode::UCD

The value of any Unicode (not including Perl extensions) character property mentioned above for any single code point is available through charprop() in Unicode::UCD. charprops_all() in Unicode::UCD returns the values of all the Unicode properties for a given code point.
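
As a minimal sketch of these two functions (assuming a Unicode::UCD recent enough to export charprop() and charprops_all(), such as the one shipped with the Perl version this document describes), the following looks up two properties of one code point and then counts the entries returned for it. The choice of U+0041 and of the Name and Script properties is illustrative only.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = 0x41;    # LATIN CAPITAL LETTER A

# Value of a single property for a single code point.
my $name   = charprop($cp, "Name");
my $script = charprop($cp, "Script");

# Reference to a hash holding every Unicode property of the code point.
my $all = charprops_all($cp);

print "Name:   $name\n";
print "Script: $script\n";
printf "charprops_all() returned %d properties\n", scalar keys %$all;
```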

Besides these, all the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.
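
The sketch below shows one way prop_invlist() might be used to count the code points a property matches, including one of the property-value pairs listed earlier as matching nothing. The helper name count_code_points is hypothetical, and the counts printed depend on the Unicode version of the running Perl, so no output is shown.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: count the Unicode code points (U+0000..U+10FFFF)
# matched by a property. Returns undef if the property is not recognized.
sub count_code_points {
    my ($property) = @_;

    # In scalar context prop_invlist() returns undef for unknown input,
    # which separates "unknown" from "known but matches nothing".
    my $element_count = prop_invlist($property);
    return undef unless defined $element_count;

    my @invlist = prop_invlist($property);
    my $count   = 0;

    # Elements alternate: first code point of a matching range, then first
    # code point of the following non-matching range. An odd-length list
    # means the last range is open-ended; cap it at the Unicode maximum.
    for (my $i = 0; $i < @invlist; $i += 2) {
        my $start = $invlist[$i];
        last if $start > 0x10FFFF;
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $end = 0x110000 if $end > 0x110000;
        $count += $end - $start;
    }
    return $count;
}

for my $property ("Soft_Dotted",
                  "Canonical_Combining_Class=Attached_Below_Left") {
    my $count = count_code_points($property);
    printf "%-48s %s\n", $property, defined $count ? $count : "unknown";
}
```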

Due to their nature, not all Unicode character properties are suitable for regular expression matches, nor for prop_invlist(). The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. However, there are some ambiguities between some Perl extensions and the Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single form equivalents of Unicode properties, such as \p{sc} being a single-form equivalent of \p{gc=sc}, which is treated by prop_invmap() as the Script property, whose short name is sc. The table indicates the current ambiguities in the INFO column, beginning with the word "NOT".

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are described in Other Properties in perlunicode.

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. If a property is obsolete, etc., the entry will be flagged with the same characters used in the table in the section above, like D or S.

  NAME                          INFO
  Age
  AHex                          ASCII_Hex_Digit
  All                           (Perl extension). All code points,
                                  including those above Unicode. Same as
                                  qr/./s
  Alnum                         XPosixAlnum. (Perl extension)
  Alpha                         Alphabetic
  Alphabetic                    (Short: Alpha)
  Any                           (Perl extension). All Unicode code
                                  points: [\x{0000}-\x{10FFFF}]
  ASCII                         Block=ASCII. (Perl extension).
                                  [[:ASCII:]]
  ASCII_Hex_Digit               (Short: AHex)
  Assigned                      (Perl extension). All assigned code points
  Bc                            Bidi_Class
  Bidi_C                        Bidi_Control
  Bidi_Class                    (Short: bc)
  Bidi_Control                  (Short: Bidi_C)
  Bidi_M                        Bidi_Mirrored
  Bidi_Mirrored                 (Short: Bidi_M)
  Bidi_Mirroring_Glyph          (Short: bmg)
  Bidi_Paired_Bracket           (Short: bpb)
  Bidi_Paired_Bracket_Type      (Short: bpt)
  Blank                         XPosixBlank. (Perl extension)
  Blk                           Block
  Block                         (Short: blk)
  Bmg                           Bidi_Mirroring_Glyph
  Bpb                           Bidi_Paired_Bracket
  Bpt                           Bidi_Paired_Bracket_Type
  Canonical_Combining_Class     (Short: ccc)
  Case_Folding                  (Short: cf)
  Case_Ignorable                (Short: CI)
  Cased
  Category                      General_Category
  Ccc                           Canonical_Combining_Class
  CE                            Composition_Exclusion
  Cf                            Case_Folding; NOT 'cf' meaning
                                  'General_Category=Format'
  Changes_When_Casefolded       (Short: CWCF)
  Changes_When_Casemapped       (Short: CWCM)
  Changes_When_Lowercased       (Short: CWL)
  Changes_When_NFKC_Casefolded  (Short: CWKCF)
  Changes_When_Titlecased       (Short: CWT)
  Changes_When_Uppercased       (Short: CWU)
  CI                            Case_Ignorable
  Cntrl                         General_Category=XPosixCntrl. (Perl
                                  extension)
  Comp_Ex                       Full_Composition_Exclusion
  Composition_Exclusion         (Short: CE)
  CWCF                          Changes_When_Casefolded
  CWCM                          Changes_When_Casemapped
  CWKCF                         Changes_When_NFKC_Casefolded
  CWL                           Changes_When_Lowercased
  CWT                           Changes_When_Titlecased
  CWU                           Changes_When_Uppercased
  Dash
  Decomposition_Mapping         (Short: dm)
  Decomposition_Type            (Short: dt)
  Default_Ignorable_Code_Point  (Short: DI)
  Dep                           Deprecated
  Deprecated                    (Short: Dep)
  DI                            Default_Ignorable_Code_Point
  Dia                           Diacritic
  Diacritic                     (Short: Dia)
  Digit                         General_Category=XPosixDigit. (Perl
                                  extension)
  Dm                            Decomposition_Mapping
  Dt                            Decomposition_Type
  Ea                            East_Asian_Width
  East_Asian_Width              (Short: ea)
  Ext                           Extender
  Extender                      (Short: Ext)
  Full_Composition_Exclusion    (Short: Comp_Ex)
  Gc                            General_Category
  GCB                           Grapheme_Cluster_Break
  General_Category              (Short: gc)
  Gr_Base                       Grapheme_Base
  Gr_Ext                        Grapheme_Extend
  Graph                         XPosixGraph. (Perl extension)
  Grapheme_Base                 (Short: Gr_Base)
  Grapheme_Cluster_Break        (Short: GCB)
  Grapheme_Extend               (Short: Gr_Ext)
  Hangul_Syllable_Type          (Short: hst)
  Hex                           Hex_Digit
  Hex_Digit                     (Short: Hex)
  HorizSpace                    XPosixBlank. (Perl extension)
  Hst                           Hangul_Syllable_Type
D Hyphen                        Supplanted by Line_Break property values;
                                  see www.unicode.org/reports/tr14
  ID_Continue                   (Short: IDC)
  ID_Start                      (Short: IDS)
  IDC                           ID_Continue
  Ideo                          Ideographic
  Ideographic                   (Short: Ideo)
  IDS                           ID_Start
  IDS_Binary_Operator           (Short: IDSB)
  IDS_Trinary_Operator          (Short: IDST)
  IDSB                          IDS_Binary_Operator
  IDST                          IDS_Trinary_Operator
  In                            Present_In. (Perl extension)
  Isc                           ISO_Comment; NOT 'isc' meaning
                                  'General_Category=Other'
  ISO_Comment                   (Short: isc)
  Jg                            Joining_Group
  Join_C                        Join_Control
  Join_Control                  (Short: Join_C)
  Joining_Group                 (Short: jg)
  Joining_Type                  (Short: jt)
  Jt                            Joining_Type
  Lb                            Line_Break
  Lc                            Lowercase_Mapping; NOT 'lc' meaning
                                  'General_Category=Cased_Letter'
  Line_Break                    (Short: lb)
  LOE                           Logical_Order_Exception
  Logical_Order_Exception       (Short: LOE)
  Lower                         Lowercase
  Lowercase                     (Short: Lower)
  Lowercase_Mapping             (Short: lc)
  Math
  Na                            Name
  Na1                           Unicode_1_Name
  Name                          (Short: na)
  Name_Alias
  NChar                         Noncharacter_Code_Point
  NFC_QC                        NFC_Quick_Check
  NFC_Quick_Check               (Short: NFC_QC)
  NFD_QC                        NFD_Quick_Check
  NFD_Quick_Check               (Short: NFD_QC)
  NFKC_Casefold                 (Short: NFKC_CF)
  NFKC_CF                       NFKC_Casefold
  NFKC_QC                       NFKC_Quick_Check
  NFKC_Quick_Check              (Short: NFKC_QC)
  NFKD_QC                       NFKD_Quick_Check
  NFKD_Quick_Check              (Short: NFKD_QC)
  Noncharacter_Code_Point       (Short: NChar)
  Nt                            Numeric_Type
  Numeric_Type                  (Short: nt)
  Numeric_Value                 (Short: nv)
  Nv                            Numeric_Value
  Pat_Syn                       Pattern_Syntax
  Pat_WS                        Pattern_White_Space
  Pattern_Syntax                (Short: Pat_Syn)
  Pattern_White_Space           (Short: Pat_WS)
  Perl_Decimal_Digit            (Perl extension)
  PerlSpace                     PosixSpace. (Perl extension)
  PerlWord                      PosixWord. (Perl extension)
  PosixAlnum                    (Perl extension). [A-Za-z0-9]
  PosixAlpha                    (Perl extension). [A-Za-z]
  PosixBlank                    (Perl extension). \t and ' '
  PosixCntrl                    (Perl extension). ASCII control
                                  characters: NUL, SOH, STX, ETX, EOT, ENQ,
                                  ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
                                  DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
                                  CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
  PosixDigit                    (Perl extension). [0-9]
  PosixGraph                    (Perl extension). [-!"#$%&'()*+,./:;<=
                                  >?@[\\]^_`{|}~0-9A-Za-z]
  PosixLower                    (Perl extension). [a-z]
  PosixPrint                    (Perl extension). [- 0-9A-Za-
                                  z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  PosixPunct                    (Perl extension). [-!"#$%&'()*+,./:;<=
                                  >?@[\\]^_`{|}~]
  PosixSpace                    (Perl extension). \t, \n, \cK, \f, \r,
                                  and ' '. (\cK is vertical tab)
  PosixUpper                    (Perl extension). [A-Z]
  PosixWord                     (Perl extension). \w, restricted to ASCII
                                  = [A-Za-z0-9_]
  PosixXDigit                   (Perl extension). [0-9A-Fa-f]
  Present_In                    (Short: In). (Perl extension)
  Print                         XPosixPrint. (Perl extension)
  Punct                         General_Category=Punct. (Perl extension)
  QMark                         Quotation_Mark
  Quotation_Mark                (Short: QMark)
  Radical
  SB                            Sentence_Break
  Sc                            Script; NOT 'sc' meaning
                                  'General_Category=Currency_Symbol'
  Scf                           Simple_Case_Folding
  Script                        (Short: sc)
  Script_Extensions             (Short: scx)
  Scx                           Script_Extensions
  SD                            Soft_Dotted
  Sentence_Break                (Short: SB)
  Sfc                           Simple_Case_Folding
  Simple_Case_Folding           (Short: scf)
  Simple_Lowercase_Mapping      (Short: slc)
  Simple_Titlecase_Mapping      (Short: stc)
  Simple_Uppercase_Mapping      (Short: suc)
  Slc                           Simple_Lowercase_Mapping
  Soft_Dotted                   (Short: SD)
  Space                         White_Space
  SpacePerl                     XPosixSpace. (Perl extension)
  Stc                           Simple_Titlecase_Mapping
  STerm
  Suc                           Simple_Uppercase_Mapping
  Tc                            Titlecase_Mapping
  Term                          Terminal_Punctuation
  Terminal_Punctuation          (Short: Term)
  Title                         Titlecase. (Perl extension)
  Titlecase                     (Short: Title). (Perl extension). (=
                                  \p{Gc=Lt})
  Titlecase_Mapping             (Short: tc)
  Uc                            Uppercase_Mapping
  UIdeo                         Unified_Ideograph
  Unicode                       Any. (Perl extension)
  Unicode_1_Name                (Short: na1)
  Unified_Ideograph             (Short: UIdeo)
  Upper                         Uppercase
  Uppercase                     (Short: Upper)
  Uppercase_Mapping             (Short: uc)
  Variation_Selector            (Short: VS)
  VertSpace                     (Perl extension). \v
  VS                            Variation_Selector
  WB                            Word_Break
  White_Space                   (Short: WSpace)
  Word                          XPosixWord. (Perl extension)
  Word_Break                    (Short: WB)
  WSpace                        White_Space
  XDigit                        XPosixXDigit. (Perl extension)
  XID_Continue                  (Short: XIDC)
  XID_Start                     (Short: XIDS)
  XIDC                          XID_Continue
  XIDS                          XID_Start
  XPerlSpace                    XPosixSpace. (Perl extension)
  XPosixAlnum                   (Short: Alnum). (Perl extension).
                                  Alphabetic and (decimal) Numeric
  XPosixAlpha                   (Perl extension)
  XPosixBlank                   (Short: Blank). (Perl extension). \h,
                                  Horizontal white space
  XPosixCntrl                   General_Category=XPosixCntrl (Short:
                                  Cntrl). (Perl extension). Control
                                  characters
  XPosixDigit                   General_Category=XPosixDigit (Short:
                                  Digit). (Perl extension). [0-9] + all
                                  other decimal digits
  XPosixGraph                   (Short: Graph). (Perl extension).
                                  Characters that are graphical
  XPosixLower                   (Perl extension)
  XPosixPrint                   (Short: Print). (Perl extension).
                                  Characters that are graphical plus space
                                  characters (but no controls)
  XPosixPunct                   (Perl extension). \p{Punct} + ASCII-range
                                  \p{Symbol}
  XPosixSpace                   (Perl extension). \s including beyond
                                  ASCII and vertical tab
  XPosixUpper                   (Perl extension)
  XPosixWord                    (Short: Word). (Perl extension). \w,
                                  including beyond ASCII; = \p{Alnum} + \pM
                                  + \p{Pc}
  XPosixXDigit                  (Short: XDigit). (Perl extension)

Properties accessible through other means

Certain properties are also accessible via core function calls. These are:

  Lowercase_Mapping    lc() and lcfirst()
  Titlecase_Mapping    ucfirst()
  Uppercase_Mapping    uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F transliteration escape, and the fc operator.
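
As a minimal sketch of the core functions named above, the following uses U+01C6 (LATIN SMALL LETTER DZ WITH CARON), chosen here only because its titlecase and uppercase forms differ, and the two Greek sigmas to illustrate case folding. The variable names are illustrative, and fc() is assumed to be enabled through the fc feature (Perl 5.16 or later).

```perl
use strict;
use warnings;
use feature 'fc';    # fc() is only available when this feature is enabled

# U+01C6 has distinct titlecase (U+01C5) and uppercase (U+01C4) mappings.
my $dz = "\x{1C6}";

my $title = ucfirst $dz;    # Titlecase_Mapping
my $upper = uc $dz;         # Uppercase_Mapping
my $lower = lc $upper;      # Lowercase_Mapping, back to the original

printf "title U+%04X, upper U+%04X, lower U+%04X\n",
    ord($title), ord($upper), ord($lower);

# Case_Folding: capital sigma (U+03A3) and final sigma (U+03C2)
# both fold to small sigma (U+03C3), so they compare equal under fc()
# and match each other under /i.
my $same_fold = fc("\x{3A3}") eq fc("\x{3C2}");
my $match_i   = "\x{3A3}" =~ /\x{3C2}/i;

print "folds equal: ", ($same_fold ? "yes" : "no"), "\n";
print "matches under /i: ", ($match_i ? "yes" : "no"), "\n";
```

Code points above U+00FF are used deliberately so that the result does not depend on whether the unicode_strings feature is in effect.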

And, the Name and Name_Aliases properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions, and through the functions charnames::viacode(), charnames::vianame(), and charnames::string_vianame() (which require a use charnames (); to be specified).
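
The name-based interfaces can be sketched as follows. The character chosen (GREEK SMALL LETTER ALPHA) and the variable names are only for illustration; use charnames ':full' is used here instead of use charnames () so that \N{...} also works on older perls.

```perl
use strict;
use warnings;
use charnames ':full';    # loads charnames; its functions are called fully qualified

my $name = "GREEK SMALL LETTER ALPHA";

my $alpha_str  = "\N{GREEK SMALL LETTER ALPHA}";       # \N{} interpolation
my $alpha_name = charnames::viacode(0x3B1);            # code point to name
my $alpha_code = charnames::vianame($name);            # name to code point
my $alpha_chr  = charnames::string_vianame($name);     # name to string

printf "U+%04X is %s\n", $alpha_code, $alpha_name;
```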

Finally, most properties related to decomposition are accessible via Unicode::Normalize.
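
For instance, a canonical decomposition can be obtained with the NFD() function of Unicode::Normalize. This is only a small sketch; the sample character (U+00E9) and the variable names are illustrative.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

my $composed   = "\x{E9}";          # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);    # "e" followed by U+0301 COMBINING ACUTE ACCENT

# List the code points of the decomposed form.
my @code_points = map { sprintf "U+%04X", ord } split //, $decomposed;
print "@code_points\n";

# Recomposing gives back the original character.
my $round_trip = NFC($decomposed);
```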

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted and, where available, work-arounds. The short names for the properties are given in parentheses. As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Indic_Matra_Category (InMC)
  • Indic_Syllabic_Category (InSC)

    Provisional

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
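
Two of the work-arounds mentioned in the list can be sketched as below: a bracketed character class combining the two Script_Extensions values in place of Katakana_Or_Hiragana, and Canonical_Combining_Class=Virama in place of Grapheme_Link. The pattern names and sample characters are illustrative only.

```perl
use strict;
use warnings;

# Stand-in for the unaccepted Script_Extensions=Katakana_Or_Hiragana:
# match anything that is in either script extension.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

# Stand-in for the deprecated Grapheme_Link.
my $virama = qr/\p{Canonical_Combining_Class=Virama}/;

# U+30A2 KATAKANA LETTER A, U+3042 HIRAGANA LETTER A,
# U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK, and ASCII "A".
for my $ch ("\x{30A2}", "\x{3042}", "\x{30FC}", "A") {
    printf "U+%04X: %s\n", ord($ch), ($ch =~ $kana ? "kana" : "not kana");
}

# U+094D DEVANAGARI SIGN VIRAMA
my $is_virama = "\x{94D}" =~ $virama;
print "U+094D is a virama\n" if $is_virama;
```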

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib}/unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib}/unicore/mktables, and then re-compiling and installing. (%Config is available from the Config module.)

Also, perl can be recompiled to operate on an earlier version of the Unicode standard. Further information is at $Config{privlib}/unicore/README.perl.
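
To see where these paths point on a given installation, and which Unicode version the running perl was built with, something like the following may help. It is a sketch only: the variable names are illustrative, and mktables and README.perl may not be present under this directory in an installed perl, as opposed to the source tree.

```perl
use strict;
use warnings;
use Config;
use Unicode::UCD ();

# Directory holding perl's compiled Unicode tables.
my $unicore_dir = "$Config{privlib}/unicore";

# Version of the Unicode standard this perl's tables were built from.
my $unicode_version = Unicode::UCD::UnicodeVersion();

print "unicore directory: $unicore_dir\n";
print "Unicode version:   $unicode_version\n";
```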

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.)
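
A brief sketch of looking up a named sequence through both modules follows. The sequence name used, KATAKANA LETTER AINU P, is assumed to be present in the NamedSequences.txt of the Unicode version in use; the variable names are illustrative.

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD 'namedseq';

my $seq_name = "KATAKANA LETTER AINU P";

# Both calls return the string made up of the sequence's code points.
my $via_charnames = charnames::string_vianame($seq_name);
my $via_ucd       = namedseq($seq_name);

# Show the code points that make up the sequence.
my @code_points = map { sprintf "U+%04X", ord } split //, $via_ucd;
print "$seq_name: @code_points\n";
```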

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation tests

  • auxiliary/LBTest.txt
  • BidiCharacterTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • Index.txt

    Alphabetical index of Unicode characters

  • IndicMatraCategory.txt
  • IndicSyllabicCategory.txt

    Provisional; for the analysis and processing of Indic scripts

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.html

    Describes the format and contents of NamesList.txt

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • Props.txt

    Only in very early releases; is a subset of PropList.txt (which is used instead)

  • ReadMe.txt

    Documentation

  • StandardizedVariants.html

    Provides a visual display of the standard variant sequences derived from StandardizedVariants.txt.

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base at http://www.unicode.org/ivd

  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

  • USourceGlyphs.pdf

    Pictures of the characters in USourceData.txt

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode