Modules

  • ABCDE
  • FGHIL
  • MNOPS
  • TUX

Tools

perluniprops

Perl 5 version 20.0 documentation
Recently read

perluniprops

NAME

perluniprops - Index of Unicode Version 6.3.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is the portion that is defined on single code points. (Other information in the Unicode data base below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See Unicode character properties that are NOT accepted by Perl.)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section Properties accessible through Unicode::UCD.

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see Other Properties in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against, for example, \p{Script: Greek} and \p{Script=Greek} both mean to match characters whose Script property value is Greek.

Single forms, like \p{Greek} , are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is a just a shortcut for \p{Script=Greek} .) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word} . These are also listed in the table.

In parsing these constructs, Perl always ignores Upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek} . But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{} ) to "doesn't match" (for \P{} ). Casing in this document is for improved legibility.

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But in the table below a 'T' at the beginning of an entry means that tighter (stricter) rules are used for that entry:

Some properties are considered obsolete by Unicode, but still available. There are several varieties of obsolescence:

Matches in the Block property have shortcuts that begin with "In_". For example, \p{Block=Latin1} can be written as \p{In_Latin1} . For backward compatibility, if there is no conflict with another shortcut, these may also be written as \p{Latin1} or \p{Is_Latin1} . But, N.B., there are numerous such conflicting shortcuts. Use of these forms for Block is discouraged, and are flagged as such, not only because of the potential confusion as to what is meant, but also because a later release of Unicode may preempt the shortcut, and your program would no longer be correct. Use the "In_" form instead to avoid this, or even more clearly, use the compound form, e.g., \p{blk:latin1} . See Blocks in perlunicode for more information about this.

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. The table shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., \pL ), the left column contains both forms, with and without braces.

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation (/i= other_property) in the second column. Under case-insensitive matching they match the same code pode points as the property other_property.

There is no description given for most non-Perl defined properties (See http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  1. \p{Gc: *} \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  1. \p{Is_*} \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar} . "*" here is restricted to something not beginning with an underscore.

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary, and they have all their values spelled out without using this wild card, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card

  • (\d+) in the info column gives the number of Unicode code points matched by this property.

  • D means this is deprecated.

  • O means this is obsolete.

  • S means this is stabilized.

  • T means tighter (stricter) name matching applies.

  • X means use of this form is discouraged, and may not be stable.

  1. NAME INFO
  2. X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
  3. T \p{Age: 1.1} \p{Age=V1_1} (33_979)
  4. T \p{Age: 2.0} \p{Age=V2_0} (144_521)
  5. T \p{Age: 2.1} \p{Age=V2_1} (2)
  6. T \p{Age: 3.0} \p{Age=V3_0} (10_307)
  7. T \p{Age: 3.1} \p{Age=V3_1} (44_978)
  8. T \p{Age: 3.2} \p{Age=V3_2} (1016)
  9. T \p{Age: 4.0} \p{Age=V4_0} (1226)
  10. T \p{Age: 4.1} \p{Age=V4_1} (1273)
  11. T \p{Age: 5.0} \p{Age=V5_0} (1369)
  12. T \p{Age: 5.1} \p{Age=V5_1} (1624)
  13. T \p{Age: 5.2} \p{Age=V5_2} (6648)
  14. T \p{Age: 6.0} \p{Age=V6_0} (2088)
  15. T \p{Age: 6.1} \p{Age=V6_1} (732)
  16. T \p{Age: 6.2} \p{Age=V6_2} (1)
  17. T \p{Age: 6.3} \p{Age=V6_3} (5)
  18. \p{Age: NA} \p{Age=Unassigned} (864_343 plus all
  19. above-Unicode code points)
  20. \p{Age: Unassigned} Code point's usage has not been assigned
  21. in any Unicode release thus far. (Short:
  22. \p{Age=NA}) (864_343 plus all above-
  23. Unicode code points)
  24. \p{Age: V1_1} Code point's usage introduced in version
  25. 1.1 (33_979)
  26. \p{Age: V2_0} Code point's usage was introduced in
  27. version 2.0; See also Property
  28. 'Present_In' (144_521)
  29. \p{Age: V2_1} Code point's usage was introduced in
  30. version 2.1; See also Property
  31. 'Present_In' (2)
  32. \p{Age: V3_0} Code point's usage was introduced in
  33. version 3.0; See also Property
  34. 'Present_In' (10_307)
  35. \p{Age: V3_1} Code point's usage was introduced in
  36. version 3.1; See also Property
  37. 'Present_In' (44_978)
  38. \p{Age: V3_2} Code point's usage was introduced in
  39. version 3.2; See also Property
  40. 'Present_In' (1016)
  41. \p{Age: V4_0} Code point's usage was introduced in
  42. version 4.0; See also Property
  43. 'Present_In' (1226)
  44. \p{Age: V4_1} Code point's usage was introduced in
  45. version 4.1; See also Property
  46. 'Present_In' (1273)
  47. \p{Age: V5_0} Code point's usage was introduced in
  48. version 5.0; See also Property
  49. 'Present_In' (1369)
  50. \p{Age: V5_1} Code point's usage was introduced in
  51. version 5.1; See also Property
  52. 'Present_In' (1624)
  53. \p{Age: V5_2} Code point's usage was introduced in
  54. version 5.2; See also Property
  55. 'Present_In' (6648)
  56. \p{Age: V6_0} Code point's usage was introduced in
  57. version 6.0; See also Property
  58. 'Present_In' (2088)
  59. \p{Age: V6_1} Code point's usage was introduced in
  60. version 6.1; See also Property
  61. 'Present_In' (732)
  62. \p{Age: V6_2} Code point's usage was introduced in
  63. version 6.2; See also Property
  64. 'Present_In' (1)
  65. \p{Age: V6_3} Code point's usage was introduced in
  66. version 6.3; See also Property
  67. 'Present_In' (5)
  68. \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  69. (22)
  70. \p{AHex: *} \p{ASCII_Hex_Digit: *}
  71. X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
  72. Alchemical_Symbols}) (128)
  73. X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
  74. \p{InAlchemical}) (128)
  75. \p{All} All code points, including those above
  76. Unicode. Same as qr/./s (1_114_112 plus
  77. all above-Unicode code points)
  78. \p{Alnum} Alphabetic and (decimal) Numeric (102_619)
  79. \p{Alpha} \p{Alphabetic=Y} (102_159)
  80. \p{Alpha: *} \p{Alphabetic: *}
  81. \p{Alphabetic} \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
  82. \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (1_011_953
  83. plus all above-Unicode code points)
  84. \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (102_159)
  85. X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
  86. \p{Block=Alphabetic_Presentation_Forms})
  87. (80)
  88. X \p{Alphabetic_Presentation_Forms} \p{Block=
  89. Alphabetic_Presentation_Forms} (Short:
  90. \p{InAlphabeticPF}) (80)
  91. X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
  92. \p{Block=
  93. Ancient_Greek_Musical_Notation}) (80)
  94. X \p{Ancient_Greek_Musical_Notation} \p{Block=
  95. Ancient_Greek_Musical_Notation} (Short:
  96. \p{InAncientGreekMusic}) (80)
  97. X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
  98. X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
  99. \p{Any} All Unicode code points: [\x{0000}-
  100. \x{10FFFF}] (1_114_112)
  101. \p{Arab} \p{Arabic} (= \p{Script=Arabic}) (NOT
  102. \p{Block=Arabic}) (1236)
  103. \p{Arabic} \p{Script=Arabic} (Short: \p{Arab}; NOT
  104. \p{Block=Arabic}) (1236)
  105. X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
  106. Arabic_Extended_A}) (96)
  107. X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
  108. \p{InArabicExtA}) (96)
  109. X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
  110. (= \p{Block=
  111. Arabic_Mathematical_Alphabetic_Symbols})
  112. (256)
  113. X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
  114. Arabic_Mathematical_Alphabetic_Symbols}
  115. (Short: \p{InArabicMath}) (256)
  116. X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
  117. \p{Block=Arabic_Presentation_Forms_A})
  118. (688)
  119. X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
  120. \p{Block=Arabic_Presentation_Forms_B})
  121. (144)
  122. X \p{Arabic_Presentation_Forms_A} \p{Block=
  123. Arabic_Presentation_Forms_A} (Short:
  124. \p{InArabicPFA}) (688)
  125. X \p{Arabic_Presentation_Forms_B} \p{Block=
  126. Arabic_Presentation_Forms_B} (Short:
  127. \p{InArabicPFB}) (144)
  128. X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
  129. Arabic_Supplement}) (48)
  130. X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
  131. \p{InArabicSup}) (48)
  132. \p{Armenian} \p{Script=Armenian} (Short: \p{Armn}; NOT
  133. \p{Block=Armenian}) (91)
  134. \p{Armi} \p{Imperial_Aramaic} (= \p{Script=
  135. Imperial_Aramaic}) (NOT \p{Block=
  136. Imperial_Aramaic}) (31)
  137. \p{Armn} \p{Armenian} (= \p{Script=Armenian}) (NOT
  138. \p{Block=Armenian}) (91)
  139. X \p{Arrows} \p{Block=Arrows} (112)
  140. \p{ASCII} \p{Block=Basic_Latin} [[:ASCII:]] (128)
  141. \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  142. (22)
  143. \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
  144. plus all above-Unicode code points)
  145. \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
  146. \p{Assigned} All assigned code points (249_703)
  147. \p{Avestan} \p{Script=Avestan} (Short: \p{Avst}; NOT
  148. \p{Block=Avestan}) (61)
  149. \p{Avst} \p{Avestan} (= \p{Script=Avestan}) (NOT
  150. \p{Block=Avestan}) (61)
  151. \p{Bali} \p{Balinese} (= \p{Script=Balinese}) (NOT
  152. \p{Block=Balinese}) (121)
  153. \p{Balinese} \p{Script=Balinese} (Short: \p{Bali}; NOT
  154. \p{Block=Balinese}) (121)
  155. \p{Bamu} \p{Bamum} (= \p{Script=Bamum}) (NOT
  156. \p{Block=Bamum}) (657)
  157. \p{Bamum} \p{Script=Bamum} (Short: \p{Bamu}; NOT
  158. \p{Block=Bamum}) (657)
  159. X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
  160. Bamum_Supplement}) (576)
  161. X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
  162. \p{InBamumSup}) (576)
  163. X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
  164. \p{Batak} \p{Script=Batak} (Short: \p{Batk}; NOT
  165. \p{Block=Batak}) (56)
  166. \p{Batk} \p{Batak} (= \p{Script=Batak}) (NOT
  167. \p{Block=Batak}) (56)
  168. \p{Bc: *} \p{Bidi_Class: *}
  169. \p{Beng} \p{Bengali} (= \p{Script=Bengali}) (NOT
  170. \p{Block=Bengali}) (92)
  171. \p{Bengali} \p{Script=Bengali} (Short: \p{Beng}; NOT
  172. \p{Block=Bengali}) (92)
  173. \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
  174. (12)
  175. \p{Bidi_C: *} \p{Bidi_Control: *}
  176. \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1438)
  177. \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (49)
  178. \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1438)
  179. \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (49)
  180. \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
  181. \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4012)
  182. \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4012)
  183. \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
  184. \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
  185. \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (131)
  186. \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
  187. \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (87)
  188. \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (131)
  189. \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
  190. \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87)
  191. \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1)
  192. \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
  193. \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_098_508
  194. plus all above-Unicode code points)
  195. \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_098_508 plus
  196. all above-Unicode code points)
  197. \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
  198. \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1)
  199. \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
  200. \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
  201. \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
  202. \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
  203. \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1291)
  204. \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1291)
  205. \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (4447)
  206. \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (4447)
  207. \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
  208. \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
  209. \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
  210. \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
  211. \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1)
  212. \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (4086)
  213. \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4086)
  214. \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
  215. \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1)
  216. \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
  217. \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
  218. \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
  219. \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
  220. \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
  221. \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
  222. \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17)
  223. \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
  224. \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
  225. \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
  226. plus all above-Unicode code points)
  227. \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12)
  228. \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
  229. (545)
  230. \p{Bidi_M: *} \p{Bidi_Mirrored: *}
  231. \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
  232. (545)
  233. \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
  234. plus all above-Unicode code points)
  235. \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545)
  236. \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
  237. (60)
  238. \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60)
  239. \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
  240. (1_113_992 plus all above-Unicode code
  241. points)
  242. \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
  243. plus all above-Unicode code points)
  244. \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
  245. (60)
  246. \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60)
  247. \p{Blank} \h, Horizontal white space (18)
  248. \p{Blk: *} \p{Block: *}
  249. \p{Block: Aegean_Numbers} (Single: \p{InAegeanNumbers}) (64)
  250. \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
  251. \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical},
  252. \p{InAlchemical}) (128)
  253. \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
  254. (80)
  255. \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
  256. AlphabeticPF}, \p{InAlphabeticPF}) (80)
  257. \p{Block: Ancient_Greek_Music} \p{Block=
  258. Ancient_Greek_Musical_Notation} (80)
  259. \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
  260. AncientGreekMusic},
  261. \p{InAncientGreekMusic}) (80)
  262. \p{Block: Ancient_Greek_Numbers} (Single:
  263. \p{InAncientGreekNumbers}) (80)
  264. \p{Block: Ancient_Symbols} (Single: \p{InAncientSymbols}) (64)
  265. \p{Block: Arabic} (Single: \p{InArabic}; NOT \p{Arabic} NOR
  266. \p{Is_Arabic}) (256)
  267. \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
  268. \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA},
  269. \p{InArabicExtA}) (96)
  270. \p{Block: Arabic_Math} \p{Block=
  271. Arabic_Mathematical_Alphabetic_Symbols}
  272. (256)
  273. \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
  274. ArabicMath}, \p{InArabicMath}) (256)
  275. \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
  276. \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
  277. \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA},
  278. \p{InArabicPFA}) (688)
  279. \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB},
  280. \p{InArabicPFB}) (144)
  281. \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
  282. \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup},
  283. \p{InArabicSup}) (48)
  284. \p{Block: Armenian} (Single: \p{InArmenian}; NOT \p{Armenian}
  285. NOR \p{Is_Armenian}) (96)
  286. \p{Block: Arrows} (Single: \p{InArrows}) (112)
  287. \p{Block: ASCII} \p{Block=Basic_Latin} (128)
  288. \p{Block: Avestan} (Single: \p{InAvestan}; NOT \p{Avestan}
  289. NOR \p{Is_Avestan}) (64)
  290. \p{Block: Balinese} (Single: \p{InBalinese}; NOT \p{Balinese}
  291. NOR \p{Is_Balinese}) (128)
  292. \p{Block: Bamum} (Single: \p{InBamum}; NOT \p{Bamum} NOR
  293. \p{Is_Bamum}) (96)
  294. \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
  295. \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup},
  296. \p{InBamumSup}) (576)
  297. \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
  298. \p{Block: Batak} (Single: \p{InBatak}; NOT \p{Batak} NOR
  299. \p{Is_Batak}) (64)
  300. \p{Block: Bengali} (Single: \p{InBengali}; NOT \p{Bengali}
  301. NOR \p{Is_Bengali}) (128)
  302. \p{Block: Block_Elements} (Single: \p{InBlockElements}) (32)
  303. \p{Block: Bopomofo} (Single: \p{InBopomofo}; NOT \p{Bopomofo}
  304. NOR \p{Is_Bopomofo}) (48)
  305. \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
  306. \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt},
  307. \p{InBopomofoExt}) (32)
  308. \p{Block: Box_Drawing} (Single: \p{InBoxDrawing}) (128)
  309. \p{Block: Brahmi} (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR
  310. \p{Is_Brahmi}) (128)
  311. \p{Block: Braille} \p{Block=Braille_Patterns} (256)
  312. \p{Block: Braille_Patterns} (Short: \p{Blk=Braille},
  313. \p{InBraille}) (256)
  314. \p{Block: Buginese} (Single: \p{InBuginese}; NOT \p{Buginese}
  315. NOR \p{Is_Buginese}) (32)
  316. \p{Block: Buhid} (Single: \p{InBuhid}; NOT \p{Buhid} NOR
  317. \p{Is_Buhid}) (32)
  318. \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
  319. (256)
  320. \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
  321. ByzantineMusic}, \p{InByzantineMusic})
  322. (256)
  323. \p{Block: Canadian_Syllabics} \p{Block=
  324. Unified_Canadian_Aboriginal_Syllabics}
  325. (640)
  326. \p{Block: Carian} (Single: \p{InCarian}; NOT \p{Carian} NOR
  327. \p{Is_Carian}) (64)
  328. \p{Block: Chakma} (Single: \p{InChakma}; NOT \p{Chakma} NOR
  329. \p{Is_Chakma}) (80)
  330. \p{Block: Cham} (Single: \p{InCham}; NOT \p{Cham} NOR
  331. \p{Is_Cham}) (96)
  332. \p{Block: Cherokee} (Single: \p{InCherokee}; NOT \p{Cherokee}
  333. NOR \p{Is_Cherokee}) (96)
  334. \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
  335. \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
  336. \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
  337. \p{Block: CJK_Compat_Ideographs} \p{Block=
  338. CJK_Compatibility_Ideographs} (512)
  339. \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
  340. CJK_Compatibility_Ideographs_Supplement}
  341. (544)
  342. \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat},
  343. \p{InCJKCompat}) (256)
  344. \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms},
  345. \p{InCJKCompatForms}) (32)
  346. \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
  347. CJKCompatIdeographs},
  348. \p{InCJKCompatIdeographs}) (512)
  349. \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
  350. CJKCompatIdeographsSup},
  351. \p{InCJKCompatIdeographsSup}) (544)
  352. \p{Block: CJK_Ext_A} \p{Block=
  353. CJK_Unified_Ideographs_Extension_A}
  354. (6592)
  355. \p{Block: CJK_Ext_B} \p{Block=
  356. CJK_Unified_Ideographs_Extension_B}
  357. (42_720)
  358. \p{Block: CJK_Ext_C} \p{Block=
  359. CJK_Unified_Ideographs_Extension_C}
  360. (4160)
  361. \p{Block: CJK_Ext_D} \p{Block=
  362. CJK_Unified_Ideographs_Extension_D} (224)
  363. \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
  364. \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup},
  365. \p{InCJKRadicalsSup}) (128)
  366. \p{Block: CJK_Strokes} (Single: \p{InCJKStrokes}) (48)
  367. \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
  368. \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols},
  369. \p{InCJKSymbols}) (64)
  370. \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}, \p{InCJK})
  371. (20_992)
  372. \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
  373. CJKExtA}, \p{InCJKExtA}) (6592)
  374. \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
  375. CJKExtB}, \p{InCJKExtB}) (42_720)
  376. \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
  377. CJKExtC}, \p{InCJKExtC}) (4160)
  378. \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
  379. CJKExtD}, \p{InCJKExtD}) (224)
  380. \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
  381. Diacriticals}, \p{InDiacriticals}) (112)
  382. \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
  383. DiacriticalsForSymbols},
  384. \p{InDiacriticalsForSymbols}) (48)
  385. \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
  386. DiacriticalsSup}, \p{InDiacriticalsSup})
  387. (64)
  388. \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks},
  389. \p{InHalfMarks}) (16)
  390. \p{Block: Combining_Marks_For_Symbols} \p{Block=
  391. Combining_Diacritical_Marks_For_Symbols}
  392. (48)
  393. \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
  394. IndicNumberForms},
  395. \p{InIndicNumberForms}) (16)
  396. \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
  397. \p{Block: Control_Pictures} (Single: \p{InControlPictures}) (64)
  398. \p{Block: Coptic} (Single: \p{InCoptic}; NOT \p{Coptic} NOR
  399. \p{Is_Coptic}) (128)
  400. \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
  401. \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod},
  402. \p{InCountingRod}) (32)
  403. \p{Block: Cuneiform} (Single: \p{InCuneiform}; NOT
  404. \p{Cuneiform} NOR \p{Is_Cuneiform})
  405. (1024)
  406. \p{Block: Cuneiform_Numbers} \p{Block=
  407. Cuneiform_Numbers_And_Punctuation} (128)
  408. \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
  409. CuneiformNumbers},
  410. \p{InCuneiformNumbers}) (128)
  411. \p{Block: Currency_Symbols} (Single: \p{InCurrencySymbols}) (48)
  412. \p{Block: Cypriot_Syllabary} (Single: \p{InCypriotSyllabary}) (64)
  413. \p{Block: Cyrillic} (Single: \p{InCyrillic}; NOT \p{Cyrillic}
  414. NOR \p{Is_Cyrillic}) (256)
  415. \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
  416. \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
  417. \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA},
  418. \p{InCyrillicExtA}) (32)
  419. \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB},
  420. \p{InCyrillicExtB}) (96)
  421. \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
  422. \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup},
  423. \p{InCyrillicSup}) (48)
  424. \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
  425. (48)
  426. \p{Block: Deseret} (Single: \p{InDeseret}) (80)
  427. \p{Block: Devanagari} (Single: \p{InDevanagari}; NOT
  428. \p{Devanagari} NOR \p{Is_Devanagari})
  429. (128)
  430. \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
  431. \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt},
  432. \p{InDevanagariExt}) (32)
  433. \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
  434. \p{Block: Diacriticals_For_Symbols} \p{Block=
  435. Combining_Diacritical_Marks_For_Symbols}
  436. (48)
  437. \p{Block: Diacriticals_Sup} \p{Block=
  438. Combining_Diacritical_Marks_Supplement}
  439. (64)
  440. \p{Block: Dingbats} (Single: \p{InDingbats}) (192)
  441. \p{Block: Domino} \p{Block=Domino_Tiles} (112)
  442. \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}, \p{InDomino}) (112)
  443. \p{Block: Egyptian_Hieroglyphs} (Single:
  444. \p{InEgyptianHieroglyphs}; NOT
  445. \p{Egyptian_Hieroglyphs} NOR
  446. \p{Is_Egyptian_Hieroglyphs}) (1072)
  447. \p{Block: Emoticons} (Single: \p{InEmoticons}) (80)
  448. \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
  449. \p{Block: Enclosed_Alphanum_Sup} \p{Block=
  450. Enclosed_Alphanumeric_Supplement} (256)
  451. \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
  452. EnclosedAlphanumSup},
  453. \p{InEnclosedAlphanumSup}) (256)
  454. \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
  455. EnclosedAlphanum},
  456. \p{InEnclosedAlphanum}) (160)
  457. \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
  458. (256)
  459. \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
  460. EnclosedCJK}, \p{InEnclosedCJK}) (256)
  461. \p{Block: Enclosed_Ideographic_Sup} \p{Block=
  462. Enclosed_Ideographic_Supplement} (256)
  463. \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
  464. EnclosedIdeographicSup},
  465. \p{InEnclosedIdeographicSup}) (256)
  466. \p{Block: Ethiopic} (Single: \p{InEthiopic}; NOT \p{Ethiopic}
  467. NOR \p{Is_Ethiopic}) (384)
  468. \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
  469. \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
  470. \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt},
  471. \p{InEthiopicExt}) (96)
  472. \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA},
  473. \p{InEthiopicExtA}) (48)
  474. \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
  475. \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup},
  476. \p{InEthiopicSup}) (32)
  477. \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation},
  478. \p{InPunctuation}; NOT \p{Punct} NOR
  479. \p{Is_Punctuation}) (112)
  480. \p{Block: Geometric_Shapes} (Single: \p{InGeometricShapes}) (96)
  481. \p{Block: Georgian} (Single: \p{InGeorgian}; NOT \p{Georgian}
  482. NOR \p{Is_Georgian}) (96)
  483. \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
  484. \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup},
  485. \p{InGeorgianSup}) (48)
  486. \p{Block: Glagolitic} (Single: \p{InGlagolitic}; NOT
  487. \p{Glagolitic} NOR \p{Is_Glagolitic})
  488. (96)
  489. \p{Block: Gothic} (Single: \p{InGothic}; NOT \p{Gothic} NOR
  490. \p{Is_Gothic}) (32)
  491. \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
  492. NOR \p{Is_Greek}) (144)
  493. \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}, \p{InGreek};
  494. NOT \p{Greek} NOR \p{Is_Greek}) (144)
  495. \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
  496. \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt},
  497. \p{InGreekExt}) (256)
  498. \p{Block: Gujarati} (Single: \p{InGujarati}; NOT \p{Gujarati}
  499. NOR \p{Is_Gujarati}) (128)
  500. \p{Block: Gurmukhi} (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
  501. NOR \p{Is_Gurmukhi}) (128)
  502. \p{Block: Half_And_Full_Forms} \p{Block=
  503. Halfwidth_And_Fullwidth_Forms} (240)
  504. \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
  505. \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
  506. HalfAndFullForms},
  507. \p{InHalfAndFullForms}) (240)
  508. \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
  509. NOR \p{Is_Hangul}) (11_184)
  510. \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo},
  511. \p{InCompatJamo}) (96)
  512. \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}, \p{InJamo}) (256)
  513. \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA},
  514. \p{InJamoExtA}) (32)
  515. \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB},
  516. \p{InJamoExtB}) (80)
  517. \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}, \p{InHangul};
  518. NOT \p{Hangul} NOR \p{Is_Hangul})
  519. (11_184)
  520. \p{Block: Hanunoo} (Single: \p{InHanunoo}; NOT \p{Hanunoo}
  521. NOR \p{Is_Hanunoo}) (32)
  522. \p{Block: Hebrew} (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
  523. \p{Is_Hebrew}) (112)
  524. \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
  525. HighPUSurrogates},
  526. \p{InHighPUSurrogates}) (128)
  527. \p{Block: High_PU_Surrogates} \p{Block=
  528. High_Private_Use_Surrogates} (128)
  529. \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
  530. \p{Block: Hiragana} (Single: \p{InHiragana}; NOT \p{Hiragana}
  531. NOR \p{Is_Hiragana}) (96)
  532. \p{Block: IDC} \p{Block=
  533. Ideographic_Description_Characters} (NOT
  534. \p{ID_Continue} NOR \p{Is_IDC}) (16)
  535. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC},
  536. \p{InIDC}; NOT \p{ID_Continue} NOR
  537. \p{Is_IDC}) (16)
  538. \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
  539. \p{Imperial_Aramaic} NOR
  540. \p{Is_Imperial_Aramaic}) (32)
  541. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  542. (16)
  543. \p{Block: Inscriptional_Pahlavi} (Single:
  544. \p{InInscriptionalPahlavi}; NOT
  545. \p{Inscriptional_Pahlavi} NOR
  546. \p{Is_Inscriptional_Pahlavi}) (32)
  547. \p{Block: Inscriptional_Parthian} (Single:
  548. \p{InInscriptionalParthian}; NOT
  549. \p{Inscriptional_Parthian} NOR
  550. \p{Is_Inscriptional_Parthian}) (32)
  551. \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  552. \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}, \p{InIPAExt})
  553. (96)
  554. \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  555. \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  556. \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  557. \p{Block: Javanese} (Single: \p{InJavanese}; NOT \p{Javanese}
  558. NOR \p{Is_Javanese}) (96)
  559. \p{Block: Kaithi} (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
  560. \p{Is_Kaithi}) (80)
  561. \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  562. \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}, \p{InKanaSup})
  563. (256)
  564. \p{Block: Kanbun} (Single: \p{InKanbun}) (16)
  565. \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  566. \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}, \p{InKangxi})
  567. (224)
  568. \p{Block: Kannada} (Single: \p{InKannada}; NOT \p{Kannada}
  569. NOR \p{Is_Kannada}) (128)
  570. \p{Block: Katakana} (Single: \p{InKatakana}; NOT \p{Katakana}
  571. NOR \p{Is_Katakana}) (96)
  572. \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  573. \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
  574. KatakanaExt}, \p{InKatakanaExt}) (16)
  575. \p{Block: Kayah_Li} (Single: \p{InKayahLi}) (48)
  576. \p{Block: Kharoshthi} (Single: \p{InKharoshthi}; NOT
  577. \p{Kharoshthi} NOR \p{Is_Kharoshthi})
  578. (96)
  579. \p{Block: Khmer} (Single: \p{InKhmer}; NOT \p{Khmer} NOR
  580. \p{Is_Khmer}) (128)
  581. \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
  582. \p{Block: Lao} (Single: \p{InLao}; NOT \p{Lao} NOR
  583. \p{Is_Lao}) (128)
  584. \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  585. \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  586. \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
  587. \p{InLatin1}) (128)
  588. \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  589. \p{Block: Latin_Ext_Additional} \p{Block=
  590. Latin_Extended_Additional} (256)
  591. \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  592. \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  593. \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  594. \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA},
  595. \p{InLatinExtA}) (128)
  596. \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
  597. LatinExtAdditional},
  598. \p{InLatinExtAdditional}) (256)
  599. \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB},
  600. \p{InLatinExtB}) (208)
  601. \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC},
  602. \p{InLatinExtC}) (32)
  603. \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD},
  604. \p{InLatinExtD}) (224)
  605. \p{Block: Lepcha} (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
  606. \p{Is_Lepcha}) (80)
  607. \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
  608. (80)
  609. \p{Block: Limbu} (Single: \p{InLimbu}; NOT \p{Limbu} NOR
  610. \p{Is_Limbu}) (80)
  611. \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
  612. (128)
  613. \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
  614. (128)
  615. \p{Block: Lisu} (Single: \p{InLisu}) (48)
  616. \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
  617. \p{Block: Lycian} (Single: \p{InLycian}; NOT \p{Lycian} NOR
  618. \p{Is_Lycian}) (32)
  619. \p{Block: Lydian} (Single: \p{InLydian}; NOT \p{Lydian} NOR
  620. \p{Is_Lydian}) (32)
  621. \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  622. \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}, \p{InMahjong})
  623. (48)
  624. \p{Block: Malayalam} (Single: \p{InMalayalam}; NOT
  625. \p{Malayalam} NOR \p{Is_Malayalam}) (128)
  626. \p{Block: Mandaic} (Single: \p{InMandaic}; NOT \p{Mandaic}
  627. NOR \p{Is_Mandaic}) (32)
  628. \p{Block: Math_Alphanum} \p{Block=
  629. Mathematical_Alphanumeric_Symbols} (1024)
  630. \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  631. \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
  632. MathAlphanum}, \p{InMathAlphanum}) (1024)
  633. \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators},
  634. \p{InMathOperators}) (256)
  635. \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
  636. \p{Meetei_Mayek} NOR
  637. \p{Is_Meetei_Mayek}) (64)
  638. \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  639. \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt},
  640. \p{InMeeteiMayekExt}) (32)
  641. \p{Block: Meroitic_Cursive} (Single: \p{InMeroiticCursive}; NOT
  642. \p{Meroitic_Cursive} NOR
  643. \p{Is_Meroitic_Cursive}) (96)
  644. \p{Block: Meroitic_Hieroglyphs} (Single:
  645. \p{InMeroiticHieroglyphs}) (32)
  646. \p{Block: Miao} (Single: \p{InMiao}; NOT \p{Miao} NOR
  647. \p{Is_Miao}) (160)
  648. \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
  649. (256)
  650. \p{Block: Misc_Math_Symbols_A} \p{Block=
  651. Miscellaneous_Mathematical_Symbols_A}
  652. (48)
  653. \p{Block: Misc_Math_Symbols_B} \p{Block=
  654. Miscellaneous_Mathematical_Symbols_B}
  655. (128)
  656. \p{Block: Misc_Pictographs} \p{Block=
  657. Miscellaneous_Symbols_And_Pictographs}
  658. (768)
  659. \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  660. \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  661. \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
  662. MiscMathSymbolsA},
  663. \p{InMiscMathSymbolsA}) (48)
  664. \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
  665. MiscMathSymbolsB},
  666. \p{InMiscMathSymbolsB}) (128)
  667. \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols},
  668. \p{InMiscSymbols}) (256)
  669. \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
  670. MiscArrows}, \p{InMiscArrows}) (256)
  671. \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
  672. MiscPictographs}, \p{InMiscPictographs})
  673. (768)
  674. \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical},
  675. \p{InMiscTechnical}) (256)
  676. \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  677. \p{Block: Modifier_Tone_Letters} (Single:
  678. \p{InModifierToneLetters}) (32)
  679. \p{Block: Mongolian} (Single: \p{InMongolian}; NOT
  680. \p{Mongolian} NOR \p{Is_Mongolian}) (176)
  681. \p{Block: Music} \p{Block=Musical_Symbols} (256)
  682. \p{Block: Musical_Symbols} (Short: \p{Blk=Music}, \p{InMusic})
  683. (256)
  684. \p{Block: Myanmar} (Single: \p{InMyanmar}; NOT \p{Myanmar}
  685. NOR \p{Is_Myanmar}) (160)
  686. \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  687. \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA},
  688. \p{InMyanmarExtA}) (32)
  689. \p{Block: NB} \p{Block=No_Block} (860_672 plus all
  690. above-Unicode code points)
  691. \p{Block: New_Tai_Lue} (Single: \p{InNewTaiLue}; NOT
  692. \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
  693. (96)
  694. \p{Block: NKo} (Single: \p{InNKo}; NOT \p{Nko} NOR
  695. \p{Is_NKo}) (64)
  696. \p{Block: No_Block} (Short: \p{Blk=NB}, \p{InNB}) (860_672
  697. plus all above-Unicode code points)
  698. \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
  699. \p{Block: OCR} \p{Block=Optical_Character_Recognition}
  700. (32)
  701. \p{Block: Ogham} (Single: \p{InOgham}; NOT \p{Ogham} NOR
  702. \p{Is_Ogham}) (32)
  703. \p{Block: Ol_Chiki} (Single: \p{InOlChiki}) (48)
  704. \p{Block: Old_Italic} (Single: \p{InOldItalic}; NOT
  705. \p{Old_Italic} NOR \p{Is_Old_Italic})
  706. (48)
  707. \p{Block: Old_Persian} (Single: \p{InOldPersian}; NOT
  708. \p{Old_Persian} NOR \p{Is_Old_Persian})
  709. (64)
  710. \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
  711. \p{Block: Old_Turkic} (Single: \p{InOldTurkic}; NOT
  712. \p{Old_Turkic} NOR \p{Is_Old_Turkic})
  713. (80)
  714. \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR},
  715. \p{InOCR}) (32)
  716. \p{Block: Oriya} (Single: \p{InOriya}; NOT \p{Oriya} NOR
  717. \p{Is_Oriya}) (128)
  718. \p{Block: Osmanya} (Single: \p{InOsmanya}; NOT \p{Osmanya}
  719. NOR \p{Is_Osmanya}) (48)
  720. \p{Block: Phags_Pa} (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
  721. NOR \p{Is_Phags_Pa}) (64)
  722. \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  723. \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}, \p{InPhaistos})
  724. (48)
  725. \p{Block: Phoenician} (Single: \p{InPhoenician}; NOT
  726. \p{Phoenician} NOR \p{Is_Phoenician})
  727. (32)
  728. \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  729. \p{Block: Phonetic_Ext_Sup} \p{Block=
  730. Phonetic_Extensions_Supplement} (64)
  731. \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt},
  732. \p{InPhoneticExt}) (128)
  733. \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
  734. PhoneticExtSup}, \p{InPhoneticExtSup})
  735. (64)
  736. \p{Block: Playing_Cards} (Single: \p{InPlayingCards}) (96)
  737. \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
  738. \p{Private_Use} NOR \p{Is_Private_Use})
  739. (6400)
  740. \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}, \p{InPUA}; NOT
  741. \p{Private_Use} NOR \p{Is_Private_Use})
  742. (6400)
  743. \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
  744. \p{Private_Use} NOR \p{Is_Private_Use})
  745. (6400)
  746. \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
  747. \p{Punct} NOR \p{Is_Punctuation}) (112)
  748. \p{Block: Rejang} (Single: \p{InRejang}; NOT \p{Rejang} NOR
  749. \p{Is_Rejang}) (48)
  750. \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  751. \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}, \p{InRumi})
  752. (32)
  753. \p{Block: Runic} (Single: \p{InRunic}; NOT \p{Runic} NOR
  754. \p{Is_Runic}) (96)
  755. \p{Block: Samaritan} (Single: \p{InSamaritan}; NOT
  756. \p{Samaritan} NOR \p{Is_Samaritan}) (64)
  757. \p{Block: Saurashtra} (Single: \p{InSaurashtra}; NOT
  758. \p{Saurashtra} NOR \p{Is_Saurashtra})
  759. (96)
  760. \p{Block: Sharada} (Single: \p{InSharada}; NOT \p{Sharada}
  761. NOR \p{Is_Sharada}) (96)
  762. \p{Block: Shavian} (Single: \p{InShavian}) (48)
  763. \p{Block: Sinhala} (Single: \p{InSinhala}; NOT \p{Sinhala}
  764. NOR \p{Is_Sinhala}) (128)
  765. \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms},
  766. \p{InSmallForms}) (32)
  767. \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  768. \p{Block: Sora_Sompeng} (Single: \p{InSoraSompeng}; NOT
  769. \p{Sora_Sompeng} NOR
  770. \p{Is_Sora_Sompeng}) (48)
  771. \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
  772. ModifierLetters}, \p{InModifierLetters})
  773. (80)
  774. \p{Block: Specials} (Single: \p{InSpecials}) (16)
  775. \p{Block: Sundanese} (Single: \p{InSundanese}; NOT
  776. \p{Sundanese} NOR \p{Is_Sundanese}) (64)
  777. \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  778. \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup},
  779. \p{InSundaneseSup}) (16)
  780. \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  781. \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  782. \p{Block: Sup_Math_Operators} \p{Block=
  783. Supplemental_Mathematical_Operators}
  784. (256)
  785. \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
  786. (65_536)
  787. \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
  788. (65_536)
  789. \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  790. \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  791. \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
  792. SuperAndSub}, \p{InSuperAndSub}) (48)
  793. \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA},
  794. \p{InSupArrowsA}) (16)
  795. \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB},
  796. \p{InSupArrowsB}) (128)
  797. \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
  798. SupMathOperators},
  799. \p{InSupMathOperators}) (256)
  800. \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
  801. SupPunctuation}, \p{InSupPunctuation})
  802. (128)
  803. \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
  804. SupPUAA}, \p{InSupPUAA}) (65_536)
  805. \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
  806. SupPUAB}, \p{InSupPUAB}) (65_536)
  807. \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
  808. \p{Syloti_Nagri} NOR
  809. \p{Is_Syloti_Nagri}) (48)
  810. \p{Block: Syriac} (Single: \p{InSyriac}; NOT \p{Syriac} NOR
  811. \p{Is_Syriac}) (80)
  812. \p{Block: Tagalog} (Single: \p{InTagalog}; NOT \p{Tagalog}
  813. NOR \p{Is_Tagalog}) (32)
  814. \p{Block: Tagbanwa} (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
  815. NOR \p{Is_Tagbanwa}) (32)
  816. \p{Block: Tags} (Single: \p{InTags}) (128)
  817. \p{Block: Tai_Le} (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
  818. \p{Is_Tai_Le}) (48)
  819. \p{Block: Tai_Tham} (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
  820. NOR \p{Is_Tai_Tham}) (144)
  821. \p{Block: Tai_Viet} (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
  822. NOR \p{Is_Tai_Viet}) (96)
  823. \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  824. \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing},
  825. \p{InTaiXuanJing}) (96)
  826. \p{Block: Takri} (Single: \p{InTakri}; NOT \p{Takri} NOR
  827. \p{Is_Takri}) (80)
  828. \p{Block: Tamil} (Single: \p{InTamil}; NOT \p{Tamil} NOR
  829. \p{Is_Tamil}) (128)
  830. \p{Block: Telugu} (Single: \p{InTelugu}; NOT \p{Telugu} NOR
  831. \p{Is_Telugu}) (128)
  832. \p{Block: Thaana} (Single: \p{InThaana}; NOT \p{Thaana} NOR
  833. \p{Is_Thaana}) (64)
  834. \p{Block: Thai} (Single: \p{InThai}; NOT \p{Thai} NOR
  835. \p{Is_Thai}) (128)
  836. \p{Block: Tibetan} (Single: \p{InTibetan}; NOT \p{Tibetan}
  837. NOR \p{Is_Tibetan}) (256)
  838. \p{Block: Tifinagh} (Single: \p{InTifinagh}; NOT \p{Tifinagh}
  839. NOR \p{Is_Tifinagh}) (80)
  840. \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
  841. (128)
  842. \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
  843. TransportAndMap}, \p{InTransportAndMap})
  844. (128)
  845. \p{Block: UCAS} \p{Block=
  846. Unified_Canadian_Aboriginal_Syllabics}
  847. (640)
  848. \p{Block: UCAS_Ext} \p{Block=
  849. Unified_Canadian_Aboriginal_Syllabics_-
  850. Extended} (80)
  851. \p{Block: Ugaritic} (Single: \p{InUgaritic}; NOT \p{Ugaritic}
  852. NOR \p{Is_Ugaritic}) (32)
  853. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  854. UCAS}, \p{InUCAS}) (640)
  855. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
  856. \p{Blk=UCASExt}, \p{InUCASExt}) (80)
  857. \p{Block: Vai} (Single: \p{InVai}; NOT \p{Vai} NOR
  858. \p{Is_Vai}) (320)
  859. \p{Block: Variation_Selectors} (Short: \p{Blk=VS}, \p{InVS}; NOT
  860. \p{Variation_Selector} NOR \p{Is_VS})
  861. (16)
  862. \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup},
  863. \p{InVSSup}) (240)
  864. \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  865. \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt},
  866. \p{InVedicExt}) (48)
  867. \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
  868. \p{Block: VS} \p{Block=Variation_Selectors} (NOT
  869. \p{Variation_Selector} NOR \p{Is_VS})
  870. (16)
  871. \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
  872. (240)
  873. \p{Block: Yi_Radicals} (Single: \p{InYiRadicals}) (64)
  874. \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
  875. \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  876. \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing},
  877. \p{InYijing}) (64)
  878. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  879. \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
  880. \p{Block=Bopomofo}) (70)
  881. \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
  882. \p{Block=Bopomofo}) (70)
  883. X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
  884. Bopomofo_Extended}) (32)
  885. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
  886. \p{InBopomofoExt}) (32)
  887. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  888. \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  889. \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
  890. \p{Block=Brahmi}) (108)
  891. \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
  892. \p{Block=Brahmi}) (108)
  893. \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
  894. \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
  895. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
  896. \p{InBraille}) (256)
  897. \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
  898. \p{Block=Buginese}) (30)
  899. \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
  900. \p{Block=Buginese}) (30)
  901. \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
  902. \p{Block=Buhid}) (20)
  903. \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
  904. \p{Block=Buhid}) (20)
  905. X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
  906. Byzantine_Musical_Symbols}) (256)
  907. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  908. (Short: \p{InByzantineMusic}) (256)
  909. \p{C} \pC \p{Other} (= \p{General_Category=Other})
  910. (1_004_135 plus all above-Unicode code
  911. points)
  912. \p{Cakm} \p{Chakma} (= \p{Script=Chakma}) (NOT
  913. \p{Block=Chakma}) (67)
  914. \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
  915. \p{Cans}) (710)
  916. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  917. (= \p{Block=
  918. Unified_Canadian_Aboriginal_Syllabics})
  919. (640)
  920. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  921. Not_Reordered} (1_113_459 plus all
  922. above-Unicode code points)
  923. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  924. Overlay} (26)
  925. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  926. Nukta} (13)
  927. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  928. Kana_Voicing} (2)
  929. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  930. Virama} (37)
  931. T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
  932. CCC10} (1)
  933. T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
  934. CCC11} (1)
  935. T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
  936. CCC12} (1)
  937. T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
  938. CCC13} (1)
  939. T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
  940. CCC14} (1)
  941. T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
  942. CCC15} (1)
  943. T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
  944. CCC16} (1)
  945. T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
  946. CCC17} (1)
  947. T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
  948. CCC18} (2)
  949. T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
  950. CCC19} (2)
  951. T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
  952. CCC20} (1)
  953. T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
  954. CCC21} (1)
  955. T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
  956. CCC22} (1)
  957. T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
  958. CCC23} (1)
  959. T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
  960. CCC24} (1)
  961. T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
  962. CCC25} (1)
  963. T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
  964. CCC26} (1)
  965. T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
  966. CCC27} (2)
  967. T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
  968. CCC28} (2)
  969. T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
  970. CCC29} (2)
  971. T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
  972. CCC30} (2)
  973. T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
  974. CCC31} (2)
  975. T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
  976. CCC32} (2)
  977. T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
  978. CCC33} (1)
  979. T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
  980. CCC34} (1)
  981. T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
  982. CCC35} (1)
  983. T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
  984. CCC36} (1)
  985. T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
  986. CCC84} (1)
  987. T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
  988. CCC91} (1)
  989. T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
  990. CCC103} (2)
  991. T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
  992. CCC107} (4)
  993. T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
  994. CCC118} (2)
  995. T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
  996. CCC122} (4)
  997. T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
  998. CCC129} (1)
  999. T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
  1000. CCC130} (6)
  1001. T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
  1002. CCC132} (1)
  1003. T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
  1004. CCC133} (0)
  1005. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  1006. Attached_Below_Left} (0)
  1007. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  1008. Attached_Below} (5)
  1009. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  1010. Attached_Above} (1)
  1011. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  1012. Attached_Above_Right} (9)
  1013. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  1014. Below_Left} (1)
  1015. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  1016. Below} (129)
  1017. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  1018. Below_Right} (4)
  1019. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  1020. Left} (2)
  1021. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  1022. Right} (1)
  1023. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  1024. Above_Left} (3)
  1025. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  1026. Above} (349)
  1027. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  1028. Above_Right} (4)
  1029. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  1030. Double_Below} (4)
  1031. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  1032. Double_Above} (5)
  1033. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  1034. Iota_Subscript} (1)
  1035. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  1036. Above} (349)
  1037. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (349)
  1038. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  1039. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  1040. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  1041. Above_Left} (3)
  1042. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  1043. Above_Right} (4)
  1044. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  1045. Attached_Above} (1)
  1046. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  1047. Attached_Above_Right} (9)
  1048. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  1049. Attached_Below} (5)
  1050. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  1051. Attached_Below_Left} (0)
  1052. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  1053. (1)
  1054. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  1055. \p{Ccc=ATAR}) (9)
  1056. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  1057. (5)
  1058. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  1059. ATBL}) (0)
  1060. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  1061. Below} (129)
  1062. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (129)
  1063. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  1064. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  1065. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  1066. Below_Left} (1)
  1067. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  1068. Below_Right} (4)
  1069. \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
  1070. \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
  1071. \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
  1072. \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
  1073. \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
  1074. \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
  1075. \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
  1076. \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
  1077. \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
  1078. \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
  1079. \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
  1080. \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  1081. \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
  1082. \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
  1083. \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
  1084. \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
  1085. \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
  1086. \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
  1087. \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
  1088. \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
  1089. \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
  1090. \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
  1091. \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
  1092. \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
  1093. \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
  1094. \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
  1095. \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
  1096. \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
  1097. \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
  1098. \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
  1099. \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
  1100. \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
  1101. \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
  1102. \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
  1103. \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
  1104. \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
  1105. \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
  1106. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  1107. Double_Above} (5)
  1108. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  1109. Double_Below} (4)
  1110. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  1111. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  1112. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  1113. (1)
  1114. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  1115. Iota_Subscript} (1)
  1116. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  1117. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  1118. Kana_Voicing} (2)
  1119. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  1120. Left} (2)
  1121. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  1122. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  1123. Nukta} (13)
  1124. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  1125. (1_113_459 plus all above-Unicode code
  1126. points)
  1127. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  1128. Not_Reordered} (1_113_459 plus all
  1129. above-Unicode code points)
  1130. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (13)
  1131. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  1132. Overlay} (26)
  1133. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
  1134. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  1135. Right} (1)
  1136. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  1137. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (37)
  1138. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  1139. Virama} (37)
  1140. \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
  1141. Canadian_Aboriginal}) (710)
  1142. \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
  1143. \p{Block=Carian}) (49)
  1144. \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
  1145. \p{Block=Carian}) (49)
  1146. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (1806)
  1147. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_112_306 plus
  1148. all above-Unicode code points)
  1149. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (1806)
  1150. \p{Cased} \p{Cased=Y} (3448)
  1151. \p{Cased: N*} (Single: \P{Cased}) (1_110_664 plus all
  1152. above-Unicode code points)
  1153. \p{Cased: Y*} (Single: \p{Cased}) (3448)
  1154. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1155. \p{LC}) (3223)
  1156. \p{Category: *} \p{General_Category: *}
  1157. \p{Cc} \p{Cntrl} (= \p{General_Category=Control})
  1158. (65)
  1159. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1160. \p{CE} \p{Composition_Exclusion} (=
  1161. \p{Composition_Exclusion=Y}) (81)
  1162. \p{CE: *} \p{Composition_Exclusion: *}
  1163. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1164. (145)
  1165. \p{Chakma} \p{Script=Chakma} (Short: \p{Cakm}; NOT
  1166. \p{Block=Chakma}) (67)
  1167. \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
  1168. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1169. \p{CWCF}) (1107)
  1170. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1171. (1_113_005 plus all above-Unicode code
  1172. points)
  1173. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1174. (1107)
  1175. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1176. \p{CWCM}) (2138)
  1177. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1178. (1_111_974 plus all above-Unicode code
  1179. points)
  1180. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1181. (2138)
  1182. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1183. \p{CWL}) (1043)
  1184. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1185. (1_113_069 plus all above-Unicode code
  1186. points)
  1187. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1043)
  1188. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1189. Y} (Short: \p{CWKCF}) (9946)
  1190. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1191. \P{CWKCF}) (1_104_166 plus all above-
  1192. Unicode code points)
  1193. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1194. \p{CWKCF}) (9946)
  1195. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1196. \p{CWT}) (1099)
  1197. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1198. (1_113_013 plus all above-Unicode code
  1199. points)
  1200. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1099)
  1201. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1202. \p{CWU}) (1126)
  1203. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1204. (1_112_986 plus all above-Unicode code
  1205. points)
  1206. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1126)
  1207. \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
  1208. \p{Block=Cherokee}) (85)
  1209. \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
  1210. \p{Block=Cherokee}) (85)
  1211. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1212. Y}) (1806)
  1213. \p{CI: *} \p{Case_Ignorable: *}
  1214. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1215. CJK_Unified_Ideographs}) (20_992)
  1216. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1217. CJK_Compatibility}) (256)
  1218. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1219. CJK_Compatibility_Forms}) (32)
  1220. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1221. \p{Block=CJK_Compatibility_Ideographs})
  1222. (512)
  1223. X \p{CJK_Compat_Ideographs_Sup}
  1224. \p{CJK_Compatibility_Ideographs_-
  1225. Supplement} (= \p{Block=
  1226. CJK_Compatibility_Ideographs_-
  1227. Supplement}) (544)
  1228. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1229. \p{InCJKCompat}) (256)
  1230. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1231. (Short: \p{InCJKCompatForms}) (32)
  1232. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1233. CJK_Compatibility_Ideographs} (Short:
  1234. \p{InCJKCompatIdeographs}) (512)
  1235. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1236. CJK_Compatibility_Ideographs_Supplement}
  1237. (Short: \p{InCJKCompatIdeographsSup})
  1238. (544)
  1239. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1240. \p{Block=
  1241. CJK_Unified_Ideographs_Extension_A})
  1242. (6592)
  1243. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1244. \p{Block=
  1245. CJK_Unified_Ideographs_Extension_B})
  1246. (42_720)
  1247. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1248. \p{Block=
  1249. CJK_Unified_Ideographs_Extension_C})
  1250. (4160)
  1251. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1252. \p{Block=
  1253. CJK_Unified_Ideographs_Extension_D})
  1254. (224)
  1255. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1256. CJK_Radicals_Supplement}) (128)
  1257. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1258. (Short: \p{InCJKRadicalsSup}) (128)
  1259. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1260. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1261. \p{Block=CJK_Symbols_And_Punctuation})
  1262. (64)
  1263. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1264. CJK_Symbols_And_Punctuation} (Short:
  1265. \p{InCJKSymbols}) (64)
  1266. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1267. (Short: \p{InCJK}) (20_992)
  1268. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1269. CJK_Unified_Ideographs_Extension_A}
  1270. (Short: \p{InCJKExtA}) (6592)
  1271. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1272. CJK_Unified_Ideographs_Extension_B}
  1273. (Short: \p{InCJKExtB}) (42_720)
  1274. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1275. CJK_Unified_Ideographs_Extension_C}
  1276. (Short: \p{InCJKExtC}) (4160)
  1277. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1278. CJK_Unified_Ideographs_Extension_D}
  1279. (Short: \p{InCJKExtD}) (224)
  1280. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1281. (Short: \p{Pe}) (73)
  1282. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1283. Unassigned}) (864_409 plus all above-
  1284. Unicode code points)
  1285. \p{Cntrl} \p{General_Category=Control} Control
  1286. characters (Short: \p{Cc}) (65)
  1287. \p{Co} \p{Private_Use} (= \p{General_Category=
  1288. Private_Use}) (NOT \p{Private_Use_Area})
  1289. (137_468)
  1290. X \p{Combining_Diacritical_Marks} \p{Block=
  1291. Combining_Diacritical_Marks} (Short:
  1292. \p{InDiacriticals}) (112)
  1293. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1294. Combining_Diacritical_Marks_For_Symbols}
  1295. (Short: \p{InDiacriticalsForSymbols})
  1296. (48)
  1297. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1298. Combining_Diacritical_Marks_Supplement}
  1299. (Short: \p{InDiacriticalsSup}) (64)
  1300. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1301. \p{InHalfMarks}) (16)
  1302. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1303. (1645)
  1304. X \p{Combining_Marks_For_Symbols}
  1305. \p{Combining_Diacritical_Marks_For_-
  1306. Symbols} (= \p{Block=
  1307. Combining_Diacritical_Marks_For_-
  1308. Symbols}) (48)
  1309. \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (6418)
  1310. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1311. (Short: \p{InIndicNumberForms}) (16)
  1312. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1313. \p{Full_Composition_Exclusion=Y}) (1120)
  1314. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1315. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1316. Hangul_Compatibility_Jamo}) (96)
  1317. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1318. \p{CE}) (81)
  1319. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
  1320. plus all above-Unicode code points)
  1321. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  1322. \p{Connector_Punctuation} \p{General_Category=
  1323. Connector_Punctuation} (Short: \p{Pc})
  1324. (10)
  1325. \p{Control} \p{Cntrl} (= \p{General_Category=Control})
  1326. (65)
  1327. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1328. \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
  1329. \p{Block=Coptic}) (137)
  1330. \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
  1331. \p{Block=Coptic}) (137)
  1332. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1333. Counting_Rod_Numerals}) (32)
  1334. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1335. \p{InCountingRod}) (32)
  1336. \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
  1337. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1338. Surrogate}) (2048)
  1339. \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
  1340. \p{Block=Cuneiform}) (982)
  1341. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1342. \p{Block=
  1343. Cuneiform_Numbers_And_Punctuation}) (128)
  1344. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1345. Cuneiform_Numbers_And_Punctuation}
  1346. (Short: \p{InCuneiformNumbers}) (128)
  1347. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1348. (Short: \p{Sc}) (49)
  1349. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1350. \p{CWCF} \p{Changes_When_Casefolded} (=
  1351. \p{Changes_When_Casefolded=Y}) (1107)
  1352. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1353. \p{CWCM} \p{Changes_When_Casemapped} (=
  1354. \p{Changes_When_Casemapped=Y}) (2138)
  1355. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1356. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1357. \p{Changes_When_NFKC_Casefolded=Y})
  1358. (9946)
  1359. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1360. \p{CWL} \p{Changes_When_Lowercased} (=
  1361. \p{Changes_When_Lowercased=Y}) (1043)
  1362. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1363. \p{CWT} \p{Changes_When_Titlecased} (=
  1364. \p{Changes_When_Titlecased=Y}) (1099)
  1365. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1366. \p{CWU} \p{Changes_When_Uppercased} (=
  1367. \p{Changes_When_Uppercased=Y}) (1126)
  1368. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1369. \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
  1370. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1371. \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
  1372. \p{Block=Cyrillic}) (417)
  1373. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1374. Cyrillic_Extended_A}) (32)
  1375. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1376. Cyrillic_Extended_B}) (96)
  1377. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1378. \p{InCyrillicExtA}) (32)
  1379. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1380. \p{InCyrillicExtB}) (96)
  1381. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1382. Cyrillic_Supplement}) (48)
  1383. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1384. \p{InCyrillicSup}) (48)
  1385. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1386. Cyrillic_Supplement}) (48)
  1387. \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
  1388. \p{Block=Cyrillic}) (417)
  1389. \p{Dash} \p{Dash=Y} (27)
  1390. \p{Dash: N*} (Single: \P{Dash}) (1_114_085 plus all
  1391. above-Unicode code points)
  1392. \p{Dash: Y*} (Single: \p{Dash}) (27)
  1393. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1394. (Short: \p{Pd}) (23)
  1395. \p{Decimal_Number} \p{Digit} (= \p{General_Category=
  1396. Decimal_Number}) (460)
  1397. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1398. (13_225)
  1399. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_225)
  1400. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  1401. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1402. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  1403. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1404. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1405. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  1406. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
  1407. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1408. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  1409. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1410. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  1411. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1412. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  1413. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1414. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  1415. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1416. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  1417. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1418. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  1419. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1420. Non_Canonical} (Perl extension) (3655)
  1421. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1422. decompositions (Short: \p{Dt=NonCanon})
  1423. (Perl extension) (3655)
  1424. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_232 plus
  1425. all above-Unicode code points)
  1426. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  1427. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1428. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
  1429. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
  1430. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  1431. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (146)
  1432. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (146)
  1433. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1434. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  1435. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  1436. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1437. Y} (Short: \p{DI}) (4169)
  1438. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1439. (1_109_943 plus all above-Unicode code
  1440. points)
  1441. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1442. (4169)
  1443. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (111)
  1444. \p{Dep: *} \p{Deprecated: *}
  1445. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (111)
  1446. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_001
  1447. plus all above-Unicode code points)
  1448. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (111)
  1449. \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
  1450. \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
  1451. (NOT \p{Block=Devanagari}) (151)
  1452. \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
  1453. NOT \p{Block=Devanagari}) (151)
  1454. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  1455. Devanagari_Extended}) (32)
  1456. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  1457. \p{InDevanagariExt}) (32)
  1458. \p{DI} \p{Default_Ignorable_Code_Point} (=
  1459. \p{Default_Ignorable_Code_Point=Y})
  1460. (4169)
  1461. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  1462. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (693)
  1463. \p{Dia: *} \p{Diacritic: *}
  1464. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (693)
  1465. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_419
  1466. plus all above-Unicode code points)
  1467. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (693)
  1468. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  1469. \p{Block=Combining_Diacritical_Marks})
  1470. (112)
  1471. X \p{Diacriticals_For_Symbols}
  1472. \p{Combining_Diacritical_Marks_For_-
  1473. Symbols} (= \p{Block=
  1474. Combining_Diacritical_Marks_For_-
  1475. Symbols}) (48)
  1476. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  1477. (= \p{Block=
  1478. Combining_Diacritical_Marks_Supplement})
  1479. (64)
  1480. \p{Digit} \p{General_Category=Decimal_Number} [0-9]
  1481. + all other decimal digits (Short:
  1482. \p{Nd}) (460)
  1483. X \p{Dingbats} \p{Block=Dingbats} (192)
  1484. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  1485. Domino_Tiles}) (112)
  1486. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  1487. \p{InDomino}) (112)
  1488. \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
  1489. \p{Dt: *} \p{Decomposition_Type: *}
  1490. \p{Ea: *} \p{East_Asian_Width: *}
  1491. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
  1492. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
  1493. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1494. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1495. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1496. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1497. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_894 plus
  1498. all above-Unicode code points)
  1499. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1500. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1501. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_894 plus all
  1502. above-Unicode code points)
  1503. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_134)
  1504. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_134)
  1505. \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
  1506. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1507. Egyptian_Hieroglyphs}) (1071)
  1508. \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
  1509. \p{Egyp}; NOT \p{Block=
  1510. Egyptian_Hieroglyphs}) (1071)
  1511. X \p{Emoticons} \p{Block=Emoticons} (80)
  1512. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  1513. Enclosed_Alphanumerics}) (160)
  1514. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  1515. \p{Block=
  1516. Enclosed_Alphanumeric_Supplement}) (256)
  1517. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1518. Enclosed_Alphanumeric_Supplement}
  1519. (Short: \p{InEnclosedAlphanumSup}) (256)
  1520. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  1521. (Short: \p{InEnclosedAlphanum}) (160)
  1522. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  1523. \p{Block=
  1524. Enclosed_CJK_Letters_And_Months}) (256)
  1525. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1526. Enclosed_CJK_Letters_And_Months} (Short:
  1527. \p{InEnclosedCJK}) (256)
  1528. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  1529. (= \p{Block=
  1530. Enclosed_Ideographic_Supplement}) (256)
  1531. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1532. Enclosed_Ideographic_Supplement} (Short:
  1533. \p{InEnclosedIdeographicSup}) (256)
  1534. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1535. (Short: \p{Me}) (12)
  1536. \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
  1537. \p{Block=Ethiopic}) (495)
  1538. \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
  1539. \p{Block=Ethiopic}) (495)
  1540. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  1541. Ethiopic_Extended}) (96)
  1542. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  1543. Ethiopic_Extended_A}) (48)
  1544. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  1545. \p{InEthiopicExt}) (96)
  1546. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  1547. \p{InEthiopicExtA}) (48)
  1548. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  1549. Ethiopic_Supplement}) (32)
  1550. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  1551. \p{InEthiopicSup}) (32)
  1552. \p{Ext} \p{Extender} (= \p{Extender=Y}) (31)
  1553. \p{Ext: *} \p{Extender: *}
  1554. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (31)
  1555. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_081
  1556. plus all above-Unicode code points)
  1557. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (31)
  1558. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1559. (Short: \p{Pf}) (10)
  1560. \p{Format} \p{General_Category=Format} (Short:
  1561. \p{Cf}) (145)
  1562. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1563. (Short: \p{CompEx}) (1120)
  1564. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1565. \P{CompEx}) (1_112_992 plus all above-
  1566. Unicode code points)
  1567. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1568. \p{CompEx}) (1120)
  1569. \p{Gc: *} \p{General_Category: *}
  1570. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1571. \p{General_Category: C} \p{General_Category=Other} (1_004_135 plus
  1572. all above-Unicode code points)
  1573. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1574. \p{Gc=LC}, \p{LC}) (3223)
  1575. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1576. \p{General_Category: Cf} \p{General_Category=Format} (145)
  1577. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1578. (73)
  1579. \p{General_Category: Cn} \p{General_Category=Unassigned} (864_409
  1580. plus all above-Unicode code points)
  1581. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1582. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1583. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  1584. (1645)
  1585. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1586. \p{Pc}) (10)
  1587. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1588. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1589. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1590. (49)
  1591. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1592. (23)
  1593. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1594. (460)
  1595. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1596. (460)
  1597. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1598. (12)
  1599. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1600. (10)
  1601. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (145)
  1602. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1603. \p{Pi}) (12)
  1604. \p{General_Category: L} \p{General_Category=Letter} (101_013)
  1605. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3223)
  1606. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  1607. the trailing '_' matters in spite of
  1608. loose matching rules. (3223)
  1609. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3223)
  1610. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (101_013)
  1611. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1612. (224)
  1613. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1614. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1615. (/i= General_Category=Cased_Letter)
  1616. (1751)
  1617. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (237)
  1618. \p{General_Category: Lo} \p{General_Category=Other_Letter} (97_553)
  1619. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1620. /i= General_Category=Cased_Letter) (1751)
  1621. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1622. (/i= General_Category=Cased_Letter) (31)
  1623. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1624. (/i= General_Category=Cased_Letter)
  1625. (1441)
  1626. \p{General_Category: M} \p{General_Category=Mark} (1645)
  1627. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1645)
  1628. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
  1629. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (352)
  1630. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (12)
  1631. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1632. (1281)
  1633. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1634. (237)
  1635. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1636. (115)
  1637. \p{General_Category: N} \p{General_Category=Number} (1148)
  1638. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (460)
  1639. \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
  1640. \p{General_Category: No} \p{General_Category=Other_Number} (464)
  1641. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  1642. (1281)
  1643. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1148)
  1644. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  1645. (74)
  1646. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_004_135
  1647. plus all above-Unicode code points)
  1648. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  1649. (97_553)
  1650. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (464)
  1651. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  1652. (434)
  1653. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  1654. (4404)
  1655. \p{General_Category: P} \p{General_Category=Punctuation} (636)
  1656. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  1657. \p{Zp}) (1)
  1658. \p{General_Category: Pc} \p{General_Category=
  1659. Connector_Punctuation} (10)
  1660. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (23)
  1661. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  1662. (73)
  1663. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  1664. (10)
  1665. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  1666. (12)
  1667. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  1668. (434)
  1669. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  1670. (137_468)
  1671. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (74)
  1672. \p{General_Category: Punct} \p{General_Category=Punctuation} (636)
  1673. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (636)
  1674. \p{General_Category: S} \p{General_Category=Symbol} (5516)
  1675. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (49)
  1676. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
  1677. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (115)
  1678. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  1679. \p{General_Category: So} \p{General_Category=Other_Symbol} (4404)
  1680. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  1681. (17)
  1682. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (352)
  1683. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  1684. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (5516)
  1685. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  1686. /i= General_Category=Cased_Letter) (31)
  1687. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  1688. (864_409 plus all above-Unicode code
  1689. points)
  1690. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  1691. /i= General_Category=Cased_Letter) (1441)
  1692. \p{General_Category: Z} \p{General_Category=Separator} (19)
  1693. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  1694. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  1695. (1)
  1696. \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
  1697. X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
  1698. \p{InPunctuation}) (112)
  1699. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  1700. \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
  1701. \p{Block=Georgian}) (127)
  1702. \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
  1703. \p{Block=Georgian}) (127)
  1704. X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
  1705. Georgian_Supplement}) (48)
  1706. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
  1707. \p{InGeorgianSup}) (48)
  1708. \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
  1709. (NOT \p{Block=Glagolitic}) (94)
  1710. \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
  1711. NOT \p{Block=Glagolitic}) (94)
  1712. \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
  1713. \p{Block=Gothic}) (27)
  1714. \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
  1715. \p{Block=Gothic}) (27)
  1716. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  1717. (108_659)
  1718. \p{Gr_Base: *} \p{Grapheme_Base: *}
  1719. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  1720. Y}) (1318)
  1721. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  1722. \p{Graph} Characters that are graphical (247_571)
  1723. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  1724. (108_659)
  1725. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase})
  1726. (1_005_453 plus all above-Unicode code
  1727. points)
  1728. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (108_659)
  1729. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  1730. (6025)
  1731. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6025)
  1732. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  1733. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  1734. (1318)
  1735. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1318)
  1736. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  1737. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  1738. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  1739. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  1740. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_922
  1741. plus all above-Unicode code points)
  1742. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  1743. (0)
  1744. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
  1745. \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
  1746. (26)
  1747. \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
  1748. Regional_Indicator} (26)
  1749. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  1750. SpacingMark} (290)
  1751. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (290)
  1752. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  1753. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  1754. \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
  1755. (1_094_922 plus all above-Unicode code
  1756. points)
  1757. \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
  1758. (1318)
  1759. \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_794
  1760. plus all above-Unicode code points)
  1761. \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1318)
  1762. \p{Greek} \p{Script=Greek} (Short: \p{Grek}; NOT
  1763. \p{Greek_And_Coptic}) (511)
  1764. X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
  1765. \p{InGreek}) (144)
  1766. X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
  1767. Greek_Extended}) (256)
  1768. X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
  1769. \p{InGreekExt}) (256)
  1770. \p{Grek} \p{Greek} (= \p{Script=Greek}) (NOT
  1771. \p{Greek_And_Coptic}) (511)
  1772. \p{Gujarati} \p{Script=Gujarati} (Short: \p{Gujr}; NOT
  1773. \p{Block=Gujarati}) (84)
  1774. \p{Gujr} \p{Gujarati} (= \p{Script=Gujarati}) (NOT
  1775. \p{Block=Gujarati}) (84)
  1776. \p{Gurmukhi} \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
  1777. \p{Block=Gurmukhi}) (79)
  1778. \p{Guru} \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
  1779. \p{Block=Gurmukhi}) (79)
  1780. X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
  1781. \p{Block=Halfwidth_And_Fullwidth_Forms})
  1782. (240)
  1783. X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
  1784. Combining_Half_Marks}) (16)
  1785. X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
  1786. Halfwidth_And_Fullwidth_Forms} (Short:
  1787. \p{InHalfAndFullForms}) (240)
  1788. \p{Han} \p{Script=Han} (75_963)
  1789. \p{Hang} \p{Hangul} (= \p{Script=Hangul}) (NOT
  1790. \p{Hangul_Syllables}) (11_739)
  1791. \p{Hangul} \p{Script=Hangul} (Short: \p{Hang}; NOT
  1792. \p{Hangul_Syllables}) (11_739)
  1793. X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
  1794. (Short: \p{InCompatJamo}) (96)
  1795. X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
  1796. (256)
  1797. X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
  1798. (Short: \p{InJamoExtA}) (32)
  1799. X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
  1800. (Short: \p{InJamoExtB}) (80)
  1801. \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
  1802. (125)
  1803. \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
  1804. \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
  1805. (399)
  1806. \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
  1807. \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
  1808. LVT_Syllable} (10_773)
  1809. \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
  1810. (10_773)
  1811. \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
  1812. Not_Applicable} (1_102_583 plus all
  1813. above-Unicode code points)
  1814. \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
  1815. (1_102_583 plus all above-Unicode code
  1816. points)
  1817. \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
  1818. (137)
  1819. \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
  1820. \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
  1821. (95)
  1822. \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
  1823. X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
  1824. \p{InHangul}) (11_184)
  1825. \p{Hani} \p{Han} (= \p{Script=Han}) (75_963)
  1826. \p{Hano} \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
  1827. \p{Block=Hanunoo}) (21)
  1828. \p{Hanunoo} \p{Script=Hanunoo} (Short: \p{Hano}; NOT
  1829. \p{Block=Hanunoo}) (21)
  1830. \p{Hebr} \p{Hebrew} (= \p{Script=Hebrew}) (NOT
  1831. \p{Block=Hebrew}) (133)
  1832. \p{Hebrew} \p{Script=Hebrew} (Short: \p{Hebr}; NOT
  1833. \p{Block=Hebrew}) (133)
  1834. \p{Hex} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  1835. \p{Hex: *} \p{Hex_Digit: *}
  1836. \p{Hex_Digit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  1837. \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
  1838. plus all above-Unicode code points)
  1839. \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
  1840. X \p{High_Private_Use_Surrogates} \p{Block=
  1841. High_Private_Use_Surrogates} (Short:
  1842. \p{InHighPUSurrogates}) (128)
  1843. X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
  1844. \p{Block=High_Private_Use_Surrogates})
  1845. (128)
  1846. X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  1847. \p{Hira} \p{Hiragana} (= \p{Script=Hiragana}) (NOT
  1848. \p{Block=Hiragana}) (91)
  1849. \p{Hiragana} \p{Script=Hiragana} (Short: \p{Hira}; NOT
  1850. \p{Block=Hiragana}) (91)
  1851. \p{HorizSpace} \p{Blank} (18)
  1852. \p{Hst: *} \p{Hangul_Syllable_Type: *}
  1853. D \p{Hyphen} \p{Hyphen=Y} (11)
  1854. D \p{Hyphen: N*} Supplanted by Line_Break property values;
  1855. see www.unicode.org/reports/tr14
  1856. (Single: \P{Hyphen}) (1_114_101 plus all
  1857. above-Unicode code points)
  1858. D \p{Hyphen: Y*} Supplanted by Line_Break property values;
  1859. see www.unicode.org/reports/tr14
  1860. (Single: \p{Hyphen}) (11)
  1861. \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
  1862. \p{Ideographic_Description_Characters})
  1863. (103_355)
  1864. \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (1_010_757
  1865. plus all above-Unicode code points)
  1866. \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (103_355)
  1867. \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (101_240)
  1868. \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (1_012_872
  1869. plus all above-Unicode code points)
  1870. \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (101_240)
  1871. \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
  1872. \p{Ideographic_Description_Characters})
  1873. (103_355)
  1874. \p{IDC: *} \p{ID_Continue: *}
  1875. \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
  1876. (75_633)
  1877. \p{Ideo: *} \p{Ideographic: *}
  1878. \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
  1879. (75_633)
  1880. \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_038_479
  1881. plus all above-Unicode code points)
  1882. \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (75_633)
  1883. X \p{Ideographic_Description_Characters} \p{Block=
  1884. Ideographic_Description_Characters}
  1885. (Short: \p{InIDC}) (16)
  1886. \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (101_240)
  1887. \p{IDS: *} \p{ID_Start: *}
  1888. \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
  1889. \p{IDSB}) (10)
  1890. \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
  1891. (1_114_102 plus all above-Unicode code
  1892. points)
  1893. \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  1894. \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
  1895. \p{IDST}) (2)
  1896. \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
  1897. (1_114_110 plus all above-Unicode code
  1898. points)
  1899. \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
  1900. \p{IDSB} \p{IDS_Binary_Operator} (=
  1901. \p{IDS_Binary_Operator=Y}) (10)
  1902. \p{IDSB: *} \p{IDS_Binary_Operator: *}
  1903. \p{IDST} \p{IDS_Trinary_Operator} (=
  1904. \p{IDS_Trinary_Operator=Y}) (2)
  1905. \p{IDST: *} \p{IDS_Trinary_Operator: *}
  1906. \p{Imperial_Aramaic} \p{Script=Imperial_Aramaic} (Short:
  1907. \p{Armi}; NOT \p{Block=
  1908. Imperial_Aramaic}) (31)
  1909. \p{In: *} \p{Present_In: *} (Perl extension)
  1910. \p{In_*} \p{Block: *}
  1911. X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
  1912. Common_Indic_Number_Forms}) (16)
  1913. \p{Inherited} \p{Script=Inherited} (Short: \p{Zinh})
  1914. (523)
  1915. \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
  1916. (Short: \p{Pi}) (12)
  1917. \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
  1918. \p{Phli}; NOT \p{Block=
  1919. Inscriptional_Pahlavi}) (27)
  1920. \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
  1921. (Short: \p{Prti}; NOT \p{Block=
  1922. Inscriptional_Parthian}) (30)
  1923. X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
  1924. IPA_Extensions}) (96)
  1925. X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
  1926. \p{InIPAExt}) (96)
  1927. \p{Is_*} \p{*} (Any exceptions are individually
  1928. noted beginning with the word NOT.) If
  1929. an entry has flag(s) at its beginning,
  1930. like "D", the "Is_" form has the same
  1931. flag(s)
  1932. \p{Ital} \p{Old_Italic} (= \p{Script=Old_Italic})
  1933. (NOT \p{Block=Old_Italic}) (35)
  1934. X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
  1935. (256)
  1936. X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
  1937. Hangul_Jamo_Extended_A}) (32)
  1938. X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
  1939. Hangul_Jamo_Extended_B}) (80)
  1940. \p{Java} \p{Javanese} (= \p{Script=Javanese}) (NOT
  1941. \p{Block=Javanese}) (90)
  1942. \p{Javanese} \p{Script=Javanese} (Short: \p{Java}; NOT
  1943. \p{Block=Javanese}) (90)
  1944. \p{Jg: *} \p{Joining_Group: *}
  1945. \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  1946. \p{Join_C: *} \p{Join_Control: *}
  1947. \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  1948. \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
  1949. plus all above-Unicode code points)
  1950. \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  1951. \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (7)
  1952. \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
  1953. \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
  1954. \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (20)
  1955. \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
  1956. \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
  1957. BurushaskiYehBarree}) (2)
  1958. \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (14)
  1959. \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
  1960. \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
  1961. \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
  1962. \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
  1963. \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
  1964. \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
  1965. \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (13)
  1966. \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
  1967. \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
  1968. \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
  1969. HamzaOnHehGoal}) (1)
  1970. \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
  1971. \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
  1972. \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
  1973. \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
  1974. \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (5)
  1975. \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
  1976. \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
  1977. \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
  1978. \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
  1979. \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
  1980. \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
  1981. \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
  1982. \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
  1983. (1_113_870 plus all above-Unicode code
  1984. points)
  1985. \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
  1986. \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
  1987. \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
  1988. \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
  1989. \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
  1990. \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
  1991. \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (17)
  1992. \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
  1993. \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
  1994. \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (5)
  1995. \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
  1996. \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
  1997. \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
  1998. \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
  1999. \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
  2000. \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
  2001. \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
  2002. \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
  2003. \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
  2004. \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
  2005. Hamza_On_Heh_Goal} (1)
  2006. \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
  2007. \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
  2008. \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (10)
  2009. \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
  2010. \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
  2011. \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
  2012. \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
  2013. \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
  2014. \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
  2015. \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
  2016. \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (389)
  2017. \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (389)
  2018. \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
  2019. \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (1)
  2020. \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (1)
  2021. \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_211 plus
  2022. all above-Unicode code points)
  2023. \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (82)
  2024. \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (82)
  2025. \p{Joining_Type: T} \p{Joining_Type=Transparent} (1425)
  2026. \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1425)
  2027. \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_112_211
  2028. plus all above-Unicode code points)
  2029. \p{Jt: *} \p{Joining_Type: *}
  2030. \p{Kaithi} \p{Script=Kaithi} (Short: \p{Kthi}; NOT
  2031. \p{Block=Kaithi}) (66)
  2032. \p{Kali} \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
  2033. \p{Kana} \p{Katakana} (= \p{Script=Katakana}) (NOT
  2034. \p{Block=Katakana}) (300)
  2035. X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
  2036. Kana_Supplement}) (256)
  2037. X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
  2038. \p{InKanaSup}) (256)
  2039. X \p{Kanbun} \p{Block=Kanbun} (16)
  2040. X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
  2041. Kangxi_Radicals}) (224)
  2042. X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
  2043. \p{InKangxi}) (224)
  2044. \p{Kannada} \p{Script=Kannada} (Short: \p{Knda}; NOT
  2045. \p{Block=Kannada}) (86)
  2046. \p{Katakana} \p{Script=Katakana} (Short: \p{Kana}; NOT
  2047. \p{Block=Katakana}) (300)
  2048. X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
  2049. \p{Block=Katakana_Phonetic_Extensions})
  2050. (16)
  2051. X \p{Katakana_Phonetic_Extensions} \p{Block=
  2052. Katakana_Phonetic_Extensions} (Short:
  2053. \p{InKatakanaExt}) (16)
  2054. \p{Kayah_Li} \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
  2055. \p{Khar} \p{Kharoshthi} (= \p{Script=Kharoshthi})
  2056. (NOT \p{Block=Kharoshthi}) (65)
  2057. \p{Kharoshthi} \p{Script=Kharoshthi} (Short: \p{Khar};
  2058. NOT \p{Block=Kharoshthi}) (65)
  2059. \p{Khmer} \p{Script=Khmer} (Short: \p{Khmr}; NOT
  2060. \p{Block=Khmer}) (146)
  2061. X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  2062. \p{Khmr} \p{Khmer} (= \p{Script=Khmer}) (NOT
  2063. \p{Block=Khmer}) (146)
  2064. \p{Knda} \p{Kannada} (= \p{Script=Kannada}) (NOT
  2065. \p{Block=Kannada}) (86)
  2066. \p{Kthi} \p{Kaithi} (= \p{Script=Kaithi}) (NOT
  2067. \p{Block=Kaithi}) (66)
  2068. \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
  2069. (101_013)
  2070. X \p{L&} \p{Cased_Letter} (= \p{General_Category=
  2071. Cased_Letter}) (3223)
  2072. X \p{L_} \p{Cased_Letter} (= \p{General_Category=
  2073. Cased_Letter}) Note the trailing '_'
  2074. matters in spite of loose matching
  2075. rules. (3223)
  2076. \p{Lana} \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
  2077. \p{Block=Tai_Tham}) (127)
  2078. \p{Lao} \p{Script=Lao} (NOT \p{Block=Lao}) (67)
  2079. \p{Laoo} \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
  2080. Lao}) (67)
  2081. \p{Latin} \p{Script=Latin} (Short: \p{Latn}) (1272)
  2082. X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
  2083. Latin_1_Supplement}) (128)
  2084. X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
  2085. Latin_1_Supplement}) (128)
  2086. X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
  2087. \p{InLatin1}) (128)
  2088. X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
  2089. Latin_Extended_A}) (128)
  2090. X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
  2091. \p{Block=Latin_Extended_Additional})
  2092. (256)
  2093. X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
  2094. Latin_Extended_B}) (208)
  2095. X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
  2096. Latin_Extended_C}) (32)
  2097. X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
  2098. Latin_Extended_D}) (224)
  2099. X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
  2100. \p{InLatinExtA}) (128)
  2101. X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
  2102. (Short: \p{InLatinExtAdditional}) (256)
  2103. X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
  2104. \p{InLatinExtB}) (208)
  2105. X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
  2106. \p{InLatinExtC}) (32)
  2107. X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
  2108. \p{InLatinExtD}) (224)
  2109. \p{Latn} \p{Latin} (= \p{Script=Latin}) (1272)
  2110. \p{Lb: *} \p{Line_Break: *}
  2111. \p{LC} \p{Cased_Letter} (= \p{General_Category=
  2112. Cased_Letter}) (3223)
  2113. \p{Lepc} \p{Lepcha} (= \p{Script=Lepcha}) (NOT
  2114. \p{Block=Lepcha}) (74)
  2115. \p{Lepcha} \p{Script=Lepcha} (Short: \p{Lepc}; NOT
  2116. \p{Block=Lepcha}) (74)
  2117. \p{Letter} \p{General_Category=Letter} (Short: \p{L})
  2118. (101_013)
  2119. \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
  2120. \p{Nl}) (224)
  2121. X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  2122. \p{Limb} \p{Limbu} (= \p{Script=Limbu}) (NOT
  2123. \p{Block=Limbu}) (66)
  2124. \p{Limbu} \p{Script=Limbu} (Short: \p{Limb}; NOT
  2125. \p{Block=Limbu}) (66)
  2126. \p{Linb} \p{Linear_B} (= \p{Script=Linear_B}) (211)
  2127. \p{Line_Break: AI} \p{Line_Break=Ambiguous} (687)
  2128. \p{Line_Break: AL} \p{Line_Break=Alphabetic} (15_355)
  2129. \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (15_355)
  2130. \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (687)
  2131. \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
  2132. \p{Line_Break: BA} \p{Line_Break=Break_After} (152)
  2133. \p{Line_Break: BB} \p{Line_Break=Break_Before} (19)
  2134. \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
  2135. \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (152)
  2136. \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (19)
  2137. \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
  2138. \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
  2139. \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
  2140. \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
  2141. \p{Line_Break: CJ} \p{Line_Break=
  2142. Conditional_Japanese_Starter} (51)
  2143. \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (87)
  2144. \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
  2145. \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (87)
  2146. \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (1634)
  2147. \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1634)
  2148. \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (665)
  2149. \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
  2150. (51)
  2151. \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
  2152. \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
  2153. \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
  2154. \p{Line_Break: EX} \p{Line_Break=Exclamation} (34)
  2155. \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (34)
  2156. \p{Line_Break: GL} \p{Line_Break=Glue} (18)
  2157. \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
  2158. \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
  2159. \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
  2160. \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
  2161. \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
  2162. \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
  2163. \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
  2164. \p{Line_Break: ID} \p{Line_Break=Ideographic} (162_698)
  2165. \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (162_698)
  2166. \p{Line_Break: IN} \p{Line_Break=Inseparable} (4)
  2167. \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
  2168. \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (4)
  2169. \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (4)
  2170. \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
  2171. \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
  2172. \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
  2173. \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
  2174. \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
  2175. \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
  2176. \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
  2177. \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
  2178. \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
  2179. \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (26)
  2180. \p{Line_Break: NS} \p{Line_Break=Nonstarter} (26)
  2181. \p{Line_Break: NU} \p{Line_Break=Numeric} (452)
  2182. \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (452)
  2183. \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (81)
  2184. \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (81)
  2185. \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (28)
  2186. \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (28)
  2187. \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (67)
  2188. \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67)
  2189. \p{Line_Break: QU} \p{Line_Break=Quotation} (34)
  2190. \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (34)
  2191. \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
  2192. \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
  2193. \p{Line_Break: SA} \p{Line_Break=Complex_Context} (665)
  2194. D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
  2195. \p{Line_Break: SP} \p{Line_Break=Space} (1)
  2196. \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
  2197. D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
  2198. should never appear in well-formed text,
  2199. and therefore shouldn't be the basis for
  2200. line breaking (Short: \p{Lb=SG}) (2048)
  2201. \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
  2202. \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (918_311 plus all
  2203. above-Unicode code points)
  2204. \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
  2205. \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
  2206. \p{Line_Break: XX} \p{Line_Break=Unknown} (918_311 plus all
  2207. above-Unicode code points)
  2208. \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
  2209. \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
  2210. \p{Line_Separator} \p{General_Category=Line_Separator}
  2211. (Short: \p{Zl}) (1)
  2212. \p{Linear_B} \p{Script=Linear_B} (Short: \p{Linb}) (211)
  2213. X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
  2214. X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
  2215. \p{Lisu} \p{Script=Lisu} (48)
  2216. \p{Ll} \p{Lowercase_Letter} (=
  2217. \p{General_Category=Lowercase_Letter})
  2218. (/i= General_Category=Cased_Letter)
  2219. (1751)
  2220. \p{Lm} \p{Modifier_Letter} (=
  2221. \p{General_Category=Modifier_Letter})
  2222. (237)
  2223. \p{Lo} \p{Other_Letter} (= \p{General_Category=
  2224. Other_Letter}) (97_553)
  2225. \p{LOE} \p{Logical_Order_Exception} (=
  2226. \p{Logical_Order_Exception=Y}) (15)
  2227. \p{LOE: *} \p{Logical_Order_Exception: *}
  2228. \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
  2229. \p{LOE}) (15)
  2230. \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
  2231. (1_114_097 plus all above-Unicode code
  2232. points)
  2233. \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
  2234. X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
  2235. \p{Lower} \p{Lowercase=Y} (/i= Cased=Yes) (1934)
  2236. \p{Lower: *} \p{Lowercase: *}
  2237. \p{Lowercase} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
  2238. Yes) (1934)
  2239. \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
  2240. No) (1_112_178 plus all above-Unicode
  2241. code points)
  2242. \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
  2243. Yes) (1934)
  2244. \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
  2245. (Short: \p{Ll}; /i= General_Category=
  2246. Cased_Letter) (1751)
  2247. \p{Lt} \p{Titlecase_Letter} (=
  2248. \p{General_Category=Titlecase_Letter})
  2249. (/i= General_Category=Cased_Letter) (31)
  2250. \p{Lu} \p{Uppercase_Letter} (=
  2251. \p{General_Category=Uppercase_Letter})
  2252. (/i= General_Category=Cased_Letter)
  2253. (1441)
  2254. \p{Lyci} \p{Lycian} (= \p{Script=Lycian}) (NOT
  2255. \p{Block=Lycian}) (29)
  2256. \p{Lycian} \p{Script=Lycian} (Short: \p{Lyci}; NOT
  2257. \p{Block=Lycian}) (29)
  2258. \p{Lydi} \p{Lydian} (= \p{Script=Lydian}) (NOT
  2259. \p{Block=Lydian}) (27)
  2260. \p{Lydian} \p{Script=Lydian} (Short: \p{Lydi}; NOT
  2261. \p{Block=Lydian}) (27)
  2262. \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
  2263. (1645)
  2264. X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
  2265. Mahjong_Tiles}) (48)
  2266. X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
  2267. \p{InMahjong}) (48)
  2268. \p{Malayalam} \p{Script=Malayalam} (Short: \p{Mlym}; NOT
  2269. \p{Block=Malayalam}) (98)
  2270. \p{Mand} \p{Mandaic} (= \p{Script=Mandaic}) (NOT
  2271. \p{Block=Mandaic}) (29)
  2272. \p{Mandaic} \p{Script=Mandaic} (Short: \p{Mand}; NOT
  2273. \p{Block=Mandaic}) (29)
  2274. \p{Mark} \p{General_Category=Mark} (Short: \p{M})
  2275. (1645)
  2276. \p{Math} \p{Math=Y} (2310)
  2277. \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
  2278. above-Unicode code points)
  2279. \p{Math: Y*} (Single: \p{Math}) (2310)
  2280. X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
  2281. \p{Block=
  2282. Mathematical_Alphanumeric_Symbols})
  2283. (1024)
  2284. X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
  2285. Mathematical_Operators}) (256)
  2286. \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
  2287. \p{Sm}) (948)
  2288. X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
  2289. Mathematical_Alphanumeric_Symbols}
  2290. (Short: \p{InMathAlphanum}) (1024)
  2291. X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
  2292. (Short: \p{InMathOperators}) (256)
  2293. \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
  2294. Spacing_Mark}) (352)
  2295. \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
  2296. Enclosing_Mark}) (12)
  2297. \p{Meetei_Mayek} \p{Script=Meetei_Mayek} (Short: \p{Mtei};
  2298. NOT \p{Block=Meetei_Mayek}) (79)
  2299. X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
  2300. Meetei_Mayek_Extensions}) (32)
  2301. X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
  2302. (Short: \p{InMeeteiMayekExt}) (32)
  2303. \p{Merc} \p{Meroitic_Cursive} (= \p{Script=
  2304. Meroitic_Cursive}) (NOT \p{Block=
  2305. Meroitic_Cursive}) (26)
  2306. \p{Mero} \p{Meroitic_Hieroglyphs} (= \p{Script=
  2307. Meroitic_Hieroglyphs}) (32)
  2308. \p{Meroitic_Cursive} \p{Script=Meroitic_Cursive} (Short:
  2309. \p{Merc}; NOT \p{Block=
  2310. Meroitic_Cursive}) (26)
  2311. \p{Meroitic_Hieroglyphs} \p{Script=Meroitic_Hieroglyphs} (Short:
  2312. \p{Mero}) (32)
  2313. \p{Miao} \p{Script=Miao} (NOT \p{Block=Miao}) (133)
  2314. X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
  2315. \p{Block=
  2316. Miscellaneous_Symbols_And_Arrows}) (256)
  2317. X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
  2318. (= \p{Block=
  2319. Miscellaneous_Mathematical_Symbols_A})
  2320. (48)
  2321. X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
  2322. (= \p{Block=
  2323. Miscellaneous_Mathematical_Symbols_B})
  2324. (128)
  2325. X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
  2326. (= \p{Block=
  2327. Miscellaneous_Symbols_And_Pictographs})
  2328. (768)
  2329. X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
  2330. Miscellaneous_Symbols}) (256)
  2331. X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
  2332. Miscellaneous_Technical}) (256)
  2333. X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
  2334. Miscellaneous_Mathematical_Symbols_A}
  2335. (Short: \p{InMiscMathSymbolsA}) (48)
  2336. X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
  2337. Miscellaneous_Mathematical_Symbols_B}
  2338. (Short: \p{InMiscMathSymbolsB}) (128)
  2339. X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
  2340. \p{InMiscSymbols}) (256)
  2341. X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
  2342. Miscellaneous_Symbols_And_Arrows}
  2343. (Short: \p{InMiscArrows}) (256)
  2344. X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
  2345. Miscellaneous_Symbols_And_Pictographs}
  2346. (Short: \p{InMiscPictographs}) (768)
  2347. X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
  2348. (Short: \p{InMiscTechnical}) (256)
  2349. \p{Mlym} \p{Malayalam} (= \p{Script=Malayalam})
  2350. (NOT \p{Block=Malayalam}) (98)
  2351. \p{Mn} \p{Nonspacing_Mark} (=
  2352. \p{General_Category=Nonspacing_Mark})
  2353. (1281)
  2354. \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
  2355. (Short: \p{Lm}) (237)
  2356. X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
  2357. Spacing_Modifier_Letters}) (80)
  2358. \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
  2359. (Short: \p{Sk}) (115)
  2360. X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  2361. \p{Mong} \p{Mongolian} (= \p{Script=Mongolian})
  2362. (NOT \p{Block=Mongolian}) (153)
  2363. \p{Mongolian} \p{Script=Mongolian} (Short: \p{Mong}; NOT
  2364. \p{Block=Mongolian}) (153)
  2365. \p{Mtei} \p{Meetei_Mayek} (= \p{Script=
  2366. Meetei_Mayek}) (NOT \p{Block=
  2367. Meetei_Mayek}) (79)
  2368. X \p{Music} \p{Musical_Symbols} (= \p{Block=
  2369. Musical_Symbols}) (256)
  2370. X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
  2371. \p{InMusic}) (256)
  2372. \p{Myanmar} \p{Script=Myanmar} (Short: \p{Mymr}; NOT
  2373. \p{Block=Myanmar}) (188)
  2374. X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
  2375. Myanmar_Extended_A}) (32)
  2376. X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
  2377. \p{InMyanmarExtA}) (32)
  2378. \p{Mymr} \p{Myanmar} (= \p{Script=Myanmar}) (NOT
  2379. \p{Block=Myanmar}) (188)
  2380. \p{N} \pN \p{Number} (= \p{General_Category=Number})
  2381. (1148)
  2382. X \p{NB} \p{No_Block} (= \p{Block=No_Block})
  2383. (860_672 plus all above-Unicode code
  2384. points)
  2385. \p{NChar} \p{Noncharacter_Code_Point} (=
  2386. \p{Noncharacter_Code_Point=Y}) (66)
  2387. \p{NChar: *} \p{Noncharacter_Code_Point: *}
  2388. \p{Nd} \p{Digit} (= \p{General_Category=
  2389. Decimal_Number}) (460)
  2390. \p{New_Tai_Lue} \p{Script=New_Tai_Lue} (Short: \p{Talu};
  2391. NOT \p{Block=New_Tai_Lue}) (83)
  2392. \p{NFC_QC: *} \p{NFC_Quick_Check: *}
  2393. \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (104)
  2394. \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (104)
  2395. \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
  2396. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2397. (1120)
  2398. \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
  2399. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2400. (1120)
  2401. \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
  2402. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2403. (1_112_888 plus all above-Unicode code
  2404. points)
  2405. \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
  2406. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2407. (1_112_888 plus all above-Unicode code
  2408. points)
  2409. \p{NFD_QC: *} \p{NFD_Quick_Check: *}
  2410. \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
  2411. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2412. (13_225)
  2413. \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
  2414. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2415. (13_225)
  2416. \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
  2417. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2418. (1_100_887 plus all above-Unicode code
  2419. points)
  2420. \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
  2421. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2422. (1_100_887 plus all above-Unicode code
  2423. points)
  2424. \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
  2425. \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (104)
  2426. \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (104)
  2427. \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
  2428. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2429. (4787)
  2430. \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
  2431. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2432. (4787)
  2433. \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
  2434. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2435. (1_109_221 plus all above-Unicode code
  2436. points)
  2437. \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
  2438. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2439. (1_109_221 plus all above-Unicode code
  2440. points)
  2441. \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
  2442. \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
  2443. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2444. (16_880)
  2445. \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
  2446. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2447. (16_880)
  2448. \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
  2449. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2450. (1_097_232 plus all above-Unicode code
  2451. points)
  2452. \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
  2453. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2454. (1_097_232 plus all above-Unicode code
  2455. points)
  2456. \p{Nko} \p{Script=Nko} (NOT \p{NKo}) (59)
  2457. \p{Nkoo} \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
  2458. (59)
  2459. \p{Nl} \p{Letter_Number} (= \p{General_Category=
  2460. Letter_Number}) (224)
  2461. \p{No} \p{Other_Number} (= \p{General_Category=
  2462. Other_Number}) (464)
  2463. X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
  2464. (860_672 plus all above-Unicode code
  2465. points)
  2466. \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
  2467. \p{NChar}) (66)
  2468. \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
  2469. (1_114_046 plus all above-Unicode code
  2470. points)
  2471. \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
  2472. (66)
  2473. \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
  2474. (Short: \p{Mn}) (1281)
  2475. \p{Nt: *} \p{Numeric_Type: *}
  2476. \p{Number} \p{General_Category=Number} (Short: \p{N})
  2477. (1148)
  2478. X \p{Number_Forms} \p{Block=Number_Forms} (64)
  2479. \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (460)
  2480. \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (460)
  2481. \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
  2482. \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
  2483. \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_883 plus all
  2484. above-Unicode code points)
  2485. \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (641)
  2486. \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (641)
  2487. T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
  2488. T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (60)
  2489. T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
  2490. T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
  2491. T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
  2492. T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (5)
  2493. T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
  2494. T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (2)
  2495. T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
  2496. T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (1)
  2497. T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (9)
  2498. T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (4)
  2499. T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
  2500. T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
  2501. T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (10)
  2502. T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
  2503. T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
  2504. T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (5)
  2505. T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (6)
  2506. T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
  2507. T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (2)
  2508. T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
  2509. T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (97)
  2510. T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
  2511. T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (101)
  2512. T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
  2513. T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (103)
  2514. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
  2515. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (93)
  2516. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
  2517. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (90)
  2518. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
  2519. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (82)
  2520. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
  2521. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (81)
  2522. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
  2523. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (77)
  2524. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
  2525. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (81)
  2526. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (40)
  2527. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
  2528. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
  2529. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
  2530. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
  2531. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
  2532. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
  2533. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
  2534. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
  2535. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
  2536. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (19)
  2537. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
  2538. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
  2539. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
  2540. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
  2541. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
  2542. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
  2543. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
  2544. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
  2545. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
  2546. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (11)
  2547. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
  2548. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
  2549. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
  2550. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
  2551. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
  2552. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
  2553. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
  2554. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
  2555. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
  2556. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (10)
  2557. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
  2558. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
  2559. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
  2560. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
  2561. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
  2562. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
  2563. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
  2564. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
  2565. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
  2566. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (20)
  2567. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (6)
  2568. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (6)
  2569. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (6)
  2570. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (6)
  2571. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (20)
  2572. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (2)
  2573. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (3)
  2574. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (2)
  2575. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (12)
  2576. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (2)
  2577. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (2)
  2578. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (2)
  2579. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (3)
  2580. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (17)
  2581. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
  2582. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
  2583. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
  2584. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
  2585. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
  2586. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
  2587. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
  2588. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
  2589. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
  2590. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
  2591. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
  2592. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
  2593. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
  2594. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
  2595. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
  2596. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
  2597. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
  2598. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
  2599. T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
  2600. T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
  2601. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  2602. (2)
  2603. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
  2604. 1000000000000}) (1)
  2605. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_883 plus all
  2606. above-Unicode code points)
  2607. \p{Nv: *} \p{Numeric_Value: *}
  2608. X \p{OCR} \p{Optical_Character_Recognition} (=
  2609. \p{Block=Optical_Character_Recognition})
  2610. (32)
  2611. \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
  2612. \p{Block=Ogham}) (29)
  2613. \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
  2614. \p{Block=Ogham}) (29)
  2615. \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
  2616. \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
  2617. \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
  2618. NOT \p{Block=Old_Italic}) (35)
  2619. \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
  2620. NOT \p{Block=Old_Persian}) (50)
  2621. \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
  2622. \p{Sarb}) (32)
  2623. \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
  2624. NOT \p{Block=Old_Turkic}) (73)
  2625. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  2626. (Short: \p{Ps}) (74)
  2627. X \p{Optical_Character_Recognition} \p{Block=
  2628. Optical_Character_Recognition} (Short:
  2629. \p{InOCR}) (32)
  2630. \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
  2631. \p{Block=Oriya}) (90)
  2632. \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
  2633. (NOT \p{Block=Old_Turkic}) (73)
  2634. \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
  2635. \p{Block=Oriya}) (90)
  2636. \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
  2637. \p{Block=Osmanya}) (40)
  2638. \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
  2639. \p{Block=Osmanya}) (40)
  2640. \p{Other} \p{General_Category=Other} (Short: \p{C})
  2641. (1_004_135 plus all above-Unicode code
  2642. points)
  2643. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  2644. \p{Lo}) (97_553)
  2645. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  2646. \p{No}) (464)
  2647. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  2648. (Short: \p{Po}) (434)
  2649. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  2650. \p{So}) (4404)
  2651. \p{P} \pP \p{Punct} (= \p{General_Category=
  2652. Punctuation}) (NOT
  2653. \p{General_Punctuation}) (636)
  2654. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  2655. (Short: \p{Zp}) (1)
  2656. \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
  2657. Y}) (2760)
  2658. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  2659. \p{Pat_WS} \p{Pattern_White_Space} (=
  2660. \p{Pattern_White_Space=Y}) (11)
  2661. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  2662. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  2663. (2760)
  2664. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  2665. (1_111_352 plus all above-Unicode code
  2666. points)
  2667. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
  2668. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  2669. \p{PatWS}) (11)
  2670. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  2671. (1_114_101 plus all above-Unicode code
  2672. points)
  2673. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
  2674. \p{Pc} \p{Connector_Punctuation} (=
  2675. \p{General_Category=
  2676. Connector_Punctuation}) (10)
  2677. \p{Pd} \p{Dash_Punctuation} (=
  2678. \p{General_Category=Dash_Punctuation})
  2679. (23)
  2680. \p{Pe} \p{Close_Punctuation} (=
  2681. \p{General_Category=Close_Punctuation})
  2682. (73)
  2683. \p{PerlSpace} \s, restricted to ASCII = [ \f\n\r\t] plus
  2684. vertical tab (6)
  2685. \p{PerlWord} \w, restricted to ASCII = [A-Za-z0-9_] (63)
  2686. \p{Pf} \p{Final_Punctuation} (=
  2687. \p{General_Category=Final_Punctuation})
  2688. (10)
  2689. \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
  2690. \p{Block=Phags_Pa}) (56)
  2691. \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
  2692. \p{Block=Phags_Pa}) (56)
  2693. X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
  2694. Phaistos_Disc}) (48)
  2695. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
  2696. \p{InPhaistos}) (48)
  2697. \p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script=
  2698. Inscriptional_Pahlavi}) (NOT \p{Block=
  2699. Inscriptional_Pahlavi}) (27)
  2700. \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
  2701. (NOT \p{Block=Phoenician}) (29)
  2702. \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
  2703. NOT \p{Block=Phoenician}) (29)
  2704. X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
  2705. Phonetic_Extensions}) (128)
  2706. X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
  2707. \p{Block=
  2708. Phonetic_Extensions_Supplement}) (64)
  2709. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
  2710. \p{InPhoneticExt}) (128)
  2711. X \p{Phonetic_Extensions_Supplement} \p{Block=
  2712. Phonetic_Extensions_Supplement} (Short:
  2713. \p{InPhoneticExtSup}) (64)
  2714. \p{Pi} \p{Initial_Punctuation} (=
  2715. \p{General_Category=
  2716. Initial_Punctuation}) (12)
  2717. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  2718. \p{Plrd} \p{Miao} (= \p{Script=Miao}) (NOT
  2719. \p{Block=Miao}) (133)
  2720. \p{Po} \p{Other_Punctuation} (=
  2721. \p{General_Category=Other_Punctuation})
  2722. (434)
  2723. \p{PosixAlnum} [A-Za-z0-9] (62)
  2724. \p{PosixAlpha} [A-Za-z] (52)
  2725. \p{PosixBlank} \t and ' ' (2)
  2726. \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
  2727. ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
  2728. FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
  2729. NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
  2730. GS, RS, US, and DEL (33)
  2731. \p{PosixDigit} [0-9] (10)
  2732. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-
  2733. z] (94)
  2734. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  2735. \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=
  2736. >?@[\\]^_`{|}~] (95)
  2737. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  2738. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  2739. vertical tab) (6)
  2740. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  2741. \p{PosixWord} \p{PerlWord} (63)
  2742. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
  2743. \p{AHex}) (22)
  2744. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  2745. extension) (33_979)
  2746. T \p{Present_In: 2.0} Code point's usage introduced in version
  2747. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  2748. extension) (178_500)
  2749. T \p{Present_In: 2.1} Code point's usage introduced in version
  2750. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  2751. extension) (178_502)
  2752. T \p{Present_In: 3.0} Code point's usage introduced in version
  2753. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  2754. extension) (188_809)
  2755. T \p{Present_In: 3.1} Code point's usage introduced in version
  2756. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  2757. extension) (233_787)
  2758. T \p{Present_In: 3.2} Code point's usage introduced in version
  2759. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  2760. extension) (234_803)
  2761. T \p{Present_In: 4.0} Code point's usage introduced in version
  2762. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  2763. extension) (236_029)
  2764. T \p{Present_In: 4.1} Code point's usage introduced in version
  2765. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  2766. extension) (237_302)
  2767. T \p{Present_In: 5.0} Code point's usage introduced in version
  2768. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  2769. extension) (238_671)
  2770. T \p{Present_In: 5.1} Code point's usage introduced in version
  2771. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  2772. extension) (240_295)
  2773. T \p{Present_In: 5.2} Code point's usage introduced in version
  2774. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  2775. extension) (246_943)
  2776. T \p{Present_In: 6.0} Code point's usage introduced in version
  2777. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  2778. extension) (249_031)
  2779. T \p{Present_In: 6.1} Code point's usage introduced in version
  2780. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  2781. extension) (249_763)
  2782. T \p{Present_In: 6.2} Code point's usage introduced in version
  2783. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  2784. extension) (249_764)
  2785. T \p{Present_In: 6.3} Code point's usage introduced in version
  2786. 6.3 or earlier (Short: \p{In=6.3}) (Perl
  2787. extension) (249_769)
  2788. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  2789. Unassigned}) (Perl extension) (864_343
  2790. plus all above-Unicode code points)
  2791. \p{Print} Characters that are graphical plus space
  2792. characters (but no controls) (247_588)
  2793. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  2794. \p{Co}; NOT \p{Private_Use_Area})
  2795. (137_468)
  2796. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  2797. \p{InPUA}) (6400)
  2798. \p{Prti} \p{Inscriptional_Parthian} (= \p{Script=
  2799. Inscriptional_Parthian}) (NOT \p{Block=
  2800. Inscriptional_Parthian}) (30)
  2801. \p{Ps} \p{Open_Punctuation} (=
  2802. \p{General_Category=Open_Punctuation})
  2803. (74)
  2804. X \p{PUA} \p{Private_Use_Area} (= \p{Block=
  2805. Private_Use_Area}) (6400)
  2806. \p{Punct} \p{General_Category=Punctuation} (Short:
  2807. \p{P}; NOT \p{General_Punctuation}) (636)
  2808. \p{Punctuation} \p{Punct} (= \p{General_Category=
  2809. Punctuation}) (NOT
  2810. \p{General_Punctuation}) (636)
  2811. \p{Qaac} \p{Coptic} (= \p{Script=Coptic}) (NOT
  2812. \p{Block=Coptic}) (137)
  2813. \p{Qaai} \p{Inherited} (= \p{Script=Inherited})
  2814. (523)
  2815. \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
  2816. Y}) (29)
  2817. \p{QMark: *} \p{Quotation_Mark: *}
  2818. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  2819. (29)
  2820. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_083
  2821. plus all above-Unicode code points)
  2822. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (29)
  2823. \p{Radical} \p{Radical=Y} (329)
  2824. \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
  2825. above-Unicode code points)
  2826. \p{Radical: Y*} (Single: \p{Radical}) (329)
  2827. \p{Rejang} \p{Script=Rejang} (Short: \p{Rjng}; NOT
  2828. \p{Block=Rejang}) (37)
  2829. \p{Rjng} \p{Rejang} (= \p{Script=Rejang}) (NOT
  2830. \p{Block=Rejang}) (37)
  2831. X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
  2832. Rumi_Numeral_Symbols}) (32)
  2833. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
  2834. \p{InRumi}) (32)
  2835. \p{Runic} \p{Script=Runic} (Short: \p{Runr}; NOT
  2836. \p{Block=Runic}) (78)
  2837. \p{Runr} \p{Runic} (= \p{Script=Runic}) (NOT
  2838. \p{Block=Runic}) (78)
  2839. \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
  2840. (5516)
  2841. \p{Samaritan} \p{Script=Samaritan} (Short: \p{Samr}; NOT
  2842. \p{Block=Samaritan}) (61)
  2843. \p{Samr} \p{Samaritan} (= \p{Script=Samaritan})
  2844. (NOT \p{Block=Samaritan}) (61)
  2845. \p{Sarb} \p{Old_South_Arabian} (= \p{Script=
  2846. Old_South_Arabian}) (32)
  2847. \p{Saur} \p{Saurashtra} (= \p{Script=Saurashtra})
  2848. (NOT \p{Block=Saurashtra}) (81)
  2849. \p{Saurashtra} \p{Script=Saurashtra} (Short: \p{Saur};
  2850. NOT \p{Block=Saurashtra}) (81)
  2851. \p{SB: *} \p{Sentence_Break: *}
  2852. \p{Sc} \p{Currency_Symbol} (=
  2853. \p{General_Category=Currency_Symbol})
  2854. (49)
  2855. \p{Sc: *} \p{Script: *}
  2856. \p{Script: Arab} \p{Script=Arabic} (1236)
  2857. \p{Script: Arabic} (Short: \p{Sc=Arab}, \p{Arab}) (1236)
  2858. \p{Script: Armenian} (Short: \p{Sc=Armn}, \p{Armn}) (91)
  2859. \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
  2860. \p{Script: Armn} \p{Script=Armenian} (91)
  2861. \p{Script: Avestan} (Short: \p{Sc=Avst}, \p{Avst}) (61)
  2862. \p{Script: Avst} \p{Script=Avestan} (61)
  2863. \p{Script: Bali} \p{Script=Balinese} (121)
  2864. \p{Script: Balinese} (Short: \p{Sc=Bali}, \p{Bali}) (121)
  2865. \p{Script: Bamu} \p{Script=Bamum} (657)
  2866. \p{Script: Bamum} (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
  2867. \p{Script: Batak} (Short: \p{Sc=Batk}, \p{Batk}) (56)
  2868. \p{Script: Batk} \p{Script=Batak} (56)
  2869. \p{Script: Beng} \p{Script=Bengali} (92)
  2870. \p{Script: Bengali} (Short: \p{Sc=Beng}, \p{Beng}) (92)
  2871. \p{Script: Bopo} \p{Script=Bopomofo} (70)
  2872. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
  2873. \p{Script: Brah} \p{Script=Brahmi} (108)
  2874. \p{Script: Brahmi} (Short: \p{Sc=Brah}, \p{Brah}) (108)
  2875. \p{Script: Brai} \p{Script=Braille} (256)
  2876. \p{Script: Braille} (Short: \p{Sc=Brai}, \p{Brai}) (256)
  2877. \p{Script: Bugi} \p{Script=Buginese} (30)
  2878. \p{Script: Buginese} (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
  2879. \p{Script: Buhd} \p{Script=Buhid} (20)
  2880. \p{Script: Buhid} (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
  2881. \p{Script: Cakm} \p{Script=Chakma} (67)
  2882. \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
  2883. (710)
  2884. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
  2885. \p{Script: Cari} \p{Script=Carian} (49)
  2886. \p{Script: Carian} (Short: \p{Sc=Cari}, \p{Cari}) (49)
  2887. \p{Script: Chakma} (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
  2888. \p{Script: Cham} (Short: \p{Sc=Cham}, \p{Cham}) (83)
  2889. \p{Script: Cher} \p{Script=Cherokee} (85)
  2890. \p{Script: Cherokee} (Short: \p{Sc=Cher}, \p{Cher}) (85)
  2891. \p{Script: Common} (Short: \p{Sc=Zyyy}, \p{Zyyy}) (6418)
  2892. \p{Script: Copt} \p{Script=Coptic} (137)
  2893. \p{Script: Coptic} (Short: \p{Sc=Copt}, \p{Copt}) (137)
  2894. \p{Script: Cprt} \p{Script=Cypriot} (55)
  2895. \p{Script: Cuneiform} (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
  2896. \p{Script: Cypriot} (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
  2897. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}, \p{Cyrl}) (417)
  2898. \p{Script: Cyrl} \p{Script=Cyrillic} (417)
  2899. \p{Script: Deseret} (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
  2900. \p{Script: Deva} \p{Script=Devanagari} (151)
  2901. \p{Script: Devanagari} (Short: \p{Sc=Deva}, \p{Deva}) (151)
  2902. \p{Script: Dsrt} \p{Script=Deseret} (80)
  2903. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
  2904. \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
  2905. (1071)
  2906. \p{Script: Ethi} \p{Script=Ethiopic} (495)
  2907. \p{Script: Ethiopic} (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
  2908. \p{Script: Geor} \p{Script=Georgian} (127)
  2909. \p{Script: Georgian} (Short: \p{Sc=Geor}, \p{Geor}) (127)
  2910. \p{Script: Glag} \p{Script=Glagolitic} (94)
  2911. \p{Script: Glagolitic} (Short: \p{Sc=Glag}, \p{Glag}) (94)
  2912. \p{Script: Goth} \p{Script=Gothic} (27)
  2913. \p{Script: Gothic} (Short: \p{Sc=Goth}, \p{Goth}) (27)
  2914. \p{Script: Greek} (Short: \p{Sc=Grek}, \p{Grek}) (511)
  2915. \p{Script: Grek} \p{Script=Greek} (511)
  2916. \p{Script: Gujarati} (Short: \p{Sc=Gujr}, \p{Gujr}) (84)
  2917. \p{Script: Gujr} \p{Script=Gujarati} (84)
  2918. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}, \p{Guru}) (79)
  2919. \p{Script: Guru} \p{Script=Gurmukhi} (79)
  2920. \p{Script: Han} (Short: \p{Sc=Han}, \p{Han}) (75_963)
  2921. \p{Script: Hang} \p{Script=Hangul} (11_739)
  2922. \p{Script: Hangul} (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
  2923. \p{Script: Hani} \p{Script=Han} (75_963)
  2924. \p{Script: Hano} \p{Script=Hanunoo} (21)
  2925. \p{Script: Hanunoo} (Short: \p{Sc=Hano}, \p{Hano}) (21)
  2926. \p{Script: Hebr} \p{Script=Hebrew} (133)
  2927. \p{Script: Hebrew} (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
  2928. \p{Script: Hira} \p{Script=Hiragana} (91)
  2929. \p{Script: Hiragana} (Short: \p{Sc=Hira}, \p{Hira}) (91)
  2930. \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
  2931. \p{Script: Inherited} (Short: \p{Sc=Zinh}, \p{Zinh}) (523)
  2932. \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
  2933. (27)
  2934. \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
  2935. (30)
  2936. \p{Script: Ital} \p{Script=Old_Italic} (35)
  2937. \p{Script: Java} \p{Script=Javanese} (90)
  2938. \p{Script: Javanese} (Short: \p{Sc=Java}, \p{Java}) (90)
  2939. \p{Script: Kaithi} (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
  2940. \p{Script: Kali} \p{Script=Kayah_Li} (48)
  2941. \p{Script: Kana} \p{Script=Katakana} (300)
  2942. \p{Script: Kannada} (Short: \p{Sc=Knda}, \p{Knda}) (86)
  2943. \p{Script: Katakana} (Short: \p{Sc=Kana}, \p{Kana}) (300)
  2944. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}, \p{Kali}) (48)
  2945. \p{Script: Khar} \p{Script=Kharoshthi} (65)
  2946. \p{Script: Kharoshthi} (Short: \p{Sc=Khar}, \p{Khar}) (65)
  2947. \p{Script: Khmer} (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
  2948. \p{Script: Khmr} \p{Script=Khmer} (146)
  2949. \p{Script: Knda} \p{Script=Kannada} (86)
  2950. \p{Script: Kthi} \p{Script=Kaithi} (66)
  2951. \p{Script: Lana} \p{Script=Tai_Tham} (127)
  2952. \p{Script: Lao} (Short: \p{Sc=Lao}, \p{Lao}) (67)
  2953. \p{Script: Laoo} \p{Script=Lao} (67)
  2954. \p{Script: Latin} (Short: \p{Sc=Latn}, \p{Latn}) (1272)
  2955. \p{Script: Latn} \p{Script=Latin} (1272)
  2956. \p{Script: Lepc} \p{Script=Lepcha} (74)
  2957. \p{Script: Lepcha} (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
  2958. \p{Script: Limb} \p{Script=Limbu} (66)
  2959. \p{Script: Limbu} (Short: \p{Sc=Limb}, \p{Limb}) (66)
  2960. \p{Script: Linb} \p{Script=Linear_B} (211)
  2961. \p{Script: Linear_B} (Short: \p{Sc=Linb}, \p{Linb}) (211)
  2962. \p{Script: Lisu} (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
  2963. \p{Script: Lyci} \p{Script=Lycian} (29)
  2964. \p{Script: Lycian} (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
  2965. \p{Script: Lydi} \p{Script=Lydian} (27)
  2966. \p{Script: Lydian} (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
  2967. \p{Script: Malayalam} (Short: \p{Sc=Mlym}, \p{Mlym}) (98)
  2968. \p{Script: Mand} \p{Script=Mandaic} (29)
  2969. \p{Script: Mandaic} (Short: \p{Sc=Mand}, \p{Mand}) (29)
  2970. \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
  2971. \p{Script: Merc} \p{Script=Meroitic_Cursive} (26)
  2972. \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
  2973. \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}, \p{Merc}) (26)
  2974. \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}, \p{Mero})
  2975. (32)
  2976. \p{Script: Miao} (Short: \p{Sc=Miao}, \p{Miao}) (133)
  2977. \p{Script: Mlym} \p{Script=Malayalam} (98)
  2978. \p{Script: Mong} \p{Script=Mongolian} (153)
  2979. \p{Script: Mongolian} (Short: \p{Sc=Mong}, \p{Mong}) (153)
  2980. \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
  2981. \p{Script: Myanmar} (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
  2982. \p{Script: Mymr} \p{Script=Myanmar} (188)
  2983. \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
  2984. \p{Script: Nko} (Short: \p{Sc=Nko}, \p{Nko}) (59)
  2985. \p{Script: Nkoo} \p{Script=Nko} (59)
  2986. \p{Script: Ogam} \p{Script=Ogham} (29)
  2987. \p{Script: Ogham} (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
  2988. \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}, \p{Olck}) (48)
  2989. \p{Script: Olck} \p{Script=Ol_Chiki} (48)
  2990. \p{Script: Old_Italic} (Short: \p{Sc=Ital}, \p{Ital}) (35)
  2991. \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
  2992. \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
  2993. \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
  2994. \p{Script: Oriya} (Short: \p{Sc=Orya}, \p{Orya}) (90)
  2995. \p{Script: Orkh} \p{Script=Old_Turkic} (73)
  2996. \p{Script: Orya} \p{Script=Oriya} (90)
  2997. \p{Script: Osma} \p{Script=Osmanya} (40)
  2998. \p{Script: Osmanya} (Short: \p{Sc=Osma}, \p{Osma}) (40)
  2999. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  3000. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}, \p{Phag}) (56)
  3001. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
  3002. \p{Script: Phnx} \p{Script=Phoenician} (29)
  3003. \p{Script: Phoenician} (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
  3004. \p{Script: Plrd} \p{Script=Miao} (133)
  3005. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
  3006. \p{Script: Qaac} \p{Script=Coptic} (137)
  3007. \p{Script: Qaai} \p{Script=Inherited} (523)
  3008. \p{Script: Rejang} (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
  3009. \p{Script: Rjng} \p{Script=Rejang} (37)
  3010. \p{Script: Runic} (Short: \p{Sc=Runr}, \p{Runr}) (78)
  3011. \p{Script: Runr} \p{Script=Runic} (78)
  3012. \p{Script: Samaritan} (Short: \p{Sc=Samr}, \p{Samr}) (61)
  3013. \p{Script: Samr} \p{Script=Samaritan} (61)
  3014. \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
  3015. \p{Script: Saur} \p{Script=Saurashtra} (81)
  3016. \p{Script: Saurashtra} (Short: \p{Sc=Saur}, \p{Saur}) (81)
  3017. \p{Script: Sharada} (Short: \p{Sc=Shrd}, \p{Shrd}) (83)
  3018. \p{Script: Shavian} (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
  3019. \p{Script: Shaw} \p{Script=Shavian} (48)
  3020. \p{Script: Shrd} \p{Script=Sharada} (83)
  3021. \p{Script: Sinh} \p{Script=Sinhala} (80)
  3022. \p{Script: Sinhala} (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
  3023. \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
  3024. \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}, \p{Sora}) (35)
  3025. \p{Script: Sund} \p{Script=Sundanese} (72)
  3026. \p{Script: Sundanese} (Short: \p{Sc=Sund}, \p{Sund}) (72)
  3027. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  3028. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
  3029. \p{Script: Syrc} \p{Script=Syriac} (77)
  3030. \p{Script: Syriac} (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
  3031. \p{Script: Tagalog} (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
  3032. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  3033. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
  3034. \p{Script: Tai_Le} (Short: \p{Sc=Tale}, \p{Tale}) (35)
  3035. \p{Script: Tai_Tham} (Short: \p{Sc=Lana}, \p{Lana}) (127)
  3036. \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
  3037. \p{Script: Takr} \p{Script=Takri} (66)
  3038. \p{Script: Takri} (Short: \p{Sc=Takr}, \p{Takr}) (66)
  3039. \p{Script: Tale} \p{Script=Tai_Le} (35)
  3040. \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
  3041. \p{Script: Tamil} (Short: \p{Sc=Taml}, \p{Taml}) (72)
  3042. \p{Script: Taml} \p{Script=Tamil} (72)
  3043. \p{Script: Tavt} \p{Script=Tai_Viet} (72)
  3044. \p{Script: Telu} \p{Script=Telugu} (93)
  3045. \p{Script: Telugu} (Short: \p{Sc=Telu}, \p{Telu}) (93)
  3046. \p{Script: Tfng} \p{Script=Tifinagh} (59)
  3047. \p{Script: Tglg} \p{Script=Tagalog} (20)
  3048. \p{Script: Thaa} \p{Script=Thaana} (50)
  3049. \p{Script: Thaana} (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
  3050. \p{Script: Thai} (Short: \p{Sc=Thai}, \p{Thai}) (86)
  3051. \p{Script: Tibetan} (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
  3052. \p{Script: Tibt} \p{Script=Tibetan} (207)
  3053. \p{Script: Tifinagh} (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
  3054. \p{Script: Ugar} \p{Script=Ugaritic} (31)
  3055. \p{Script: Ugaritic} (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
  3056. \p{Script: Unknown} (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_003_925
  3057. plus all above-Unicode code points)
  3058. \p{Script: Vai} (Short: \p{Sc=Vai}, \p{Vai}) (300)
  3059. \p{Script: Vaii} \p{Script=Vai} (300)
  3060. \p{Script: Xpeo} \p{Script=Old_Persian} (50)
  3061. \p{Script: Xsux} \p{Script=Cuneiform} (982)
  3062. \p{Script: Yi} (Short: \p{Sc=Yi}, \p{Yi}) (1220)
  3063. \p{Script: Yiii} \p{Script=Yi} (1220)
  3064. \p{Script: Zinh} \p{Script=Inherited} (523)
  3065. \p{Script: Zyyy} \p{Script=Common} (6418)
  3066. \p{Script: Zzzz} \p{Script=Unknown} (1_003_925 plus all
  3067. above-Unicode code points)
  3068. \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1263)
  3069. \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}) (1263)
  3070. \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}) (92)
  3071. \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
  3072. (31)
  3073. \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (92)
  3074. \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}) (61)
  3075. \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  3076. \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  3077. \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}) (121)
  3078. \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  3079. \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}) (657)
  3080. \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}) (56)
  3081. \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  3082. \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (94)
  3083. \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}) (94)
  3084. \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (306)
  3085. \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}) (306)
  3086. \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (108)
  3087. \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}) (108)
  3088. \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  3089. \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}) (256)
  3090. \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  3091. \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}) (31)
  3092. \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  3093. \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}) (22)
  3094. \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
  3095. \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans})
  3096. (710)
  3097. \p{Script_Extensions: Cans} \p{Script_Extensions=
  3098. Canadian_Aboriginal} (710)
  3099. \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  3100. \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}) (49)
  3101. \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}) (87)
  3102. \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}) (83)
  3103. \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (85)
  3104. \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}) (85)
  3105. \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}) (6061)
  3106. \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (137)
  3107. \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}) (137)
  3108. \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  3109. \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}) (982)
  3110. \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}) (112)
  3111. \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}) (419)
  3112. \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (419)
  3113. \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}) (80)
  3114. \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (193)
  3115. \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}) (193)
  3116. \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  3117. \p{Script_Extensions: Egyp} \p{Script_Extensions=
  3118. Egyptian_Hieroglyphs} (1071)
  3119. \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp})
  3120. (1071)
  3121. \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  3122. \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}) (495)
  3123. \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (128)
  3124. \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}) (128)
  3125. \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (94)
  3126. \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}) (94)
  3127. \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  3128. \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}) (27)
  3129. \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}) (515)
  3130. \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (515)
  3131. \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}) (94)
  3132. \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (94)
  3133. \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}) (91)
  3134. \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (91)
  3135. \p{Script_Extensions: Han} (Short: \p{Scx=Han}) (76_218)
  3136. \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_971)
  3137. \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}) (11_971)
  3138. \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (76_218)
  3139. \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  3140. \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}) (23)
  3141. \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
  3142. \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}) (133)
  3143. \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (356)
  3144. \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}) (356)
  3145. \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}) (31)
  3146. \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}) (459)
  3147. \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli})
  3148. (27)
  3149. \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
  3150. Prti}) (30)
  3151. \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (35)
  3152. \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  3153. \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}) (91)
  3154. \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}) (86)
  3155. \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  3156. \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (565)
  3157. \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}) (86)
  3158. \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}) (565)
  3159. \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}) (48)
  3160. \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
  3161. \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}) (65)
  3162. \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}) (146)
  3163. \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  3164. \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (86)
  3165. \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
  3166. \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  3167. \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}) (67)
  3168. \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
  3169. \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}) (1289)
  3170. \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1289)
  3171. \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  3172. \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}) (74)
  3173. \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (66)
  3174. \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}) (66)
  3175. \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  3176. \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}) (268)
  3177. \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}) (48)
  3178. \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  3179. \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}) (29)
  3180. \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  3181. \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}) (27)
  3182. \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}) (98)
  3183. \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  3184. \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}) (30)
  3185. \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}) (79)
  3186. \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
  3187. (26)
  3188. \p{Script_Extensions: Mero} \p{Script_Extensions=
  3189. Meroitic_Hieroglyphs} (32)
  3190. \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}) (26)
  3191. \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero})
  3192. (32)
  3193. \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}) (133)
  3194. \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (98)
  3195. \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (156)
  3196. \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}) (156)
  3197. \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  3198. \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}) (188)
  3199. \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (188)
  3200. \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}) (83)
  3201. \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}) (59)
  3202. \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
  3203. \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  3204. \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}) (29)
  3205. \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}) (48)
  3206. \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  3207. \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}) (35)
  3208. \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}) (50)
  3209. \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}) (32)
  3210. \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}) (73)
  3211. \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}) (92)
  3212. \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  3213. \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (92)
  3214. \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  3215. \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}) (40)
  3216. \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  3217. \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}) (59)
  3218. \p{Script_Extensions: Phli} \p{Script_Extensions=
  3219. Inscriptional_Pahlavi} (27)
  3220. \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  3221. \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}) (29)
  3222. \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
  3223. \p{Script_Extensions: Prti} \p{Script_Extensions=
  3224. Inscriptional_Parthian} (30)
  3225. \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (137)
  3226. \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (459)
  3227. \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}) (37)
  3228. \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  3229. \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}) (78)
  3230. \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (78)
  3231. \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}) (61)
  3232. \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  3233. \p{Script_Extensions: Sarb} \p{Script_Extensions=
  3234. Old_South_Arabian} (32)
  3235. \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (81)
  3236. \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}) (81)
  3237. \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}) (83)
  3238. \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}) (48)
  3239. \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  3240. \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (83)
  3241. \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (80)
  3242. \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}) (80)
  3243. \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  3244. \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}) (35)
  3245. \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  3246. \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}) (72)
  3247. \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (54)
  3248. \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}) (54)
  3249. \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (94)
  3250. \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}) (94)
  3251. \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}) (22)
  3252. \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  3253. \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}) (20)
  3254. \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}) (45)
  3255. \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}) (127)
  3256. \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}) (72)
  3257. \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
  3258. \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}) (78)
  3259. \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  3260. \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  3261. \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}) (72)
  3262. \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (72)
  3263. \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  3264. \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (93)
  3265. \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}) (93)
  3266. \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  3267. \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  3268. \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
  3269. \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}) (66)
  3270. \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}) (86)
  3271. \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}) (207)
  3272. \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  3273. \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}) (59)
  3274. \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  3275. \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}) (31)
  3276. \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}) (1_003_925
  3277. plus all above-Unicode code points)
  3278. \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}) (300)
  3279. \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  3280. \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  3281. \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (982)
  3282. \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}) (1246)
  3283. \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  3284. \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (459)
  3285. \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6061)
  3286. \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown}
  3287. (1_003_925 plus all above-Unicode code
  3288. points)
  3289. \p{Scx: *} \p{Script_Extensions: *}
  3290. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  3291. \p{SD: *} \p{Soft_Dotted: *}
  3292. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  3293. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  3294. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (181)
  3295. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (181)
  3296. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  3297. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1649)
  3298. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1649)
  3299. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (143)
  3300. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (143)
  3301. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (97_841)
  3302. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  3303. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (1933)
  3304. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1933)
  3305. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (452)
  3306. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (452)
  3307. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (97_841)
  3308. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_010_264 plus all
  3309. above-Unicode code points)
  3310. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  3311. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  3312. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  3313. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  3314. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20)
  3315. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (80)
  3316. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (80)
  3317. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1514)
  3318. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1514)
  3319. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_010_264 plus
  3320. all above-Unicode code points)
  3321. \p{Separator} \p{General_Category=Separator} (Short:
  3322. \p{Z}) (19)
  3323. \p{Sharada} \p{Script=Sharada} (Short: \p{Shrd}; NOT
  3324. \p{Block=Sharada}) (83)
  3325. \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
  3326. \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
  3327. \p{Shrd} \p{Sharada} (= \p{Script=Sharada}) (NOT
  3328. \p{Block=Sharada}) (83)
  3329. \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
  3330. \p{Block=Sinhala}) (80)
  3331. \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
  3332. \p{Block=Sinhala}) (80)
  3333. \p{Sk} \p{Modifier_Symbol} (=
  3334. \p{General_Category=Modifier_Symbol})
  3335. (115)
  3336. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  3337. Math_Symbol}) (948)
  3338. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  3339. \p{InSmallForms}) (32)
  3340. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  3341. Small_Form_Variants}) (32)
  3342. \p{So} \p{Other_Symbol} (= \p{General_Category=
  3343. Other_Symbol}) (4404)
  3344. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  3345. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
  3346. all above-Unicode code points)
  3347. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  3348. \p{Sora} \p{Sora_Sompeng} (= \p{Script=
  3349. Sora_Sompeng}) (NOT \p{Block=
  3350. Sora_Sompeng}) (35)
  3351. \p{Sora_Sompeng} \p{Script=Sora_Sompeng} (Short: \p{Sora};
  3352. NOT \p{Block=Sora_Sompeng}) (35)
  3353. \p{Space} \p{White_Space=Y} \s including beyond
  3354. ASCII and vertical tab (25)
  3355. \p{Space: *} \p{White_Space: *}
  3356. \p{Space_Separator} \p{General_Category=Space_Separator}
  3357. (Short: \p{Zs}) (17)
  3358. \p{SpacePerl} \p{XPerlSpace} (25)
  3359. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  3360. \p{Mc}) (352)
  3361. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  3362. (Short: \p{InModifierLetters}) (80)
  3363. X \p{Specials} \p{Block=Specials} (16)
  3364. \p{STerm} \p{STerm=Y} (83)
  3365. \p{STerm: N*} (Single: \P{STerm}) (1_114_029 plus all
  3366. above-Unicode code points)
  3367. \p{STerm: Y*} (Single: \p{STerm}) (83)
  3368. \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
  3369. (NOT \p{Block=Sundanese}) (72)
  3370. \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
  3371. \p{Block=Sundanese}) (72)
  3372. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  3373. Sundanese_Supplement}) (16)
  3374. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  3375. \p{InSundaneseSup}) (16)
  3376. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  3377. Supplemental_Arrows_A}) (16)
  3378. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  3379. Supplemental_Arrows_B}) (128)
  3380. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  3381. \p{Block=
  3382. Supplemental_Mathematical_Operators})
  3383. (256)
  3384. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  3385. \p{Block=
  3386. Supplementary_Private_Use_Area_A})
  3387. (65_536)
  3388. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  3389. \p{Block=
  3390. Supplementary_Private_Use_Area_B})
  3391. (65_536)
  3392. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  3393. Supplemental_Punctuation}) (128)
  3394. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  3395. \p{Block=Superscripts_And_Subscripts})
  3396. (48)
  3397. X \p{Superscripts_And_Subscripts} \p{Block=
  3398. Superscripts_And_Subscripts} (Short:
  3399. \p{InSuperAndSub}) (48)
  3400. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  3401. \p{InSupArrowsA}) (16)
  3402. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  3403. \p{InSupArrowsB}) (128)
  3404. X \p{Supplemental_Mathematical_Operators} \p{Block=
  3405. Supplemental_Mathematical_Operators}
  3406. (Short: \p{InSupMathOperators}) (256)
  3407. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  3408. (Short: \p{InSupPunctuation}) (128)
  3409. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  3410. Supplementary_Private_Use_Area_A}
  3411. (Short: \p{InSupPUAA}) (65_536)
  3412. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  3413. Supplementary_Private_Use_Area_B}
  3414. (Short: \p{InSupPUAB}) (65_536)
  3415. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  3416. \p{Cs}) (2048)
  3417. \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
  3418. Syloti_Nagri}) (NOT \p{Block=
  3419. Syloti_Nagri}) (44)
  3420. \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
  3421. NOT \p{Block=Syloti_Nagri}) (44)
  3422. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  3423. (5516)
  3424. \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
  3425. \p{Block=Syriac}) (77)
  3426. \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
  3427. \p{Block=Syriac}) (77)
  3428. \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
  3429. \p{Block=Tagalog}) (20)
  3430. \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
  3431. \p{Block=Tagbanwa}) (18)
  3432. \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
  3433. \p{Block=Tagbanwa}) (18)
  3434. X \p{Tags} \p{Block=Tags} (128)
  3435. \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
  3436. \p{Block=Tai_Le}) (35)
  3437. \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
  3438. \p{Block=Tai_Tham}) (127)
  3439. \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
  3440. \p{Block=Tai_Viet}) (72)
  3441. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  3442. Tai_Xuan_Jing_Symbols}) (96)
  3443. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  3444. \p{InTaiXuanJing}) (96)
  3445. \p{Takr} \p{Takri} (= \p{Script=Takri}) (NOT
  3446. \p{Block=Takri}) (66)
  3447. \p{Takri} \p{Script=Takri} (Short: \p{Takr}; NOT
  3448. \p{Block=Takri}) (66)
  3449. \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
  3450. \p{Block=Tai_Le}) (35)
  3451. \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
  3452. (NOT \p{Block=New_Tai_Lue}) (83)
  3453. \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
  3454. \p{Block=Tamil}) (72)
  3455. \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
  3456. \p{Block=Tamil}) (72)
  3457. \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
  3458. \p{Block=Tai_Viet}) (72)
  3459. \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
  3460. \p{Block=Telugu}) (93)
  3461. \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
  3462. \p{Block=Telugu}) (93)
  3463. \p{Term} \p{Terminal_Punctuation} (=
  3464. \p{Terminal_Punctuation=Y}) (176)
  3465. \p{Term: *} \p{Terminal_Punctuation: *}
  3466. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  3467. \p{Term}) (176)
  3468. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  3469. (1_113_936 plus all above-Unicode code
  3470. points)
  3471. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (176)
  3472. \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
  3473. \p{Block=Tifinagh}) (59)
  3474. \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
  3475. \p{Block=Tagalog}) (20)
  3476. \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
  3477. \p{Block=Thaana}) (50)
  3478. \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
  3479. \p{Block=Thaana}) (50)
  3480. \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
  3481. \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
  3482. \p{Block=Tibetan}) (207)
  3483. \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
  3484. \p{Block=Tibetan}) (207)
  3485. \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
  3486. \p{Block=Tifinagh}) (59)
  3487. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  3488. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  3489. Cased=Yes) (31)
  3490. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  3491. (Short: \p{Lt}; /i= General_Category=
  3492. Cased_Letter) (31)
  3493. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  3494. Transport_And_Map_Symbols}) (128)
  3495. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  3496. (Short: \p{InTransportAndMap}) (128)
  3497. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  3498. (= \p{Block=
  3499. Unified_Canadian_Aboriginal_Syllabics})
  3500. (640)
  3501. X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
  3502. Extended} (= \p{Block=
  3503. Unified_Canadian_Aboriginal_Syllabics_-
  3504. Extended}) (80)
  3505. \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
  3506. \p{Block=Ugaritic}) (31)
  3507. \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
  3508. \p{Block=Ugaritic}) (31)
  3509. \p{UIdeo} \p{Unified_Ideograph} (=
  3510. \p{Unified_Ideograph=Y}) (74_617)
  3511. \p{UIdeo: *} \p{Unified_Ideograph: *}
  3512. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  3513. \p{Cn}) (864_409 plus all above-Unicode
  3514. code points)
  3515. \p{Unicode} \p{Any} (1_114_112)
  3516. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  3517. Unified_Canadian_Aboriginal_Syllabics}
  3518. (Short: \p{InUCAS}) (640)
  3519. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  3520. Unified_Canadian_Aboriginal_Syllabics_-
  3521. Extended} (Short: \p{InUCASExt}) (80)
  3522. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  3523. (74_617)
  3524. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  3525. (1_039_495 plus all above-Unicode code
  3526. points)
  3527. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_617)
  3528. \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
  3529. (1_003_925 plus all above-Unicode code
  3530. points)
  3531. \p{Upper} \p{Uppercase=Y} (/i= Cased=Yes) (1483)
  3532. \p{Upper: *} \p{Uppercase: *}
  3533. \p{Uppercase} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  3534. Yes) (1483)
  3535. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  3536. No) (1_112_629 plus all above-Unicode
  3537. code points)
  3538. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  3539. Yes) (1483)
  3540. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  3541. (Short: \p{Lu}; /i= General_Category=
  3542. Cased_Letter) (1441)
  3543. \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
  3544. \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
  3545. Vai}) (300)
  3546. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  3547. NOT \p{Variation_Selectors}) (259)
  3548. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
  3549. plus all above-Unicode code points)
  3550. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  3551. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  3552. \p{InVS}) (16)
  3553. X \p{Variation_Selectors_Supplement} \p{Block=
  3554. Variation_Selectors_Supplement} (Short:
  3555. \p{InVSSup}) (240)
  3556. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  3557. Vedic_Extensions}) (48)
  3558. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  3559. \p{InVedicExt}) (48)
  3560. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  3561. \p{VertSpace} \v (7)
  3562. \p{VS} \p{Variation_Selector} (=
  3563. \p{Variation_Selector=Y}) (NOT
  3564. \p{Variation_Selectors}) (259)
  3565. \p{VS: *} \p{Variation_Selector: *}
  3566. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  3567. \p{Block=
  3568. Variation_Selectors_Supplement}) (240)
  3569. \p{WB: *} \p{Word_Break: *}
  3570. \p{White_Space} \p{White_Space=Y} (Short: \p{WSpace}) (25)
  3571. \p{White_Space: N*} (Short: \p{Space=N}, \P{WSpace})
  3572. (1_114_087 plus all above-Unicode code
  3573. points)
  3574. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{WSpace}) (25)
  3575. \p{Word} \w, including beyond ASCII; = \p{Alnum} +
  3576. \pM + \p{Pc} (103_406)
  3577. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (24_867)
  3578. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  3579. \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
  3580. \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
  3581. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
  3582. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1649)
  3583. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
  3584. \p{Word_Break: FO} \p{Word_Break=Format} (142)
  3585. \p{Word_Break: Format} (Short: \p{WB=FO}) (142)
  3586. \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
  3587. \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
  3588. \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  3589. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  3590. \p{Word_Break: LE} \p{Word_Break=ALetter} (24_867)
  3591. \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  3592. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
  3593. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
  3594. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  3595. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
  3596. \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
  3597. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  3598. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  3599. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  3600. \p{Word_Break: NU} \p{Word_Break=Numeric} (451)
  3601. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (451)
  3602. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_086_543 plus all
  3603. above-Unicode code points)
  3604. \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
  3605. \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  3606. \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
  3607. \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
  3608. \p{Word_Break: XX} \p{Word_Break=Other} (1_086_543 plus all
  3609. above-Unicode code points)
  3610. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
  3611. \p{WSpace: *} \p{White_Space: *}
  3612. \p{XDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  3613. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  3614. (103_336)
  3615. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_010_776
  3616. plus all above-Unicode code points)
  3617. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (103_336)
  3618. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (101_217)
  3619. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_012_895
  3620. plus all above-Unicode code points)
  3621. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (101_217)
  3622. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  3623. (103_336)
  3624. \p{XIDC: *} \p{XID_Continue: *}
  3625. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (101_217)
  3626. \p{XIDS: *} \p{XID_Start: *}
  3627. \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
  3628. (NOT \p{Block=Old_Persian}) (50)
  3629. \p{XPerlSpace} \s, including beyond ASCII (Short:
  3630. \p{SpacePerl}) (25)
  3631. \p{XPosixAlnum} \p{Alnum} (102_619)
  3632. \p{XPosixAlpha} \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
  3633. \p{XPosixBlank} \p{Blank} (18)
  3634. \p{XPosixCntrl} \p{Cntrl} (= \p{General_Category=Control})
  3635. (65)
  3636. \p{XPosixDigit} \p{Digit} (= \p{General_Category=
  3637. Decimal_Number}) (460)
  3638. \p{XPosixGraph} \p{Graph} (247_571)
  3639. \p{XPosixLower} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
  3640. Yes) (1934)
  3641. \p{XPosixPrint} \p{Print} (247_588)
  3642. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (645)
  3643. \p{XPosixSpace} \p{Space} (= \p{White_Space=Y}) (25)
  3644. \p{XPosixUpper} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  3645. Yes) (1483)
  3646. \p{XPosixWord} \p{Word} (103_406)
  3647. \p{XPosixXDigit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  3648. \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
  3649. (NOT \p{Block=Cuneiform}) (982)
  3650. \p{Yi} \p{Script=Yi} (1220)
  3651. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  3652. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  3653. \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
  3654. X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
  3655. Yijing_Hexagram_Symbols}) (64)
  3656. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
  3657. (Short: \p{InYijing}) (64)
  3658. \p{Z} \pZ \p{Separator} (= \p{General_Category=
  3659. Separator}) (19)
  3660. \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
  3661. (523)
  3662. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  3663. Line_Separator}) (1)
  3664. \p{Zp} \p{Paragraph_Separator} (=
  3665. \p{General_Category=
  3666. Paragraph_Separator}) (1)
  3667. \p{Zs} \p{Space_Separator} (=
  3668. \p{General_Category=Space_Separator})
  3669. (17)
  3670. \p{Zyyy} \p{Common} (= \p{Script=Common}) (6418)
  3671. \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
  3672. (1_003_925 plus all above-Unicode code
  3673. points)
  3674. TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
  3675. stable) (= \p{Soft_Dotted=Y}) (46)
  3676. TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
  3677. stable) (= \p{Case_Ignorable=Y}) (1806)
  3678. TX\p{_CombAbove} (For internal use by Perl, not necessarily
  3679. stable) (= \p{Canonical_Combining_Class=
  3680. Above}) (349)

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This happens generally either because they are obsolete, or they exist for symmetry with other forms, but no language has yet been encoded that uses them. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
  • \p{Grapheme_Cluster_Break=Prepend}

Properties accessible through Unicode::UCD

All the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.

Due to their nature, not all Unicode character properties are suitable for regular expression matches, nor prop_invlist() . The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. BUT, there are some ambiguities between some Perl extensions and the Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single form equivalents of Unicode properties, such as \p{sc} being a single-form equivalent of \p{gc=sc} , which is treated by prop_invmap() as the Script property, whose short name is sc . The table indicates the current ambiguities in the INFO column, beginning with the word "NOT" .

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are in Other Properties in perlunicode;

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. If a property is obsolete, etc, the entry will be flagged with the same characters used in the table in the section above, like D or S.

  1. NAME INFO
  2. Age
  3. AHex ASCII_Hex_Digit
  4. All (Perl extension). All code points,
  5. including those above Unicode. Same as
  6. qr/./s
  7. Alnum (Perl extension). Alphabetic and
  8. (decimal) Numeric
  9. Alpha Alphabetic
  10. Alphabetic (Short: Alpha)
  11. Any (Perl extension). All Unicode code
  12. points: [\x{0000}-\x{10FFFF}]
  13. ASCII Block=ASCII. (Perl extension).
  14. [[:ASCII:]]
  15. ASCII_Hex_Digit (Short: AHex)
  16. Assigned (Perl extension). All assigned code points
  17. Bc Bidi_Class
  18. Bidi_C Bidi_Control
  19. Bidi_Class (Short: bc)
  20. Bidi_Control (Short: Bidi_C)
  21. Bidi_M Bidi_Mirrored
  22. Bidi_Mirrored (Short: Bidi_M)
  23. Bidi_Mirroring_Glyph (Short: bmg)
  24. Bidi_Paired_Bracket (Short: bpb)
  25. Bidi_Paired_Bracket_Type (Short: bpt)
  26. Blank (Perl extension). \h, Horizontal white
  27. space
  28. Blk Block
  29. Block (Short: blk)
  30. Bmg Bidi_Mirroring_Glyph
  31. Bpb Bidi_Paired_Bracket
  32. Bpt Bidi_Paired_Bracket_Type
  33. Canonical_Combining_Class (Short: ccc)
  34. Case_Folding (Short: cf)
  35. Case_Ignorable (Short: CI)
  36. Cased
  37. Category General_Category
  38. Ccc Canonical_Combining_Class
  39. CE Composition_Exclusion
  40. Cf Case_Folding; NOT 'cf' meaning
  41. 'General_Category=Format'
  42. Changes_When_Casefolded (Short: CWCF)
  43. Changes_When_Casemapped (Short: CWCM)
  44. Changes_When_Lowercased (Short: CWL)
  45. Changes_When_NFKC_Casefolded (Short: CWKCF)
  46. Changes_When_Titlecased (Short: CWT)
  47. Changes_When_Uppercased (Short: CWU)
  48. CI Case_Ignorable
  49. Cntrl General_Category=Cntrl. (Perl extension).
  50. Control characters
  51. Comp_Ex Full_Composition_Exclusion
  52. Composition_Exclusion (Short: CE)
  53. CWCF Changes_When_Casefolded
  54. CWCM Changes_When_Casemapped
  55. CWKCF Changes_When_NFKC_Casefolded
  56. CWL Changes_When_Lowercased
  57. CWT Changes_When_Titlecased
  58. CWU Changes_When_Uppercased
  59. Dash
  60. Decomposition_Mapping (Short: dm)
  61. Decomposition_Type (Short: dt)
  62. Default_Ignorable_Code_Point (Short: DI)
  63. Dep Deprecated
  64. Deprecated (Short: Dep)
  65. DI Default_Ignorable_Code_Point
  66. Dia Diacritic
  67. Diacritic (Short: Dia)
  68. Digit General_Category=Digit. (Perl extension).
  69. [0-9] + all other decimal digits
  70. Dm Decomposition_Mapping
  71. Dt Decomposition_Type
  72. Ea East_Asian_Width
  73. East_Asian_Width (Short: ea)
  74. Ext Extender
  75. Extender (Short: Ext)
  76. Full_Composition_Exclusion (Short: Comp_Ex)
  77. Gc General_Category
  78. GCB Grapheme_Cluster_Break
  79. General_Category (Short: gc)
  80. Gr_Base Grapheme_Base
  81. Gr_Ext Grapheme_Extend
  82. Graph (Perl extension). Characters that are
  83. graphical
  84. Grapheme_Base (Short: Gr_Base)
  85. Grapheme_Cluster_Break (Short: GCB)
  86. Grapheme_Extend (Short: Gr_Ext)
  87. Hangul_Syllable_Type (Short: hst)
  88. Hex Hex_Digit
  89. Hex_Digit (Short: Hex)
  90. HorizSpace Blank. (Perl extension)
  91. Hst Hangul_Syllable_Type
  92. D Hyphen Supplanted by Line_Break property values;
  93. see www.unicode.org/reports/tr14
  94. ID_Continue (Short: IDC)
  95. ID_Start (Short: IDS)
  96. IDC ID_Continue
  97. Ideo Ideographic
  98. Ideographic (Short: Ideo)
  99. IDS ID_Start
  100. IDS_Binary_Operator (Short: IDSB)
  101. IDS_Trinary_Operator (Short: IDST)
  102. IDSB IDS_Binary_Operator
  103. IDST IDS_Trinary_Operator
  104. In Present_In. (Perl extension)
  105. Isc ISO_Comment; NOT 'isc' meaning
  106. 'General_Category=Other'
  107. ISO_Comment (Short: isc)
  108. Jg Joining_Group
  109. Join_C Join_Control
  110. Join_Control (Short: Join_C)
  111. Joining_Group (Short: jg)
  112. Joining_Type (Short: jt)
  113. Jt Joining_Type
  114. Lb Line_Break
  115. Lc Lowercase_Mapping; NOT 'lc' meaning
  116. 'General_Category=Cased_Letter'
  117. Line_Break (Short: lb)
  118. LOE Logical_Order_Exception
  119. Logical_Order_Exception (Short: LOE)
  120. Lower Lowercase
  121. Lowercase (Short: Lower)
  122. Lowercase_Mapping (Short: lc)
  123. Math
  124. Na Name
  125. Na1 Unicode_1_Name
  126. Name (Short: na)
  127. Name_Alias
  128. NChar Noncharacter_Code_Point
  129. NFC_QC NFC_Quick_Check
  130. NFC_Quick_Check (Short: NFC_QC)
  131. NFD_QC NFD_Quick_Check
  132. NFD_Quick_Check (Short: NFD_QC)
  133. NFKC_Casefold (Short: NFKC_CF)
  134. NFKC_CF NFKC_Casefold
  135. NFKC_QC NFKC_Quick_Check
  136. NFKC_Quick_Check (Short: NFKC_QC)
  137. NFKD_QC NFKD_Quick_Check
  138. NFKD_Quick_Check (Short: NFKD_QC)
  139. Noncharacter_Code_Point (Short: NChar)
  140. Nt Numeric_Type
  141. Numeric_Type (Short: nt)
  142. Numeric_Value (Short: nv)
  143. Nv Numeric_Value
  144. Pat_Syn Pattern_Syntax
  145. Pat_WS Pattern_White_Space
  146. Pattern_Syntax (Short: Pat_Syn)
  147. Pattern_White_Space (Short: Pat_WS)
  148. Perl_Decimal_Digit (Perl extension)
  149. PerlSpace (Perl extension). \s, restricted to ASCII
  150. = [ \f\n\r\t] plus vertical tab
  151. PerlWord (Perl extension). \w, restricted to ASCII
  152. = [A-Za-z0-9_]
  153. PosixAlnum (Perl extension). [A-Za-z0-9]
  154. PosixAlpha (Perl extension). [A-Za-z]
  155. PosixBlank (Perl extension). \t and ' '
  156. PosixCntrl (Perl extension). ASCII control
  157. characters: NUL, SOH, STX, ETX, EOT, ENQ,
  158. ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
  159. DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
  160. CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
  161. PosixDigit (Perl extension). [0-9]
  162. PosixGraph (Perl extension). [-!"#$%&'()*+,./:;<=
  163. >?@[\\]^_`{|}~0-9A-Za-z]
  164. PosixLower (Perl extension). [a-z]
  165. PosixPrint (Perl extension). [- 0-9A-Za-
  166. z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  167. PosixPunct (Perl extension). [-!"#$%&'()*+,./:;<=
  168. >?@[\\]^_`{|}~]
  169. PosixSpace (Perl extension). \t, \n, \cK, \f, \r,
  170. and ' '. (\cK is vertical tab)
  171. PosixUpper (Perl extension). [A-Z]
  172. PosixWord PerlWord. (Perl extension)
  173. PosixXDigit (Perl extension). [0-9A-Fa-f]
  174. Present_In (Short: In). (Perl extension)
  175. Print (Perl extension). Characters that are
  176. graphical plus space characters (but no
  177. controls)
  178. Punct General_Category=Punct. (Perl extension)
  179. QMark Quotation_Mark
  180. Quotation_Mark (Short: QMark)
  181. Radical
  182. SB Sentence_Break
  183. Sc Script; NOT 'sc' meaning
  184. 'General_Category=Currency_Symbol'
  185. Scf Simple_Case_Folding
  186. Script (Short: sc)
  187. Script_Extensions (Short: scx)
  188. Scx Script_Extensions
  189. SD Soft_Dotted
  190. Sentence_Break (Short: SB)
  191. Sfc Simple_Case_Folding
  192. Simple_Case_Folding (Short: scf)
  193. Simple_Lowercase_Mapping (Short: slc)
  194. Simple_Titlecase_Mapping (Short: stc)
  195. Simple_Uppercase_Mapping (Short: suc)
  196. Slc Simple_Lowercase_Mapping
  197. Soft_Dotted (Short: SD)
  198. Space White_Space
  199. SpacePerl XPerlSpace. (Perl extension)
  200. Stc Simple_Titlecase_Mapping
  201. STerm
  202. Suc Simple_Uppercase_Mapping
  203. Tc Titlecase_Mapping
  204. Term Terminal_Punctuation
  205. Terminal_Punctuation (Short: Term)
  206. Title Titlecase. (Perl extension)
  207. Titlecase (Short: Title). (Perl extension). (=
  208. \p{Gc=Lt})
  209. Titlecase_Mapping (Short: tc)
  210. Uc Uppercase_Mapping
  211. UIdeo Unified_Ideograph
  212. Unicode Any. (Perl extension)
  213. Unicode_1_Name (Short: na1)
  214. Unified_Ideograph (Short: UIdeo)
  215. Upper Uppercase
  216. Uppercase (Short: Upper)
  217. Uppercase_Mapping (Short: uc)
  218. Variation_Selector (Short: VS)
  219. VertSpace (Perl extension). \v
  220. VS Variation_Selector
  221. WB Word_Break
  222. White_Space (Short: WSpace)
  223. Word (Perl extension). \w, including beyond
  224. ASCII; = \p{Alnum} + \pM + \p{Pc}
  225. Word_Break (Short: WB)
  226. WSpace White_Space
  227. XDigit (Perl extension)
  228. XID_Continue (Short: XIDC)
  229. XID_Start (Short: XIDS)
  230. XIDC XID_Continue
  231. XIDS XID_Start
  232. XPerlSpace (Perl extension). \s, including beyond
  233. ASCII
  234. XPosixAlnum Alnum. (Perl extension)
  235. XPosixAlpha Alpha. (Perl extension)
  236. XPosixBlank Blank. (Perl extension)
  237. XPosixCntrl General_Category=Cntrl. (Perl extension)
  238. XPosixDigit General_Category=Digit. (Perl extension)
  239. XPosixGraph Graph. (Perl extension)
  240. XPosixLower Lower. (Perl extension)
  241. XPosixPrint Print. (Perl extension)
  242. XPosixPunct (Perl extension). \p{Punct} + ASCII-range
  243. \p{Symbol}
  244. XPosixSpace Space. (Perl extension)
  245. XPosixUpper Upper. (Perl extension)
  246. XPosixWord Word. (Perl extension)
  247. XPosixXDigit XDigit. (Perl extension)

Properties accessible through other means

Certain properties are accessible also via core function calls. These are:

  1. Lowercase_Mapping lc() and lcfirst()
  2. Titlecase_Mapping ucfirst()
  3. Uppercase_Mapping uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F transliteration escape, and the fc operator.

And, the Name and Name_Aliases properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions; and functions charnames::viacode() , charnames::vianame() , and charnames::string_vianame() (which require a use charnames (); to be specified.

Finally, most properties related to decomposition are accessible via Unicode::Normalize.

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses). As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Indic_Matra_Category (InMC)
  • Indic_Syllabic_Category (InSC)

    Provisional

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib} /unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib} /unicore/mktables and then re-compiling and installing. (%Config is available from the Config module).

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation tests

  • auxiliary/LBTest.txt
  • auxiliary/SBTest.txt
  • auxiliary/WBTest.txt
  • BidiCharacterTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • Index.txt

    Alphabetical index of Unicode characters

  • IndicMatraCategory.txt
  • IndicSyllabicCategory.txt

    Provisional; for the analysis and processing of Indic scripts

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.html

    Describes the format and contents of NamesList.txt

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • Props.txt

    Only in very early releases; is a subset of PropList.txt (which is used instead)

  • ReadMe.txt

    Documentation

  • StandardizedVariants.html

    Provides a visual display of the standard variant sequences derived from StandardizedVariants.txt.

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base http://www.unicode.org/ivd

  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

  • USourceGlyphs.pdf

    Pictures of the characters in USourceData.txt

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode