Modules

  • ABCDE
  • FGHIL
  • MNOPS
  • TUX

Tools

perluniprops

Perl 5 version 24.0 documentation
Recently read

perluniprops

NAME

perluniprops - Index of Unicode Version 8.0.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is the portion that is defined on single code points. (Other information in the Unicode data base below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See Unicode character properties that are NOT accepted by Perl.)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section Properties accessible through Unicode::UCD.

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see Other Properties in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against, for example, \p{Script: Greek} and \p{Script=Greek} both mean to match characters whose Script property value is Greek.

Single forms, like \p{Greek} , are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is a just a shortcut for \p{Script=Greek} .) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word} . These are also listed in the table.

In parsing these constructs, Perl always ignores Upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek} . But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{} ) to "doesn't match" (for \P{} ). Casing in this document is for improved legibility.

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But in the table below a 'T' at the beginning of an entry means that tighter (stricter) rules are used for that entry:

Some properties are considered obsolete by Unicode, but still available. There are several varieties of obsolescence:

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. The table shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., \pL ), the left column contains both forms, with and without braces.

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation (/i= other_property) in the second column. Under case-insensitive matching they match the same code pode points as the property other_property.

There is no description given for most non-Perl defined properties (See http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  1. \p{Gc: *} \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  1. \p{Is_*} \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar} . "*" here is restricted to something not beginning with an underscore.

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary, and they have all their values spelled out without using this wild card, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card

  • (\d+) in the info column gives the number of Unicode code points matched by this property.

  • D means this is deprecated.

  • O means this is obsolete.

  • S means this is stabilized.

  • T means tighter (stricter) name matching applies.

  • X means use of this form is discouraged, and may not be stable.

  1. NAME INFO
  2. X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
  3. T \p{Age: 1.1} \p{Age=V1_1} (33_979)
  4. T \p{Age: 2.0} \p{Age=V2_0} (144_521)
  5. T \p{Age: 2.1} \p{Age=V2_1} (2)
  6. T \p{Age: 3.0} \p{Age=V3_0} (10_307)
  7. T \p{Age: 3.1} \p{Age=V3_1} (44_978)
  8. T \p{Age: 3.2} \p{Age=V3_2} (1016)
  9. T \p{Age: 4.0} \p{Age=V4_0} (1226)
  10. T \p{Age: 4.1} \p{Age=V4_1} (1273)
  11. T \p{Age: 5.0} \p{Age=V5_0} (1369)
  12. T \p{Age: 5.1} \p{Age=V5_1} (1624)
  13. T \p{Age: 5.2} \p{Age=V5_2} (6648)
  14. T \p{Age: 6.0} \p{Age=V6_0} (2088)
  15. T \p{Age: 6.1} \p{Age=V6_1} (732)
  16. T \p{Age: 6.2} \p{Age=V6_2} (1)
  17. T \p{Age: 6.3} \p{Age=V6_3} (5)
  18. T \p{Age: 7.0} \p{Age=V7_0} (2834)
  19. T \p{Age: 8.0} \p{Age=V8_0} (7716)
  20. \p{Age: NA} \p{Age=Unassigned} (853_793 plus all
  21. above-Unicode code points)
  22. \p{Age: Unassigned} Code point's usage has not been assigned
  23. in any Unicode release thus far. (Short:
  24. \p{Age=NA}) (853_793 plus all above-
  25. Unicode code points)
  26. \p{Age: V1_1} Code point's usage introduced in version
  27. 1.1 (33_979)
  28. \p{Age: V2_0} Code point's usage was introduced in
  29. version 2.0; See also Property
  30. 'Present_In' (144_521)
  31. \p{Age: V2_1} Code point's usage was introduced in
  32. version 2.1; See also Property
  33. 'Present_In' (2)
  34. \p{Age: V3_0} Code point's usage was introduced in
  35. version 3.0; See also Property
  36. 'Present_In' (10_307)
  37. \p{Age: V3_1} Code point's usage was introduced in
  38. version 3.1; See also Property
  39. 'Present_In' (44_978)
  40. \p{Age: V3_2} Code point's usage was introduced in
  41. version 3.2; See also Property
  42. 'Present_In' (1016)
  43. \p{Age: V4_0} Code point's usage was introduced in
  44. version 4.0; See also Property
  45. 'Present_In' (1226)
  46. \p{Age: V4_1} Code point's usage was introduced in
  47. version 4.1; See also Property
  48. 'Present_In' (1273)
  49. \p{Age: V5_0} Code point's usage was introduced in
  50. version 5.0; See also Property
  51. 'Present_In' (1369)
  52. \p{Age: V5_1} Code point's usage was introduced in
  53. version 5.1; See also Property
  54. 'Present_In' (1624)
  55. \p{Age: V5_2} Code point's usage was introduced in
  56. version 5.2; See also Property
  57. 'Present_In' (6648)
  58. \p{Age: V6_0} Code point's usage was introduced in
  59. version 6.0; See also Property
  60. 'Present_In' (2088)
  61. \p{Age: V6_1} Code point's usage was introduced in
  62. version 6.1; See also Property
  63. 'Present_In' (732)
  64. \p{Age: V6_2} Code point's usage was introduced in
  65. version 6.2; See also Property
  66. 'Present_In' (1)
  67. \p{Age: V6_3} Code point's usage was introduced in
  68. version 6.3; See also Property
  69. 'Present_In' (5)
  70. \p{Age: V7_0} Code point's usage was introduced in
  71. version 7.0; See also Property
  72. 'Present_In' (2834)
  73. \p{Age: V8_0} Code point's usage was introduced in
  74. version 8.0; See also Property
  75. 'Present_In' (7716)
  76. \p{Aghb} \p{Caucasian_Albanian} (= \p{Script=
  77. Caucasian_Albanian}) (NOT \p{Block=
  78. Caucasian_Albanian}) (53)
  79. \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  80. (22)
  81. \p{AHex: *} \p{ASCII_Hex_Digit: *}
  82. \p{Ahom} \p{Script=Ahom} (NOT \p{Block=Ahom}) (57)
  83. X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
  84. Alchemical_Symbols}) (128)
  85. X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
  86. \p{InAlchemical}) (128)
  87. \p{All} All code points, including those above
  88. Unicode. Same as qr/./s (1_114_112 plus
  89. all above-Unicode code points)
  90. \p{Alnum} \p{XPosixAlnum} (111_493)
  91. \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
  92. (110_943)
  93. \p{Alpha: *} \p{Alphabetic: *}
  94. \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
  95. (110_943)
  96. \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (1_003_169
  97. plus all above-Unicode code points)
  98. \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (110_943)
  99. X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
  100. \p{Block=Alphabetic_Presentation_Forms})
  101. (80)
  102. X \p{Alphabetic_Presentation_Forms} \p{Block=
  103. Alphabetic_Presentation_Forms} (Short:
  104. \p{InAlphabeticPF}) (80)
  105. \p{Anatolian_Hieroglyphs} \p{Script=Anatolian_Hieroglyphs} (Short:
  106. \p{Hluw}; NOT \p{Block=
  107. Anatolian_Hieroglyphs}) (583)
  108. X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
  109. \p{Block=
  110. Ancient_Greek_Musical_Notation}) (80)
  111. X \p{Ancient_Greek_Musical_Notation} \p{Block=
  112. Ancient_Greek_Musical_Notation} (Short:
  113. \p{InAncientGreekMusic}) (80)
  114. X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
  115. X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
  116. \p{Any} All Unicode code points: [\x{0000}-
  117. \x{10FFFF}] (1_114_112)
  118. \p{Arab} \p{Arabic} (= \p{Script=Arabic}) (NOT
  119. \p{Block=Arabic}) (1257)
  120. \p{Arabic} \p{Script=Arabic} (Short: \p{Arab}; NOT
  121. \p{Block=Arabic}) (1257)
  122. X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
  123. Arabic_Extended_A}) (96)
  124. X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
  125. \p{InArabicExtA}) (96)
  126. X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
  127. (= \p{Block=
  128. Arabic_Mathematical_Alphabetic_Symbols})
  129. (256)
  130. X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
  131. Arabic_Mathematical_Alphabetic_Symbols}
  132. (Short: \p{InArabicMath}) (256)
  133. X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
  134. \p{Block=Arabic_Presentation_Forms_A})
  135. (688)
  136. X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
  137. \p{Block=Arabic_Presentation_Forms_B})
  138. (144)
  139. X \p{Arabic_Presentation_Forms_A} \p{Block=
  140. Arabic_Presentation_Forms_A} (Short:
  141. \p{InArabicPFA}) (688)
  142. X \p{Arabic_Presentation_Forms_B} \p{Block=
  143. Arabic_Presentation_Forms_B} (Short:
  144. \p{InArabicPFB}) (144)
  145. X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
  146. Arabic_Supplement}) (48)
  147. X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
  148. \p{InArabicSup}) (48)
  149. \p{Armenian} \p{Script=Armenian} (Short: \p{Armn}; NOT
  150. \p{Block=Armenian}) (93)
  151. \p{Armi} \p{Imperial_Aramaic} (= \p{Script=
  152. Imperial_Aramaic}) (NOT \p{Block=
  153. Imperial_Aramaic}) (31)
  154. \p{Armn} \p{Armenian} (= \p{Script=Armenian}) (NOT
  155. \p{Block=Armenian}) (93)
  156. X \p{Arrows} \p{Block=Arrows} (112)
  157. \p{ASCII} \p{Block=Basic_Latin} [[:ASCII:]] (128)
  158. \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
  159. (22)
  160. \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
  161. plus all above-Unicode code points)
  162. \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
  163. \p{Assigned} All assigned code points (260_253)
  164. \p{Avestan} \p{Script=Avestan} (Short: \p{Avst}; NOT
  165. \p{Block=Avestan}) (61)
  166. \p{Avst} \p{Avestan} (= \p{Script=Avestan}) (NOT
  167. \p{Block=Avestan}) (61)
  168. \p{Bali} \p{Balinese} (= \p{Script=Balinese}) (NOT
  169. \p{Block=Balinese}) (121)
  170. \p{Balinese} \p{Script=Balinese} (Short: \p{Bali}; NOT
  171. \p{Block=Balinese}) (121)
  172. \p{Bamu} \p{Bamum} (= \p{Script=Bamum}) (NOT
  173. \p{Block=Bamum}) (657)
  174. \p{Bamum} \p{Script=Bamum} (Short: \p{Bamu}; NOT
  175. \p{Block=Bamum}) (657)
  176. X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
  177. Bamum_Supplement}) (576)
  178. X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
  179. \p{InBamumSup}) (576)
  180. X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
  181. \p{Bass} \p{Bassa_Vah} (= \p{Script=Bassa_Vah})
  182. (NOT \p{Block=Bassa_Vah}) (36)
  183. \p{Bassa_Vah} \p{Script=Bassa_Vah} (Short: \p{Bass}; NOT
  184. \p{Block=Bassa_Vah}) (36)
  185. \p{Batak} \p{Script=Batak} (Short: \p{Batk}; NOT
  186. \p{Block=Batak}) (56)
  187. \p{Batk} \p{Batak} (= \p{Script=Batak}) (NOT
  188. \p{Block=Batak}) (56)
  189. \p{Bc: *} \p{Bidi_Class: *}
  190. \p{Beng} \p{Bengali} (= \p{Script=Bengali}) (NOT
  191. \p{Block=Bengali}) (93)
  192. \p{Bengali} \p{Script=Bengali} (Short: \p{Beng}; NOT
  193. \p{Block=Bengali}) (93)
  194. \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
  195. (12)
  196. \p{Bidi_C: *} \p{Bidi_Control: *}
  197. \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1435)
  198. \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (50)
  199. \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1435)
  200. \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (50)
  201. \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
  202. \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
  203. \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016)
  204. \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
  205. \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
  206. \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158)
  207. \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
  208. \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (87)
  209. \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158)
  210. \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
  211. \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87)
  212. \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1)
  213. \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
  214. \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_097_474
  215. plus all above-Unicode code points)
  216. \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_097_474 plus
  217. all above-Unicode code points)
  218. \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
  219. \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1)
  220. \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
  221. \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
  222. \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
  223. \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
  224. \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1578)
  225. \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1578)
  226. \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5174)
  227. \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5174)
  228. \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
  229. \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
  230. \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
  231. \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
  232. \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1)
  233. \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (4077)
  234. \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4077)
  235. \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
  236. \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1)
  237. \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
  238. \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
  239. \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
  240. \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
  241. \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
  242. \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
  243. \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17)
  244. \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
  245. \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
  246. \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
  247. plus all above-Unicode code points)
  248. \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12)
  249. \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
  250. (545)
  251. \p{Bidi_M: *} \p{Bidi_Mirrored: *}
  252. \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
  253. (545)
  254. \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
  255. plus all above-Unicode code points)
  256. \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545)
  257. \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
  258. (60)
  259. \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60)
  260. \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
  261. (1_113_992 plus all above-Unicode code
  262. points)
  263. \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
  264. plus all above-Unicode code points)
  265. \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
  266. (60)
  267. \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60)
  268. \p{Blank} \p{XPosixBlank} (18)
  269. \p{Blk: *} \p{Block: *}
  270. \p{Block: Aegean_Numbers} (64)
  271. \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64)
  272. \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
  273. \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128)
  274. \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
  275. (80)
  276. \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
  277. AlphabeticPF}) (80)
  278. \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
  279. NOR \p{Is_Anatolian_Hieroglyphs}) (640)
  280. \p{Block: Ancient_Greek_Music} \p{Block=
  281. Ancient_Greek_Musical_Notation} (80)
  282. \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
  283. AncientGreekMusic}) (80)
  284. \p{Block: Ancient_Greek_Numbers} (80)
  285. \p{Block: Ancient_Symbols} (64)
  286. \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256)
  287. \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
  288. \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96)
  289. \p{Block: Arabic_Math} \p{Block=
  290. Arabic_Mathematical_Alphabetic_Symbols}
  291. (256)
  292. \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
  293. ArabicMath}) (256)
  294. \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
  295. \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
  296. \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
  297. (688)
  298. \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
  299. (144)
  300. \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
  301. \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48)
  302. \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian}) (96)
  303. \p{Block: Arrows} (112)
  304. \p{Block: ASCII} \p{Block=Basic_Latin} (128)
  305. \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64)
  306. \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
  307. (128)
  308. \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96)
  309. \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
  310. \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576)
  311. \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128)
  312. \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
  313. (48)
  314. \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64)
  315. \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128)
  316. \p{Block: Block_Elements} (32)
  317. \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) (48)
  318. \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
  319. \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32)
  320. \p{Block: Box_Drawing} (128)
  321. \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128)
  322. \p{Block: Braille} \p{Block=Braille_Patterns} (256)
  323. \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256)
  324. \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese}) (32)
  325. \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32)
  326. \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
  327. (256)
  328. \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
  329. ByzantineMusic}) (256)
  330. \p{Block: Canadian_Syllabics} \p{Block=
  331. Unified_Canadian_Aboriginal_Syllabics}
  332. (640)
  333. \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64)
  334. \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
  335. \p{Is_Caucasian_Albanian}) (64)
  336. \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80)
  337. \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96)
  338. \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee}) (96)
  339. \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
  340. \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80)
  341. \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
  342. \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
  343. \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
  344. \p{Block: CJK_Compat_Ideographs} \p{Block=
  345. CJK_Compatibility_Ideographs} (512)
  346. \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
  347. CJK_Compatibility_Ideographs_Supplement}
  348. (544)
  349. \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256)
  350. \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
  351. (32)
  352. \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
  353. CJKCompatIdeographs}) (512)
  354. \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
  355. CJKCompatIdeographsSup}) (544)
  356. \p{Block: CJK_Ext_A} \p{Block=
  357. CJK_Unified_Ideographs_Extension_A}
  358. (6592)
  359. \p{Block: CJK_Ext_B} \p{Block=
  360. CJK_Unified_Ideographs_Extension_B}
  361. (42_720)
  362. \p{Block: CJK_Ext_C} \p{Block=
  363. CJK_Unified_Ideographs_Extension_C}
  364. (4160)
  365. \p{Block: CJK_Ext_D} \p{Block=
  366. CJK_Unified_Ideographs_Extension_D} (224)
  367. \p{Block: CJK_Ext_E} \p{Block=
  368. CJK_Unified_Ideographs_Extension_E}
  369. (5776)
  370. \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
  371. \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
  372. (128)
  373. \p{Block: CJK_Strokes} (48)
  374. \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
  375. \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
  376. (64)
  377. \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992)
  378. \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
  379. CJKExtA}) (6592)
  380. \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
  381. CJKExtB}) (42_720)
  382. \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
  383. CJKExtC}) (4160)
  384. \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
  385. CJKExtD}) (224)
  386. \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
  387. CJKExtE}) (5776)
  388. \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
  389. Diacriticals}) (112)
  390. \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
  391. DiacriticalsExt}) (80)
  392. \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
  393. DiacriticalsForSymbols}) (48)
  394. \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
  395. DiacriticalsSup}) (64)
  396. \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16)
  397. \p{Block: Combining_Marks_For_Symbols} \p{Block=
  398. Combining_Diacritical_Marks_For_Symbols}
  399. (48)
  400. \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
  401. IndicNumberForms}) (16)
  402. \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
  403. \p{Block: Control_Pictures} (64)
  404. \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128)
  405. \p{Block: Coptic_Epact_Numbers} (32)
  406. \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
  407. \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32)
  408. \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
  409. (1024)
  410. \p{Block: Cuneiform_Numbers} \p{Block=
  411. Cuneiform_Numbers_And_Punctuation} (128)
  412. \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
  413. CuneiformNumbers}) (128)
  414. \p{Block: Currency_Symbols} (48)
  415. \p{Block: Cypriot_Syllabary} (64)
  416. \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
  417. (256)
  418. \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
  419. \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
  420. \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32)
  421. \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96)
  422. \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
  423. \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48)
  424. \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
  425. (48)
  426. \p{Block: Deseret} (80)
  427. \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
  428. (128)
  429. \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
  430. \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32)
  431. \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
  432. \p{Block: Diacriticals_Ext} \p{Block=
  433. Combining_Diacritical_Marks_Extended}
  434. (80)
  435. \p{Block: Diacriticals_For_Symbols} \p{Block=
  436. Combining_Diacritical_Marks_For_Symbols}
  437. (48)
  438. \p{Block: Diacriticals_Sup} \p{Block=
  439. Combining_Diacritical_Marks_Supplement}
  440. (64)
  441. \p{Block: Dingbats} (192)
  442. \p{Block: Domino} \p{Block=Domino_Tiles} (112)
  443. \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112)
  444. \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
  445. (160)
  446. \p{Block: Early_Dynastic_Cuneiform} (208)
  447. \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
  448. \p{Is_Egyptian_Hieroglyphs}) (1072)
  449. \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48)
  450. \p{Block: Emoticons} (80)
  451. \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
  452. \p{Block: Enclosed_Alphanum_Sup} \p{Block=
  453. Enclosed_Alphanumeric_Supplement} (256)
  454. \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
  455. EnclosedAlphanumSup}) (256)
  456. \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
  457. EnclosedAlphanum}) (160)
  458. \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
  459. (256)
  460. \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
  461. EnclosedCJK}) (256)
  462. \p{Block: Enclosed_Ideographic_Sup} \p{Block=
  463. Enclosed_Ideographic_Supplement} (256)
  464. \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
  465. EnclosedIdeographicSup}) (256)
  466. \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
  467. (384)
  468. \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
  469. \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
  470. \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96)
  471. \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48)
  472. \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
  473. \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32)
  474. \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
  475. \p{Punct} NOR \p{Is_Punctuation}) (112)
  476. \p{Block: Geometric_Shapes} (96)
  477. \p{Block: Geometric_Shapes_Ext} \p{Block=
  478. Geometric_Shapes_Extended} (128)
  479. \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
  480. GeometricShapesExt}) (128)
  481. \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian}) (96)
  482. \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
  483. \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48)
  484. \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
  485. (96)
  486. \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32)
  487. \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128)
  488. \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
  489. NOR \p{Is_Greek}) (144)
  490. \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
  491. NOR \p{Is_Greek}) (144)
  492. \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
  493. \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256)
  494. \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
  495. (128)
  496. \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
  497. (128)
  498. \p{Block: Half_And_Full_Forms} \p{Block=
  499. Halfwidth_And_Fullwidth_Forms} (240)
  500. \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
  501. \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
  502. HalfAndFullForms}) (240)
  503. \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
  504. NOR \p{Is_Hangul}) (11_184)
  505. \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
  506. (96)
  507. \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256)
  508. \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32)
  509. \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80)
  510. \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
  511. NOR \p{Is_Hangul}) (11_184)
  512. \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32)
  513. \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32)
  514. \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112)
  515. \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
  516. HighPUSurrogates}) (128)
  517. \p{Block: High_PU_Surrogates} \p{Block=
  518. High_Private_Use_Surrogates} (128)
  519. \p{Block: High_Surrogates} (896)
  520. \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana}) (96)
  521. \p{Block: IDC} \p{Block=
  522. Ideographic_Description_Characters} (NOT
  523. \p{ID_Continue} NOR \p{Is_IDC}) (16)
  524. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
  525. NOT \p{ID_Continue} NOR \p{Is_IDC}) (16)
  526. \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
  527. \p{Is_Imperial_Aramaic}) (32)
  528. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  529. (16)
  530. \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
  531. NOR \p{Is_Inscriptional_Pahlavi}) (32)
  532. \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
  533. NOR \p{Is_Inscriptional_Parthian}) (32)
  534. \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  535. \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96)
  536. \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  537. \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  538. \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  539. \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese}) (96)
  540. \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80)
  541. \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  542. \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256)
  543. \p{Block: Kanbun} (16)
  544. \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  545. \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224)
  546. \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128)
  547. \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana}) (96)
  548. \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  549. \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
  550. KatakanaExt}) (16)
  551. \p{Block: Kayah_Li} (NOT \p{Kayah_Li} NOR \p{Is_Kayah_Li}) (48)
  552. \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
  553. (96)
  554. \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128)
  555. \p{Block: Khmer_Symbols} (32)
  556. \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80)
  557. \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
  558. (80)
  559. \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128)
  560. \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  561. \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  562. \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128)
  563. \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  564. \p{Block: Latin_Ext_Additional} \p{Block=
  565. Latin_Extended_Additional} (256)
  566. \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  567. \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  568. \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  569. \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
  570. \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128)
  571. \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
  572. LatinExtAdditional}) (256)
  573. \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208)
  574. \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32)
  575. \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224)
  576. \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64)
  577. \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80)
  578. \p{Block: Letterlike_Symbols} (80)
  579. \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80)
  580. \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
  581. (384)
  582. \p{Block: Linear_B_Ideograms} (128)
  583. \p{Block: Linear_B_Syllabary} (128)
  584. \p{Block: Lisu} (48)
  585. \p{Block: Low_Surrogates} (1024)
  586. \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32)
  587. \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32)
  588. \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani}) (48)
  589. \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  590. \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48)
  591. \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
  592. (128)
  593. \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32)
  594. \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
  595. (64)
  596. \p{Block: Math_Alphanum} \p{Block=
  597. Mathematical_Alphanumeric_Symbols} (1024)
  598. \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  599. \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
  600. MathAlphanum}) (1024)
  601. \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
  602. (256)
  603. \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
  604. \p{Is_Meetei_Mayek}) (64)
  605. \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  606. \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
  607. (32)
  608. \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
  609. \p{Is_Mende_Kikakui}) (224)
  610. \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
  611. \p{Is_Meroitic_Cursive}) (96)
  612. \p{Block: Meroitic_Hieroglyphs} (32)
  613. \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160)
  614. \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
  615. (256)
  616. \p{Block: Misc_Math_Symbols_A} \p{Block=
  617. Miscellaneous_Mathematical_Symbols_A}
  618. (48)
  619. \p{Block: Misc_Math_Symbols_B} \p{Block=
  620. Miscellaneous_Mathematical_Symbols_B}
  621. (128)
  622. \p{Block: Misc_Pictographs} \p{Block=
  623. Miscellaneous_Symbols_And_Pictographs}
  624. (768)
  625. \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  626. \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  627. \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
  628. MiscMathSymbolsA}) (48)
  629. \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
  630. MiscMathSymbolsB}) (128)
  631. \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) (256)
  632. \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
  633. MiscArrows}) (256)
  634. \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
  635. MiscPictographs}) (768)
  636. \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
  637. (256)
  638. \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96)
  639. \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  640. \p{Block: Modifier_Tone_Letters} (32)
  641. \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
  642. (176)
  643. \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48)
  644. \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48)
  645. \p{Block: Music} \p{Block=Musical_Symbols} (256)
  646. \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256)
  647. \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160)
  648. \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  649. \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
  650. \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32)
  651. \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32)
  652. \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
  653. (48)
  654. \p{Block: NB} \p{Block=No_Block} (849_856 plus all
  655. above-Unicode code points)
  656. \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
  657. \p{Is_New_Tai_Lue}) (96)
  658. \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64)
  659. \p{Block: No_Block} (Short: \p{Blk=NB}) (849_856 plus all
  660. above-Unicode code points)
  661. \p{Block: Number_Forms} (64)
  662. \p{Block: OCR} \p{Block=Optical_Character_Recognition}
  663. (32)
  664. \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32)
  665. \p{Block: Ol_Chiki} (48)
  666. \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
  667. \p{Is_Old_Hungarian}) (128)
  668. \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
  669. (48)
  670. \p{Block: Old_North_Arabian} (32)
  671. \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
  672. (48)
  673. \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
  674. \p{Is_Old_Persian}) (64)
  675. \p{Block: Old_South_Arabian} (32)
  676. \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
  677. (80)
  678. \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32)
  679. \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128)
  680. \p{Block: Ornamental_Dingbats} (48)
  681. \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48)
  682. \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
  683. \p{Is_Pahawh_Hmong}) (144)
  684. \p{Block: Palmyrene} (32)
  685. \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
  686. \p{Is_Pau_Cin_Hau}) (64)
  687. \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) (64)
  688. \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  689. \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48)
  690. \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
  691. (32)
  692. \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  693. \p{Block: Phonetic_Ext_Sup} \p{Block=
  694. Phonetic_Extensions_Supplement} (64)
  695. \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128)
  696. \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
  697. PhoneticExtSup}) (64)
  698. \p{Block: Playing_Cards} (96)
  699. \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
  700. \p{Private_Use} NOR \p{Is_Private_Use})
  701. (6400)
  702. \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
  703. \p{Private_Use} NOR \p{Is_Private_Use})
  704. (6400)
  705. \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
  706. \p{Is_Psalter_Pahlavi}) (48)
  707. \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
  708. \p{Private_Use} NOR \p{Is_Private_Use})
  709. (6400)
  710. \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
  711. \p{Punct} NOR \p{Is_Punctuation}) (112)
  712. \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48)
  713. \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  714. \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32)
  715. \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96)
  716. \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
  717. (64)
  718. \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
  719. (96)
  720. \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96)
  721. \p{Block: Shavian} (48)
  722. \p{Block: Shorthand_Format_Controls} (16)
  723. \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128)
  724. \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128)
  725. \p{Block: Sinhala_Archaic_Numbers} (32)
  726. \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32)
  727. \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  728. \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
  729. \p{Is_Sora_Sompeng}) (48)
  730. \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
  731. ModifierLetters}) (80)
  732. \p{Block: Specials} (16)
  733. \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
  734. (64)
  735. \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  736. \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16)
  737. \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  738. \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  739. \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
  740. \p{Block: Sup_Math_Operators} \p{Block=
  741. Supplemental_Mathematical_Operators}
  742. (256)
  743. \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
  744. (65_536)
  745. \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
  746. (65_536)
  747. \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  748. \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
  749. Supplemental_Symbols_And_Pictographs}
  750. (256)
  751. \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  752. \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
  753. SuperAndSub}) (48)
  754. \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16)
  755. \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128)
  756. \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256)
  757. \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
  758. SupMathOperators}) (256)
  759. \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
  760. SupPunctuation}) (128)
  761. \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
  762. SupSymbolsAndPictographs}) (256)
  763. \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
  764. SupPUAA}) (65_536)
  765. \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
  766. SupPUAB}) (65_536)
  767. \p{Block: Sutton_SignWriting} (688)
  768. \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
  769. \p{Is_Syloti_Nagri}) (48)
  770. \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80)
  771. \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32)
  772. \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) (32)
  773. \p{Block: Tags} (128)
  774. \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48)
  775. \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
  776. (144)
  777. \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet}) (96)
  778. \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  779. \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96)
  780. \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80)
  781. \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128)
  782. \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128)
  783. \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64)
  784. \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128)
  785. \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256)
  786. \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh}) (80)
  787. \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96)
  788. \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
  789. (128)
  790. \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
  791. TransportAndMap}) (128)
  792. \p{Block: UCAS} \p{Block=
  793. Unified_Canadian_Aboriginal_Syllabics}
  794. (640)
  795. \p{Block: UCAS_Ext} \p{Block=
  796. Unified_Canadian_Aboriginal_Syllabics_-
  797. Extended} (80)
  798. \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic}) (32)
  799. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  800. UCAS}) (640)
  801. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
  802. \p{Blk=UCASExt}) (80)
  803. \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320)
  804. \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
  805. \p{Variation_Selector} NOR \p{Is_VS})
  806. (16)
  807. \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
  808. (240)
  809. \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  810. \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48)
  811. \p{Block: Vertical_Forms} (16)
  812. \p{Block: VS} \p{Block=Variation_Selectors} (NOT
  813. \p{Variation_Selector} NOR \p{Is_VS})
  814. (16)
  815. \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
  816. (240)
  817. \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
  818. \p{Is_Warang_Citi}) (96)
  819. \p{Block: Yi_Radicals} (64)
  820. \p{Block: Yi_Syllables} (1168)
  821. \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  822. \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64)
  823. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  824. \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
  825. \p{Block=Bopomofo}) (70)
  826. \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
  827. \p{Block=Bopomofo}) (70)
  828. X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
  829. Bopomofo_Extended}) (32)
  830. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
  831. \p{InBopomofoExt}) (32)
  832. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  833. \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  834. \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
  835. \p{Block=Brahmi}) (109)
  836. \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
  837. \p{Block=Brahmi}) (109)
  838. \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
  839. \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
  840. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
  841. \p{InBraille}) (256)
  842. \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
  843. \p{Block=Buginese}) (30)
  844. \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
  845. \p{Block=Buginese}) (30)
  846. \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
  847. \p{Block=Buhid}) (20)
  848. \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
  849. \p{Block=Buhid}) (20)
  850. X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
  851. Byzantine_Musical_Symbols}) (256)
  852. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  853. (Short: \p{InByzantineMusic}) (256)
  854. \p{C} \pC \p{Other} (= \p{General_Category=Other})
  855. (993_590 plus all above-Unicode code
  856. points)
  857. \p{Cakm} \p{Chakma} (= \p{Script=Chakma}) (NOT
  858. \p{Block=Chakma}) (67)
  859. \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
  860. \p{Cans}) (710)
  861. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  862. (= \p{Block=
  863. Unified_Canadian_Aboriginal_Syllabics})
  864. (640)
  865. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  866. Not_Reordered} (1_113_361 plus all
  867. above-Unicode code points)
  868. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  869. Overlay} (32)
  870. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  871. Nukta} (20)
  872. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  873. Kana_Voicing} (2)
  874. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  875. Virama} (45)
  876. T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
  877. CCC10} (1)
  878. T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
  879. CCC11} (1)
  880. T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
  881. CCC12} (1)
  882. T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
  883. CCC13} (1)
  884. T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
  885. CCC14} (1)
  886. T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
  887. CCC15} (1)
  888. T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
  889. CCC16} (1)
  890. T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
  891. CCC17} (1)
  892. T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
  893. CCC18} (2)
  894. T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
  895. CCC19} (2)
  896. T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
  897. CCC20} (1)
  898. T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
  899. CCC21} (1)
  900. T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
  901. CCC22} (1)
  902. T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
  903. CCC23} (1)
  904. T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
  905. CCC24} (1)
  906. T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
  907. CCC25} (1)
  908. T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
  909. CCC26} (1)
  910. T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
  911. CCC27} (2)
  912. T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
  913. CCC28} (2)
  914. T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
  915. CCC29} (2)
  916. T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
  917. CCC30} (2)
  918. T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
  919. CCC31} (2)
  920. T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
  921. CCC32} (2)
  922. T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
  923. CCC33} (1)
  924. T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
  925. CCC34} (1)
  926. T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
  927. CCC35} (1)
  928. T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
  929. CCC36} (1)
  930. T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
  931. CCC84} (1)
  932. T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
  933. CCC91} (1)
  934. T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
  935. CCC103} (2)
  936. T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
  937. CCC107} (4)
  938. T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
  939. CCC118} (2)
  940. T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
  941. CCC122} (4)
  942. T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
  943. CCC129} (1)
  944. T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
  945. CCC130} (6)
  946. T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
  947. CCC132} (1)
  948. T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
  949. CCC133} (0)
  950. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  951. Attached_Below_Left} (0)
  952. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  953. Attached_Below} (5)
  954. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  955. Attached_Above} (1)
  956. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  957. Attached_Above_Right} (9)
  958. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  959. Below_Left} (1)
  960. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  961. Below} (153)
  962. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  963. Below_Right} (4)
  964. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  965. Left} (2)
  966. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  967. Right} (1)
  968. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  969. Above_Left} (3)
  970. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  971. Above} (402)
  972. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  973. Above_Right} (4)
  974. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  975. Double_Below} (4)
  976. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  977. Double_Above} (5)
  978. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  979. Iota_Subscript} (1)
  980. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  981. Above} (402)
  982. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (402)
  983. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  984. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  985. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  986. Above_Left} (3)
  987. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  988. Above_Right} (4)
  989. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  990. Attached_Above} (1)
  991. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  992. Attached_Above_Right} (9)
  993. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  994. Attached_Below} (5)
  995. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  996. Attached_Below_Left} (0)
  997. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  998. (1)
  999. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  1000. \p{Ccc=ATAR}) (9)
  1001. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  1002. (5)
  1003. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  1004. ATBL}) (0)
  1005. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  1006. Below} (153)
  1007. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (153)
  1008. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  1009. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  1010. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  1011. Below_Left} (1)
  1012. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  1013. Below_Right} (4)
  1014. \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
  1015. \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
  1016. \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
  1017. \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
  1018. \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
  1019. \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
  1020. \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
  1021. \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
  1022. \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
  1023. \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
  1024. \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
  1025. \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  1026. \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
  1027. \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
  1028. \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
  1029. \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
  1030. \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
  1031. \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
  1032. \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
  1033. \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
  1034. \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
  1035. \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
  1036. \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
  1037. \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
  1038. \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
  1039. \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
  1040. \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
  1041. \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
  1042. \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
  1043. \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
  1044. \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
  1045. \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
  1046. \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
  1047. \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
  1048. \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
  1049. \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
  1050. \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
  1051. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  1052. Double_Above} (5)
  1053. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  1054. Double_Below} (4)
  1055. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  1056. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  1057. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  1058. (1)
  1059. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  1060. Iota_Subscript} (1)
  1061. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  1062. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  1063. Kana_Voicing} (2)
  1064. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  1065. Left} (2)
  1066. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  1067. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  1068. Nukta} (20)
  1069. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  1070. (1_113_361 plus all above-Unicode code
  1071. points)
  1072. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  1073. Not_Reordered} (1_113_361 plus all
  1074. above-Unicode code points)
  1075. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (20)
  1076. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  1077. Overlay} (32)
  1078. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32)
  1079. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  1080. Right} (1)
  1081. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  1082. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (45)
  1083. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  1084. Virama} (45)
  1085. \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
  1086. Canadian_Aboriginal}) (710)
  1087. \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
  1088. \p{Block=Carian}) (49)
  1089. \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
  1090. \p{Block=Carian}) (49)
  1091. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2115)
  1092. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_997 plus
  1093. all above-Unicode code points)
  1094. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2115)
  1095. \p{Cased} \p{Cased=Y} (3955)
  1096. \p{Cased: N*} (Single: \P{Cased}) (1_110_157 plus all
  1097. above-Unicode code points)
  1098. \p{Cased: Y*} (Single: \p{Cased}) (3955)
  1099. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1100. \p{LC}) (3646)
  1101. \p{Category: *} \p{General_Category: *}
  1102. \p{Caucasian_Albanian} \p{Script=Caucasian_Albanian} (Short:
  1103. \p{Aghb}; NOT \p{Block=
  1104. Caucasian_Albanian}) (53)
  1105. \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
  1106. Control}) (65)
  1107. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1108. \p{CE} \p{Composition_Exclusion} (=
  1109. \p{Composition_Exclusion=Y}) (81)
  1110. \p{CE: *} \p{Composition_Exclusion: *}
  1111. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1112. (150)
  1113. \p{Chakma} \p{Script=Chakma} (Short: \p{Cakm}; NOT
  1114. \p{Block=Chakma}) (67)
  1115. \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
  1116. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1117. \p{CWCF}) (1297)
  1118. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1119. (1_112_815 plus all above-Unicode code
  1120. points)
  1121. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1122. (1297)
  1123. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1124. \p{CWCM}) (2518)
  1125. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1126. (1_111_594 plus all above-Unicode code
  1127. points)
  1128. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1129. (2518)
  1130. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1131. \p{CWL}) (1233)
  1132. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1133. (1_112_879 plus all above-Unicode code
  1134. points)
  1135. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1233)
  1136. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1137. Y} (Short: \p{CWKCF}) (10_146)
  1138. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1139. \P{CWKCF}) (1_103_966 plus all above-
  1140. Unicode code points)
  1141. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1142. \p{CWKCF}) (10_146)
  1143. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1144. \p{CWT}) (1289)
  1145. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1146. (1_112_823 plus all above-Unicode code
  1147. points)
  1148. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1289)
  1149. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1150. \p{CWU}) (1316)
  1151. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1152. (1_112_796 plus all above-Unicode code
  1153. points)
  1154. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1316)
  1155. \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
  1156. \p{Block=Cherokee}) (172)
  1157. \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
  1158. \p{Block=Cherokee}) (172)
  1159. X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
  1160. Cherokee_Supplement}) (80)
  1161. X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
  1162. \p{InCherokeeSup}) (80)
  1163. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1164. Y}) (2115)
  1165. \p{CI: *} \p{Case_Ignorable: *}
  1166. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1167. CJK_Unified_Ideographs}) (20_992)
  1168. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1169. CJK_Compatibility}) (256)
  1170. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1171. CJK_Compatibility_Forms}) (32)
  1172. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1173. \p{Block=CJK_Compatibility_Ideographs})
  1174. (512)
  1175. X \p{CJK_Compat_Ideographs_Sup}
  1176. \p{CJK_Compatibility_Ideographs_-
  1177. Supplement} (= \p{Block=
  1178. CJK_Compatibility_Ideographs_-
  1179. Supplement}) (544)
  1180. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1181. \p{InCJKCompat}) (256)
  1182. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1183. (Short: \p{InCJKCompatForms}) (32)
  1184. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1185. CJK_Compatibility_Ideographs} (Short:
  1186. \p{InCJKCompatIdeographs}) (512)
  1187. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1188. CJK_Compatibility_Ideographs_Supplement}
  1189. (Short: \p{InCJKCompatIdeographsSup})
  1190. (544)
  1191. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1192. \p{Block=
  1193. CJK_Unified_Ideographs_Extension_A})
  1194. (6592)
  1195. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1196. \p{Block=
  1197. CJK_Unified_Ideographs_Extension_B})
  1198. (42_720)
  1199. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1200. \p{Block=
  1201. CJK_Unified_Ideographs_Extension_C})
  1202. (4160)
  1203. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1204. \p{Block=
  1205. CJK_Unified_Ideographs_Extension_D})
  1206. (224)
  1207. X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
  1208. \p{Block=
  1209. CJK_Unified_Ideographs_Extension_E})
  1210. (5776)
  1211. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1212. CJK_Radicals_Supplement}) (128)
  1213. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1214. (Short: \p{InCJKRadicalsSup}) (128)
  1215. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1216. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1217. \p{Block=CJK_Symbols_And_Punctuation})
  1218. (64)
  1219. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1220. CJK_Symbols_And_Punctuation} (Short:
  1221. \p{InCJKSymbols}) (64)
  1222. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1223. (Short: \p{InCJK}) (20_992)
  1224. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1225. CJK_Unified_Ideographs_Extension_A}
  1226. (Short: \p{InCJKExtA}) (6592)
  1227. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1228. CJK_Unified_Ideographs_Extension_B}
  1229. (Short: \p{InCJKExtB}) (42_720)
  1230. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1231. CJK_Unified_Ideographs_Extension_C}
  1232. (Short: \p{InCJKExtC}) (4160)
  1233. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1234. CJK_Unified_Ideographs_Extension_D}
  1235. (Short: \p{InCJKExtD}) (224)
  1236. X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
  1237. CJK_Unified_Ideographs_Extension_E}
  1238. (Short: \p{InCJKExtE}) (5776)
  1239. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1240. (Short: \p{Pe}) (73)
  1241. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1242. Unassigned}) (853_859 plus all above-
  1243. Unicode code points)
  1244. \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
  1245. Control}) (65)
  1246. \p{Co} \p{Private_Use} (= \p{General_Category=
  1247. Private_Use}) (NOT \p{Private_Use_Area})
  1248. (137_468)
  1249. X \p{Combining_Diacritical_Marks} \p{Block=
  1250. Combining_Diacritical_Marks} (Short:
  1251. \p{InDiacriticals}) (112)
  1252. X \p{Combining_Diacritical_Marks_Extended} \p{Block=
  1253. Combining_Diacritical_Marks_Extended}
  1254. (Short: \p{InDiacriticalsExt}) (80)
  1255. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1256. Combining_Diacritical_Marks_For_Symbols}
  1257. (Short: \p{InDiacriticalsForSymbols})
  1258. (48)
  1259. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1260. Combining_Diacritical_Marks_Supplement}
  1261. (Short: \p{InDiacriticalsSup}) (64)
  1262. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1263. \p{InHalfMarks}) (16)
  1264. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1265. (1963)
  1266. X \p{Combining_Marks_For_Symbols}
  1267. \p{Combining_Diacritical_Marks_For_-
  1268. Symbols} (= \p{Block=
  1269. Combining_Diacritical_Marks_For_-
  1270. Symbols}) (48)
  1271. \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (7179)
  1272. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1273. (Short: \p{InIndicNumberForms}) (16)
  1274. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1275. \p{Full_Composition_Exclusion=Y}) (1120)
  1276. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1277. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1278. Hangul_Compatibility_Jamo}) (96)
  1279. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1280. \p{CE}) (81)
  1281. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
  1282. plus all above-Unicode code points)
  1283. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  1284. \p{Connector_Punctuation} \p{General_Category=
  1285. Connector_Punctuation} (Short: \p{Pc})
  1286. (10)
  1287. \p{Control} \p{XPosixCntrl} (= \p{General_Category=
  1288. Control}) (65)
  1289. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1290. \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
  1291. \p{Block=Coptic}) (137)
  1292. \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
  1293. \p{Block=Coptic}) (137)
  1294. X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
  1295. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1296. Counting_Rod_Numerals}) (32)
  1297. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1298. \p{InCountingRod}) (32)
  1299. \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
  1300. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1301. Surrogate}) (2048)
  1302. \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
  1303. \p{Block=Cuneiform}) (1234)
  1304. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1305. \p{Block=
  1306. Cuneiform_Numbers_And_Punctuation}) (128)
  1307. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1308. Cuneiform_Numbers_And_Punctuation}
  1309. (Short: \p{InCuneiformNumbers}) (128)
  1310. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1311. (Short: \p{Sc}) (53)
  1312. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1313. \p{CWCF} \p{Changes_When_Casefolded} (=
  1314. \p{Changes_When_Casefolded=Y}) (1297)
  1315. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1316. \p{CWCM} \p{Changes_When_Casemapped} (=
  1317. \p{Changes_When_Casemapped=Y}) (2518)
  1318. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1319. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1320. \p{Changes_When_NFKC_Casefolded=Y})
  1321. (10_146)
  1322. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1323. \p{CWL} \p{Changes_When_Lowercased} (=
  1324. \p{Changes_When_Lowercased=Y}) (1233)
  1325. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1326. \p{CWT} \p{Changes_When_Titlecased} (=
  1327. \p{Changes_When_Titlecased=Y}) (1289)
  1328. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1329. \p{CWU} \p{Changes_When_Uppercased} (=
  1330. \p{Changes_When_Uppercased=Y}) (1316)
  1331. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1332. \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
  1333. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1334. \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
  1335. \p{Block=Cyrillic}) (434)
  1336. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1337. Cyrillic_Extended_A}) (32)
  1338. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1339. Cyrillic_Extended_B}) (96)
  1340. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1341. \p{InCyrillicExtA}) (32)
  1342. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1343. \p{InCyrillicExtB}) (96)
  1344. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1345. Cyrillic_Supplement}) (48)
  1346. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1347. \p{InCyrillicSup}) (48)
  1348. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1349. Cyrillic_Supplement}) (48)
  1350. \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
  1351. \p{Block=Cyrillic}) (434)
  1352. \p{Dash} \p{Dash=Y} (28)
  1353. \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
  1354. above-Unicode code points)
  1355. \p{Dash: Y*} (Single: \p{Dash}) (28)
  1356. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1357. (Short: \p{Pd}) (24)
  1358. \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
  1359. Decimal_Number}) (550)
  1360. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1361. (13_232)
  1362. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232)
  1363. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  1364. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1365. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  1366. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1367. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1368. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  1369. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
  1370. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1371. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  1372. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1373. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  1374. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1375. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  1376. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1377. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  1378. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1379. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  1380. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1381. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  1382. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1383. Non_Canonical} (Perl extension) (3661)
  1384. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1385. decompositions (Short: \p{Dt=NonCanon})
  1386. (Perl extension) (3661)
  1387. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_219 plus
  1388. all above-Unicode code points)
  1389. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  1390. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1391. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
  1392. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
  1393. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  1394. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
  1395. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152)
  1396. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1397. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  1398. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  1399. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1400. Y} (Short: \p{DI}) (4173)
  1401. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1402. (1_109_939 plus all above-Unicode code
  1403. points)
  1404. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1405. (4173)
  1406. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (16)
  1407. \p{Dep: *} \p{Deprecated: *}
  1408. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (16)
  1409. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_096
  1410. plus all above-Unicode code points)
  1411. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (16)
  1412. \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
  1413. \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
  1414. (NOT \p{Block=Devanagari}) (154)
  1415. \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
  1416. NOT \p{Block=Devanagari}) (154)
  1417. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  1418. Devanagari_Extended}) (32)
  1419. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  1420. \p{InDevanagariExt}) (32)
  1421. \p{DI} \p{Default_Ignorable_Code_Point} (=
  1422. \p{Default_Ignorable_Code_Point=Y})
  1423. (4173)
  1424. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  1425. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (773)
  1426. \p{Dia: *} \p{Diacritic: *}
  1427. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (773)
  1428. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_339
  1429. plus all above-Unicode code points)
  1430. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (773)
  1431. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  1432. \p{Block=Combining_Diacritical_Marks})
  1433. (112)
  1434. X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
  1435. (= \p{Block=
  1436. Combining_Diacritical_Marks_Extended})
  1437. (80)
  1438. X \p{Diacriticals_For_Symbols}
  1439. \p{Combining_Diacritical_Marks_For_-
  1440. Symbols} (= \p{Block=
  1441. Combining_Diacritical_Marks_For_-
  1442. Symbols}) (48)
  1443. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  1444. (= \p{Block=
  1445. Combining_Diacritical_Marks_Supplement})
  1446. (64)
  1447. \p{Digit} \p{XPosixDigit} (= \p{General_Category=
  1448. Decimal_Number}) (550)
  1449. X \p{Dingbats} \p{Block=Dingbats} (192)
  1450. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  1451. Domino_Tiles}) (112)
  1452. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  1453. \p{InDomino}) (112)
  1454. \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
  1455. \p{Dt: *} \p{Decomposition_Type: *}
  1456. \p{Dupl} \p{Duployan} (= \p{Script=Duployan}) (NOT
  1457. \p{Block=Duployan}) (143)
  1458. \p{Duployan} \p{Script=Duployan} (Short: \p{Dupl}; NOT
  1459. \p{Block=Duployan}) (143)
  1460. \p{Ea: *} \p{East_Asian_Width: *}
  1461. X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
  1462. (208)
  1463. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
  1464. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
  1465. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1466. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1467. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1468. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1469. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_894 plus
  1470. all above-Unicode code points)
  1471. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1472. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1473. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_894 plus all
  1474. above-Unicode code points)
  1475. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_134)
  1476. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_134)
  1477. \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
  1478. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1479. Egyptian_Hieroglyphs}) (1071)
  1480. \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
  1481. \p{Egyp}; NOT \p{Block=
  1482. Egyptian_Hieroglyphs}) (1071)
  1483. \p{Elba} \p{Elbasan} (= \p{Script=Elbasan}) (NOT
  1484. \p{Block=Elbasan}) (40)
  1485. \p{Elbasan} \p{Script=Elbasan} (Short: \p{Elba}; NOT
  1486. \p{Block=Elbasan}) (40)
  1487. X \p{Emoticons} \p{Block=Emoticons} (80)
  1488. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  1489. Enclosed_Alphanumerics}) (160)
  1490. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  1491. \p{Block=
  1492. Enclosed_Alphanumeric_Supplement}) (256)
  1493. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1494. Enclosed_Alphanumeric_Supplement}
  1495. (Short: \p{InEnclosedAlphanumSup}) (256)
  1496. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  1497. (Short: \p{InEnclosedAlphanum}) (160)
  1498. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  1499. \p{Block=
  1500. Enclosed_CJK_Letters_And_Months}) (256)
  1501. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1502. Enclosed_CJK_Letters_And_Months} (Short:
  1503. \p{InEnclosedCJK}) (256)
  1504. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  1505. (= \p{Block=
  1506. Enclosed_Ideographic_Supplement}) (256)
  1507. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1508. Enclosed_Ideographic_Supplement} (Short:
  1509. \p{InEnclosedIdeographicSup}) (256)
  1510. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1511. (Short: \p{Me}) (13)
  1512. \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
  1513. \p{Block=Ethiopic}) (495)
  1514. \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
  1515. \p{Block=Ethiopic}) (495)
  1516. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  1517. Ethiopic_Extended}) (96)
  1518. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  1519. Ethiopic_Extended_A}) (48)
  1520. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  1521. \p{InEthiopicExt}) (96)
  1522. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  1523. \p{InEthiopicExtA}) (48)
  1524. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  1525. Ethiopic_Supplement}) (32)
  1526. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  1527. \p{InEthiopicSup}) (32)
  1528. \p{Ext} \p{Extender} (= \p{Extender=Y}) (38)
  1529. \p{Ext: *} \p{Extender: *}
  1530. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (38)
  1531. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_074
  1532. plus all above-Unicode code points)
  1533. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (38)
  1534. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1535. (Short: \p{Pf}) (10)
  1536. \p{Format} \p{General_Category=Format} (Short:
  1537. \p{Cf}) (150)
  1538. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1539. (Short: \p{CompEx}) (1120)
  1540. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1541. \P{CompEx}) (1_112_992 plus all above-
  1542. Unicode code points)
  1543. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1544. \p{CompEx}) (1120)
  1545. \p{Gc: *} \p{General_Category: *}
  1546. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1547. \p{General_Category: C} \p{General_Category=Other} (993_590 plus
  1548. all above-Unicode code points)
  1549. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1550. \p{Gc=LC}, \p{LC}) (3646)
  1551. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1552. \p{General_Category: Cf} \p{General_Category=Format} (150)
  1553. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1554. (73)
  1555. \p{General_Category: Cn} \p{General_Category=Unassigned} (853_859
  1556. plus all above-Unicode code points)
  1557. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1558. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1559. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  1560. (1963)
  1561. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1562. \p{Pc}) (10)
  1563. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1564. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1565. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1566. (53)
  1567. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1568. (24)
  1569. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1570. (550)
  1571. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1572. (550)
  1573. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1574. (13)
  1575. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1576. (10)
  1577. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (150)
  1578. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1579. \p{Pi}) (12)
  1580. \p{General_Category: L} \p{General_Category=Letter} (109_591)
  1581. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3646)
  1582. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  1583. the trailing '_' matters in spite of
  1584. loose matching rules. (3646)
  1585. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3646)
  1586. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (109_591)
  1587. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1588. (236)
  1589. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1590. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1591. (/i= General_Category=Cased_Letter)
  1592. (1984)
  1593. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (248)
  1594. \p{General_Category: Lo} \p{General_Category=Other_Letter}
  1595. (105_697)
  1596. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1597. /i= General_Category=Cased_Letter) (1984)
  1598. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1599. (/i= General_Category=Cased_Letter) (31)
  1600. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1601. (/i= General_Category=Cased_Letter)
  1602. (1631)
  1603. \p{General_Category: M} \p{General_Category=Mark} (1963)
  1604. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1963)
  1605. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
  1606. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (383)
  1607. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
  1608. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1609. (1567)
  1610. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1611. (248)
  1612. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1613. (121)
  1614. \p{General_Category: N} \p{General_Category=Number} (1433)
  1615. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (550)
  1616. \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
  1617. \p{General_Category: No} \p{General_Category=Other_Number} (647)
  1618. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  1619. (1567)
  1620. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1433)
  1621. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  1622. (75)
  1623. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (993_590 plus
  1624. all above-Unicode code points)
  1625. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  1626. (105_697)
  1627. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (647)
  1628. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  1629. (513)
  1630. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  1631. (5677)
  1632. \p{General_Category: P} \p{General_Category=Punctuation} (717)
  1633. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  1634. \p{Zp}) (1)
  1635. \p{General_Category: Pc} \p{General_Category=
  1636. Connector_Punctuation} (10)
  1637. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
  1638. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  1639. (73)
  1640. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  1641. (10)
  1642. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  1643. (12)
  1644. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  1645. (513)
  1646. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  1647. (137_468)
  1648. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
  1649. \p{General_Category: Punct} \p{General_Category=Punctuation} (717)
  1650. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (717)
  1651. \p{General_Category: S} \p{General_Category=Symbol} (6799)
  1652. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (53)
  1653. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
  1654. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
  1655. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  1656. \p{General_Category: So} \p{General_Category=Other_Symbol} (5677)
  1657. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  1658. (17)
  1659. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (383)
  1660. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  1661. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6799)
  1662. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  1663. /i= General_Category=Cased_Letter) (31)
  1664. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  1665. (853_859 plus all above-Unicode code
  1666. points)
  1667. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  1668. /i= General_Category=Cased_Letter) (1631)
  1669. \p{General_Category: Z} \p{General_Category=Separator} (19)
  1670. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  1671. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  1672. (1)
  1673. \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
  1674. X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
  1675. \p{InPunctuation}) (112)
  1676. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  1677. X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
  1678. \p{Block=Geometric_Shapes_Extended})
  1679. (128)
  1680. X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
  1681. (Short: \p{InGeometricShapesExt}) (128)
  1682. \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
  1683. \p{Block=Georgian}) (127)
  1684. \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
  1685. \p{Block=Georgian}) (127)
  1686. X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
  1687. Georgian_Supplement}) (48)
  1688. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
  1689. \p{InGeorgianSup}) (48)
  1690. \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
  1691. (NOT \p{Block=Glagolitic}) (94)
  1692. \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
  1693. NOT \p{Block=Glagolitic}) (94)
  1694. \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
  1695. \p{Block=Gothic}) (27)
  1696. \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
  1697. \p{Block=Gothic}) (27)
  1698. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  1699. (118_912)
  1700. \p{Gr_Base: *} \p{Grapheme_Base: *}
  1701. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  1702. Y}) (1610)
  1703. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  1704. \p{Gran} \p{Grantha} (= \p{Script=Grantha}) (NOT
  1705. \p{Block=Grantha}) (85)
  1706. \p{Grantha} \p{Script=Grantha} (Short: \p{Gran}; NOT
  1707. \p{Block=Grantha}) (85)
  1708. \p{Graph} \p{XPosixGraph} (258_121)
  1709. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  1710. (118_912)
  1711. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (995_200
  1712. plus all above-Unicode code points)
  1713. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (118_912)
  1714. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  1715. (6030)
  1716. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6030)
  1717. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  1718. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  1719. (1610)
  1720. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1610)
  1721. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  1722. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  1723. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  1724. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  1725. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_585
  1726. plus all above-Unicode code points)
  1727. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  1728. (0)
  1729. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
  1730. \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
  1731. (26)
  1732. \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
  1733. Regional_Indicator} (26)
  1734. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  1735. SpacingMark} (330)
  1736. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (330)
  1737. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  1738. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  1739. \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
  1740. (1_094_585 plus all above-Unicode code
  1741. points)
  1742. \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
  1743. (1610)
  1744. \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_502
  1745. plus all above-Unicode code points)
  1746. \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1610)
  1747. \p{Greek} \p{Script=Greek} (Short: \p{Grek}; NOT
  1748. \p{Greek_And_Coptic}) (516)
  1749. X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
  1750. \p{InGreek}) (144)
  1751. X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
  1752. Greek_Extended}) (256)
  1753. X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
  1754. \p{InGreekExt}) (256)
  1755. \p{Grek} \p{Greek} (= \p{Script=Greek}) (NOT
  1756. \p{Greek_And_Coptic}) (516)
  1757. \p{Gujarati} \p{Script=Gujarati} (Short: \p{Gujr}; NOT
  1758. \p{Block=Gujarati}) (85)
  1759. \p{Gujr} \p{Gujarati} (= \p{Script=Gujarati}) (NOT
  1760. \p{Block=Gujarati}) (85)
  1761. \p{Gurmukhi} \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
  1762. \p{Block=Gurmukhi}) (79)
  1763. \p{Guru} \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
  1764. \p{Block=Gurmukhi}) (79)
  1765. X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
  1766. \p{Block=Halfwidth_And_Fullwidth_Forms})
  1767. (240)
  1768. X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
  1769. Combining_Half_Marks}) (16)
  1770. X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
  1771. Halfwidth_And_Fullwidth_Forms} (Short:
  1772. \p{InHalfAndFullForms}) (240)
  1773. \p{Han} \p{Script=Han} (81_734)
  1774. \p{Hang} \p{Hangul} (= \p{Script=Hangul}) (NOT
  1775. \p{Hangul_Syllables}) (11_739)
  1776. \p{Hangul} \p{Script=Hangul} (Short: \p{Hang}; NOT
  1777. \p{Hangul_Syllables}) (11_739)
  1778. X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
  1779. (Short: \p{InCompatJamo}) (96)
  1780. X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
  1781. (256)
  1782. X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
  1783. (Short: \p{InJamoExtA}) (32)
  1784. X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
  1785. (Short: \p{InJamoExtB}) (80)
  1786. \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
  1787. (125)
  1788. \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
  1789. \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
  1790. (399)
  1791. \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
  1792. \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
  1793. LVT_Syllable} (10_773)
  1794. \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
  1795. (10_773)
  1796. \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
  1797. Not_Applicable} (1_102_583 plus all
  1798. above-Unicode code points)
  1799. \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
  1800. (1_102_583 plus all above-Unicode code
  1801. points)
  1802. \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
  1803. (137)
  1804. \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
  1805. \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
  1806. (95)
  1807. \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
  1808. X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
  1809. \p{InHangul}) (11_184)
  1810. \p{Hani} \p{Han} (= \p{Script=Han}) (81_734)
  1811. \p{Hano} \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
  1812. \p{Block=Hanunoo}) (21)
  1813. \p{Hanunoo} \p{Script=Hanunoo} (Short: \p{Hano}; NOT
  1814. \p{Block=Hanunoo}) (21)
  1815. \p{Hatr} \p{Hatran} (= \p{Script=Hatran}) (NOT
  1816. \p{Block=Hatran}) (26)
  1817. \p{Hatran} \p{Script=Hatran} (Short: \p{Hatr}; NOT
  1818. \p{Block=Hatran}) (26)
  1819. \p{Hebr} \p{Hebrew} (= \p{Script=Hebrew}) (NOT
  1820. \p{Block=Hebrew}) (133)
  1821. \p{Hebrew} \p{Script=Hebrew} (Short: \p{Hebr}; NOT
  1822. \p{Block=Hebrew}) (133)
  1823. \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  1824. \p{Hex: *} \p{Hex_Digit: *}
  1825. \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  1826. \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
  1827. plus all above-Unicode code points)
  1828. \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
  1829. X \p{High_Private_Use_Surrogates} \p{Block=
  1830. High_Private_Use_Surrogates} (Short:
  1831. \p{InHighPUSurrogates}) (128)
  1832. X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
  1833. \p{Block=High_Private_Use_Surrogates})
  1834. (128)
  1835. X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  1836. \p{Hira} \p{Hiragana} (= \p{Script=Hiragana}) (NOT
  1837. \p{Block=Hiragana}) (91)
  1838. \p{Hiragana} \p{Script=Hiragana} (Short: \p{Hira}; NOT
  1839. \p{Block=Hiragana}) (91)
  1840. \p{Hluw} \p{Anatolian_Hieroglyphs} (= \p{Script=
  1841. Anatolian_Hieroglyphs}) (NOT \p{Block=
  1842. Anatolian_Hieroglyphs}) (583)
  1843. \p{Hmng} \p{Pahawh_Hmong} (= \p{Script=
  1844. Pahawh_Hmong}) (NOT \p{Block=
  1845. Pahawh_Hmong}) (127)
  1846. \p{HorizSpace} \p{XPosixBlank} (18)
  1847. \p{Hst: *} \p{Hangul_Syllable_Type: *}
  1848. \p{Hung} \p{Old_Hungarian} (= \p{Script=
  1849. Old_Hungarian}) (NOT \p{Block=
  1850. Old_Hungarian}) (108)
  1851. D \p{Hyphen} \p{Hyphen=Y} (11)
  1852. D \p{Hyphen: N*} Supplanted by Line_Break property values;
  1853. see www.unicode.org/reports/tr14
  1854. (Single: \P{Hyphen}) (1_114_101 plus all
  1855. above-Unicode code points)
  1856. D \p{Hyphen: Y*} Supplanted by Line_Break property values;
  1857. see www.unicode.org/reports/tr14
  1858. (Single: \p{Hyphen}) (11)
  1859. \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
  1860. \p{Ideographic_Description_Characters})
  1861. (112_352)
  1862. \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (1_001_760
  1863. plus all above-Unicode code points)
  1864. \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (112_352)
  1865. \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (109_830)
  1866. \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (1_004_282
  1867. plus all above-Unicode code points)
  1868. \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (109_830)
  1869. \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
  1870. \p{Ideographic_Description_Characters})
  1871. (112_352)
  1872. \p{IDC: *} \p{ID_Continue: *}
  1873. \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
  1874. (81_404)
  1875. \p{Ideo: *} \p{Ideographic: *}
  1876. \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
  1877. (81_404)
  1878. \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_032_708
  1879. plus all above-Unicode code points)
  1880. \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (81_404)
  1881. X \p{Ideographic_Description_Characters} \p{Block=
  1882. Ideographic_Description_Characters}
  1883. (Short: \p{InIDC}) (16)
  1884. \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (109_830)
  1885. \p{IDS: *} \p{ID_Start: *}
  1886. \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
  1887. \p{IDSB}) (10)
  1888. \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
  1889. (1_114_102 plus all above-Unicode code
  1890. points)
  1891. \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  1892. \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
  1893. \p{IDST}) (2)
  1894. \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
  1895. (1_114_110 plus all above-Unicode code
  1896. points)
  1897. \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
  1898. \p{IDSB} \p{IDS_Binary_Operator} (=
  1899. \p{IDS_Binary_Operator=Y}) (10)
  1900. \p{IDSB: *} \p{IDS_Binary_Operator: *}
  1901. \p{IDST} \p{IDS_Trinary_Operator} (=
  1902. \p{IDS_Trinary_Operator=Y}) (2)
  1903. \p{IDST: *} \p{IDS_Trinary_Operator: *}
  1904. \p{Imperial_Aramaic} \p{Script=Imperial_Aramaic} (Short:
  1905. \p{Armi}; NOT \p{Block=
  1906. Imperial_Aramaic}) (31)
  1907. \p{In: *} \p{Present_In: *} (Perl extension)
  1908. X \p{In_*} \p{Block: *}
  1909. X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
  1910. Common_Indic_Number_Forms}) (16)
  1911. \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
  1912. (256)
  1913. \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
  1914. BottomAndRight}) (2)
  1915. \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (55)
  1916. \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
  1917. LeftAndRight}) (21)
  1918. \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_113_142
  1919. plus all above-Unicode code points)
  1920. \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
  1921. Overstruck}) (10)
  1922. \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (249)
  1923. \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (324)
  1924. \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
  1925. TopAndBottom}) (10)
  1926. \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
  1927. \p{InPC=TopAndBottomAndRight}) (1)
  1928. \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
  1929. TopAndLeft}) (6)
  1930. \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
  1931. \p{InPC=TopAndLeftAndRight}) (4)
  1932. \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
  1933. TopAndRight}) (13)
  1934. \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
  1935. VisualOrderLeft}) (19)
  1936. \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
  1937. (13)
  1938. \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (60)
  1939. \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
  1940. \p{InSC=BrahmiJoiningNumber}) (20)
  1941. \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
  1942. CantillationMark}) (52)
  1943. \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
  1944. (1805)
  1945. \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
  1946. ConsonantDead}) (7)
  1947. \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
  1948. ConsonantFinal}) (62)
  1949. \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
  1950. \p{InSC=ConsonantHeadLetter}) (5)
  1951. \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
  1952. ConsonantKiller}) (2)
  1953. \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
  1954. ConsonantMedial}) (22)
  1955. \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
  1956. \p{InSC=ConsonantPlaceholder}) (13)
  1957. \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
  1958. \p{InSC=ConsonantPrecedingRepha}) (1)
  1959. \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
  1960. ConsonantPrefixed}) (2)
  1961. \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
  1962. ConsonantSubjoined}) (61)
  1963. \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
  1964. \p{InSC=ConsonantSucceedingRepha}) (4)
  1965. \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
  1966. \p{InSC=ConsonantWithStacker}) (4)
  1967. \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
  1968. GeminationMark}) (2)
  1969. \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
  1970. InvisibleStacker}) (7)
  1971. \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1)
  1972. \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
  1973. ModifyingLetter}) (1)
  1974. \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
  1975. NonJoiner}) (1)
  1976. \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (23)
  1977. \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (420)
  1978. \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
  1979. NumberJoiner}) (1)
  1980. \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
  1981. (1_110_379 plus all above-Unicode code
  1982. points)
  1983. \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
  1984. PureKiller}) (16)
  1985. \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
  1986. RegisterShifter}) (2)
  1987. \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
  1988. SyllableModifier}) (20)
  1989. \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
  1990. ToneLetter}) (7)
  1991. \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
  1992. (42)
  1993. \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (22)
  1994. \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga}) (29)
  1995. \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30)
  1996. \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
  1997. VowelDependent}) (572)
  1998. \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
  1999. VowelIndependent}) (404)
  2000. \p{Inherited} \p{Script=Inherited} (Short: \p{Zinh})
  2001. (563)
  2002. \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
  2003. (Short: \p{Pi}) (12)
  2004. \p{InPC: *} \p{Indic_Positional_Category: *}
  2005. \p{InSC: *} \p{Indic_Syllabic_Category: *}
  2006. \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
  2007. \p{Phli}; NOT \p{Block=
  2008. Inscriptional_Pahlavi}) (27)
  2009. \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
  2010. (Short: \p{Prti}; NOT \p{Block=
  2011. Inscriptional_Parthian}) (30)
  2012. X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
  2013. IPA_Extensions}) (96)
  2014. X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
  2015. \p{InIPAExt}) (96)
  2016. \p{Is_*} \p{*} (Any exceptions are individually
  2017. noted beginning with the word NOT.) If
  2018. an entry has flag(s) at its beginning,
  2019. like "D", the "Is_" form has the same
  2020. flag(s)
  2021. \p{Ital} \p{Old_Italic} (= \p{Script=Old_Italic})
  2022. (NOT \p{Block=Old_Italic}) (36)
  2023. X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
  2024. (256)
  2025. X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
  2026. Hangul_Jamo_Extended_A}) (32)
  2027. X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
  2028. Hangul_Jamo_Extended_B}) (80)
  2029. \p{Java} \p{Javanese} (= \p{Script=Javanese}) (NOT
  2030. \p{Block=Javanese}) (90)
  2031. \p{Javanese} \p{Script=Javanese} (Short: \p{Java}; NOT
  2032. \p{Block=Javanese}) (90)
  2033. \p{Jg: *} \p{Joining_Group: *}
  2034. \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  2035. \p{Join_C: *} \p{Join_Control: *}
  2036. \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  2037. \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
  2038. plus all above-Unicode code points)
  2039. \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  2040. \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8)
  2041. \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
  2042. \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
  2043. \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (21)
  2044. \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
  2045. \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
  2046. BurushaskiYehBarree}) (2)
  2047. \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15)
  2048. \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
  2049. \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
  2050. \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
  2051. \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
  2052. \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
  2053. \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
  2054. \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14)
  2055. \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
  2056. \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
  2057. \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
  2058. HamzaOnHehGoal}) (1)
  2059. \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
  2060. \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
  2061. \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
  2062. \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
  2063. \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6)
  2064. \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
  2065. \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
  2066. \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
  2067. \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
  2068. \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
  2069. \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
  2070. ManichaeanAleph}) (1)
  2071. \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
  2072. (2)
  2073. \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
  2074. (2)
  2075. \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
  2076. ManichaeanDaleth}) (1)
  2077. \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
  2078. ManichaeanDhamedh}) (1)
  2079. \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
  2080. (1)
  2081. \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
  2082. ManichaeanGimel}) (2)
  2083. \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
  2084. (1)
  2085. \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
  2086. ManichaeanHundred}) (1)
  2087. \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
  2088. (3)
  2089. \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
  2090. ManichaeanLamedh}) (1)
  2091. \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) (1)
  2092. \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) (1)
  2093. \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) (1)
  2094. \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2)
  2095. \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
  2096. (3)
  2097. \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
  2098. (1)
  2099. \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
  2100. ManichaeanSadhe}) (1)
  2101. \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
  2102. ManichaeanSamekh}) (1)
  2103. \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) (1)
  2104. \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) (1)
  2105. \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
  2106. (1)
  2107. \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
  2108. ManichaeanThamedh}) (1)
  2109. \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
  2110. ManichaeanTwenty}) (1)
  2111. \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) (1)
  2112. \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
  2113. (1)
  2114. \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
  2115. ManichaeanZayin}) (2)
  2116. \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
  2117. \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
  2118. \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
  2119. (1_113_826 plus all above-Unicode code
  2120. points)
  2121. \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
  2122. \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
  2123. \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
  2124. \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
  2125. \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
  2126. \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
  2127. \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (18)
  2128. \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
  2129. \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
  2130. \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6)
  2131. \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
  2132. \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
  2133. \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
  2134. \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
  2135. \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1)
  2136. \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
  2137. \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
  2138. \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
  2139. \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
  2140. \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
  2141. \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
  2142. Hamza_On_Heh_Goal} (1)
  2143. \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
  2144. \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
  2145. \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (10)
  2146. \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
  2147. \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
  2148. \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
  2149. \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
  2150. \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
  2151. \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
  2152. \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
  2153. \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (426)
  2154. \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (426)
  2155. \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
  2156. \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (3)
  2157. \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3)
  2158. \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_852 plus
  2159. all above-Unicode code points)
  2160. \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (111)
  2161. \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (111)
  2162. \p{Joining_Type: T} \p{Joining_Type=Transparent} (1716)
  2163. \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1716)
  2164. \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_852
  2165. plus all above-Unicode code points)
  2166. \p{Jt: *} \p{Joining_Type: *}
  2167. \p{Kaithi} \p{Script=Kaithi} (Short: \p{Kthi}; NOT
  2168. \p{Block=Kaithi}) (66)
  2169. \p{Kali} \p{Kayah_Li} (= \p{Script=Kayah_Li}) (NOT
  2170. \p{Block=Kayah_Li}) (47)
  2171. \p{Kana} \p{Katakana} (= \p{Script=Katakana}) (NOT
  2172. \p{Block=Katakana}) (300)
  2173. X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
  2174. Kana_Supplement}) (256)
  2175. X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
  2176. \p{InKanaSup}) (256)
  2177. X \p{Kanbun} \p{Block=Kanbun} (16)
  2178. X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
  2179. Kangxi_Radicals}) (224)
  2180. X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
  2181. \p{InKangxi}) (224)
  2182. \p{Kannada} \p{Script=Kannada} (Short: \p{Knda}; NOT
  2183. \p{Block=Kannada}) (87)
  2184. \p{Katakana} \p{Script=Katakana} (Short: \p{Kana}; NOT
  2185. \p{Block=Katakana}) (300)
  2186. X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
  2187. \p{Block=Katakana_Phonetic_Extensions})
  2188. (16)
  2189. X \p{Katakana_Phonetic_Extensions} \p{Block=
  2190. Katakana_Phonetic_Extensions} (Short:
  2191. \p{InKatakanaExt}) (16)
  2192. \p{Kayah_Li} \p{Script=Kayah_Li} (Short: \p{Kali}; NOT
  2193. \p{Block=Kayah_Li}) (47)
  2194. \p{Khar} \p{Kharoshthi} (= \p{Script=Kharoshthi})
  2195. (NOT \p{Block=Kharoshthi}) (65)
  2196. \p{Kharoshthi} \p{Script=Kharoshthi} (Short: \p{Khar};
  2197. NOT \p{Block=Kharoshthi}) (65)
  2198. \p{Khmer} \p{Script=Khmer} (Short: \p{Khmr}; NOT
  2199. \p{Block=Khmer}) (146)
  2200. X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  2201. \p{Khmr} \p{Khmer} (= \p{Script=Khmer}) (NOT
  2202. \p{Block=Khmer}) (146)
  2203. \p{Khoj} \p{Khojki} (= \p{Script=Khojki}) (NOT
  2204. \p{Block=Khojki}) (61)
  2205. \p{Khojki} \p{Script=Khojki} (Short: \p{Khoj}; NOT
  2206. \p{Block=Khojki}) (61)
  2207. \p{Khudawadi} \p{Script=Khudawadi} (Short: \p{Sind}; NOT
  2208. \p{Block=Khudawadi}) (69)
  2209. \p{Knda} \p{Kannada} (= \p{Script=Kannada}) (NOT
  2210. \p{Block=Kannada}) (87)
  2211. \p{Kthi} \p{Kaithi} (= \p{Script=Kaithi}) (NOT
  2212. \p{Block=Kaithi}) (66)
  2213. \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
  2214. (109_591)
  2215. X \p{L&} \p{Cased_Letter} (= \p{General_Category=
  2216. Cased_Letter}) (3646)
  2217. X \p{L_} \p{Cased_Letter} (= \p{General_Category=
  2218. Cased_Letter}) Note the trailing '_'
  2219. matters in spite of loose matching
  2220. rules. (3646)
  2221. \p{Lana} \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
  2222. \p{Block=Tai_Tham}) (127)
  2223. \p{Lao} \p{Script=Lao} (NOT \p{Block=Lao}) (67)
  2224. \p{Laoo} \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
  2225. Lao}) (67)
  2226. \p{Latin} \p{Script=Latin} (Short: \p{Latn}) (1349)
  2227. X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
  2228. Latin_1_Supplement}) (128)
  2229. X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
  2230. Latin_1_Supplement}) (128)
  2231. X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
  2232. \p{InLatin1}) (128)
  2233. X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
  2234. Latin_Extended_A}) (128)
  2235. X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
  2236. \p{Block=Latin_Extended_Additional})
  2237. (256)
  2238. X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
  2239. Latin_Extended_B}) (208)
  2240. X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
  2241. Latin_Extended_C}) (32)
  2242. X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
  2243. Latin_Extended_D}) (224)
  2244. X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
  2245. Latin_Extended_E}) (64)
  2246. X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
  2247. \p{InLatinExtA}) (128)
  2248. X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
  2249. (Short: \p{InLatinExtAdditional}) (256)
  2250. X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
  2251. \p{InLatinExtB}) (208)
  2252. X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
  2253. \p{InLatinExtC}) (32)
  2254. X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
  2255. \p{InLatinExtD}) (224)
  2256. X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
  2257. \p{InLatinExtE}) (64)
  2258. \p{Latn} \p{Latin} (= \p{Script=Latin}) (1349)
  2259. \p{Lb: *} \p{Line_Break: *}
  2260. \p{LC} \p{Cased_Letter} (= \p{General_Category=
  2261. Cased_Letter}) (3646)
  2262. \p{Lepc} \p{Lepcha} (= \p{Script=Lepcha}) (NOT
  2263. \p{Block=Lepcha}) (74)
  2264. \p{Lepcha} \p{Script=Lepcha} (Short: \p{Lepc}; NOT
  2265. \p{Block=Lepcha}) (74)
  2266. \p{Letter} \p{General_Category=Letter} (Short: \p{L})
  2267. (109_591)
  2268. \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
  2269. \p{Nl}) (236)
  2270. X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  2271. \p{Limb} \p{Limbu} (= \p{Script=Limbu}) (NOT
  2272. \p{Block=Limbu}) (68)
  2273. \p{Limbu} \p{Script=Limbu} (Short: \p{Limb}; NOT
  2274. \p{Block=Limbu}) (68)
  2275. \p{Lina} \p{Linear_A} (= \p{Script=Linear_A}) (NOT
  2276. \p{Block=Linear_A}) (341)
  2277. \p{Linb} \p{Linear_B} (= \p{Script=Linear_B}) (211)
  2278. \p{Line_Break: AI} \p{Line_Break=Ambiguous} (689)
  2279. \p{Line_Break: AL} \p{Line_Break=Alphabetic} (19_294)
  2280. \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_294)
  2281. \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (689)
  2282. \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
  2283. \p{Line_Break: BA} \p{Line_Break=Break_After} (206)
  2284. \p{Line_Break: BB} \p{Line_Break=Break_Before} (23)
  2285. \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
  2286. \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (206)
  2287. \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (23)
  2288. \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
  2289. \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
  2290. \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
  2291. \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
  2292. \p{Line_Break: CJ} \p{Line_Break=
  2293. Conditional_Japanese_Starter} (51)
  2294. \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (90)
  2295. \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
  2296. \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90)
  2297. \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (1957)
  2298. \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1957)
  2299. \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734)
  2300. \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
  2301. (51)
  2302. \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
  2303. \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
  2304. \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
  2305. \p{Line_Break: EX} \p{Line_Break=Exclamation} (36)
  2306. \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (36)
  2307. \p{Line_Break: GL} \p{Line_Break=Glue} (18)
  2308. \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
  2309. \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
  2310. \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
  2311. \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
  2312. \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
  2313. \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
  2314. \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
  2315. \p{Line_Break: ID} \p{Line_Break=Ideographic} (162_973)
  2316. \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (162_973)
  2317. \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
  2318. \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
  2319. \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6)
  2320. \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
  2321. \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
  2322. \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
  2323. \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
  2324. \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
  2325. \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
  2326. \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
  2327. \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
  2328. \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
  2329. \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
  2330. \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (29)
  2331. \p{Line_Break: NS} \p{Line_Break=Nonstarter} (29)
  2332. \p{Line_Break: NU} \p{Line_Break=Numeric} (542)
  2333. \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (542)
  2334. \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (85)
  2335. \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (85)
  2336. \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (30)
  2337. \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30)
  2338. \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (65)
  2339. \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65)
  2340. \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
  2341. \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39)
  2342. \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
  2343. \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
  2344. \p{Line_Break: SA} \p{Line_Break=Complex_Context} (734)
  2345. D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
  2346. \p{Line_Break: SP} \p{Line_Break=Space} (1)
  2347. \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
  2348. D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
  2349. should never appear in well-formed text,
  2350. and therefore shouldn't be the basis for
  2351. line breaking (Short: \p{Lb=SG}) (2048)
  2352. \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
  2353. \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (913_536 plus all
  2354. above-Unicode code points)
  2355. \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
  2356. \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
  2357. \p{Line_Break: XX} \p{Line_Break=Unknown} (913_536 plus all
  2358. above-Unicode code points)
  2359. \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
  2360. \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
  2361. \p{Line_Separator} \p{General_Category=Line_Separator}
  2362. (Short: \p{Zl}) (1)
  2363. \p{Linear_A} \p{Script=Linear_A} (Short: \p{Lina}; NOT
  2364. \p{Block=Linear_A}) (341)
  2365. \p{Linear_B} \p{Script=Linear_B} (Short: \p{Linb}) (211)
  2366. X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
  2367. X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
  2368. \p{Lisu} \p{Script=Lisu} (48)
  2369. \p{Ll} \p{Lowercase_Letter} (=
  2370. \p{General_Category=Lowercase_Letter})
  2371. (/i= General_Category=Cased_Letter)
  2372. (1984)
  2373. \p{Lm} \p{Modifier_Letter} (=
  2374. \p{General_Category=Modifier_Letter})
  2375. (248)
  2376. \p{Lo} \p{Other_Letter} (= \p{General_Category=
  2377. Other_Letter}) (105_697)
  2378. \p{LOE} \p{Logical_Order_Exception} (=
  2379. \p{Logical_Order_Exception=Y}) (19)
  2380. \p{LOE: *} \p{Logical_Order_Exception: *}
  2381. \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
  2382. \p{LOE}) (19)
  2383. \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
  2384. (1_114_093 plus all above-Unicode code
  2385. points)
  2386. \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19)
  2387. X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
  2388. \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
  2389. Cased=Yes) (2173)
  2390. \p{Lower: *} \p{Lowercase: *}
  2391. \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
  2392. Cased=Yes) (2173)
  2393. \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
  2394. No) (1_111_939 plus all above-Unicode
  2395. code points)
  2396. \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
  2397. Yes) (2173)
  2398. \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
  2399. (Short: \p{Ll}; /i= General_Category=
  2400. Cased_Letter) (1984)
  2401. \p{Lt} \p{Titlecase_Letter} (=
  2402. \p{General_Category=Titlecase_Letter})
  2403. (/i= General_Category=Cased_Letter) (31)
  2404. \p{Lu} \p{Uppercase_Letter} (=
  2405. \p{General_Category=Uppercase_Letter})
  2406. (/i= General_Category=Cased_Letter)
  2407. (1631)
  2408. \p{Lyci} \p{Lycian} (= \p{Script=Lycian}) (NOT
  2409. \p{Block=Lycian}) (29)
  2410. \p{Lycian} \p{Script=Lycian} (Short: \p{Lyci}; NOT
  2411. \p{Block=Lycian}) (29)
  2412. \p{Lydi} \p{Lydian} (= \p{Script=Lydian}) (NOT
  2413. \p{Block=Lydian}) (27)
  2414. \p{Lydian} \p{Script=Lydian} (Short: \p{Lydi}; NOT
  2415. \p{Block=Lydian}) (27)
  2416. \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
  2417. (1963)
  2418. \p{Mahajani} \p{Script=Mahajani} (Short: \p{Mahj}; NOT
  2419. \p{Block=Mahajani}) (39)
  2420. \p{Mahj} \p{Mahajani} (= \p{Script=Mahajani}) (NOT
  2421. \p{Block=Mahajani}) (39)
  2422. X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
  2423. Mahjong_Tiles}) (48)
  2424. X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
  2425. \p{InMahjong}) (48)
  2426. \p{Malayalam} \p{Script=Malayalam} (Short: \p{Mlym}; NOT
  2427. \p{Block=Malayalam}) (100)
  2428. \p{Mand} \p{Mandaic} (= \p{Script=Mandaic}) (NOT
  2429. \p{Block=Mandaic}) (29)
  2430. \p{Mandaic} \p{Script=Mandaic} (Short: \p{Mand}; NOT
  2431. \p{Block=Mandaic}) (29)
  2432. \p{Mani} \p{Manichaean} (= \p{Script=Manichaean})
  2433. (NOT \p{Block=Manichaean}) (51)
  2434. \p{Manichaean} \p{Script=Manichaean} (Short: \p{Mani};
  2435. NOT \p{Block=Manichaean}) (51)
  2436. \p{Mark} \p{General_Category=Mark} (Short: \p{M})
  2437. (1963)
  2438. \p{Math} \p{Math=Y} (2310)
  2439. \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
  2440. above-Unicode code points)
  2441. \p{Math: Y*} (Single: \p{Math}) (2310)
  2442. X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
  2443. \p{Block=
  2444. Mathematical_Alphanumeric_Symbols})
  2445. (1024)
  2446. X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
  2447. Mathematical_Operators}) (256)
  2448. \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
  2449. \p{Sm}) (948)
  2450. X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
  2451. Mathematical_Alphanumeric_Symbols}
  2452. (Short: \p{InMathAlphanum}) (1024)
  2453. X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
  2454. (Short: \p{InMathOperators}) (256)
  2455. \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
  2456. Spacing_Mark}) (383)
  2457. \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
  2458. Enclosing_Mark}) (13)
  2459. \p{Meetei_Mayek} \p{Script=Meetei_Mayek} (Short: \p{Mtei};
  2460. NOT \p{Block=Meetei_Mayek}) (79)
  2461. X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
  2462. Meetei_Mayek_Extensions}) (32)
  2463. X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
  2464. (Short: \p{InMeeteiMayekExt}) (32)
  2465. \p{Mend} \p{Mende_Kikakui} (= \p{Script=
  2466. Mende_Kikakui}) (NOT \p{Block=
  2467. Mende_Kikakui}) (213)
  2468. \p{Mende_Kikakui} \p{Script=Mende_Kikakui} (Short: \p{Mend};
  2469. NOT \p{Block=Mende_Kikakui}) (213)
  2470. \p{Merc} \p{Meroitic_Cursive} (= \p{Script=
  2471. Meroitic_Cursive}) (NOT \p{Block=
  2472. Meroitic_Cursive}) (90)
  2473. \p{Mero} \p{Meroitic_Hieroglyphs} (= \p{Script=
  2474. Meroitic_Hieroglyphs}) (32)
  2475. \p{Meroitic_Cursive} \p{Script=Meroitic_Cursive} (Short:
  2476. \p{Merc}; NOT \p{Block=
  2477. Meroitic_Cursive}) (90)
  2478. \p{Meroitic_Hieroglyphs} \p{Script=Meroitic_Hieroglyphs} (Short:
  2479. \p{Mero}) (32)
  2480. \p{Miao} \p{Script=Miao} (NOT \p{Block=Miao}) (133)
  2481. X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
  2482. \p{Block=
  2483. Miscellaneous_Symbols_And_Arrows}) (256)
  2484. X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
  2485. (= \p{Block=
  2486. Miscellaneous_Mathematical_Symbols_A})
  2487. (48)
  2488. X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
  2489. (= \p{Block=
  2490. Miscellaneous_Mathematical_Symbols_B})
  2491. (128)
  2492. X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
  2493. (= \p{Block=
  2494. Miscellaneous_Symbols_And_Pictographs})
  2495. (768)
  2496. X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
  2497. Miscellaneous_Symbols}) (256)
  2498. X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
  2499. Miscellaneous_Technical}) (256)
  2500. X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
  2501. Miscellaneous_Mathematical_Symbols_A}
  2502. (Short: \p{InMiscMathSymbolsA}) (48)
  2503. X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
  2504. Miscellaneous_Mathematical_Symbols_B}
  2505. (Short: \p{InMiscMathSymbolsB}) (128)
  2506. X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
  2507. \p{InMiscSymbols}) (256)
  2508. X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
  2509. Miscellaneous_Symbols_And_Arrows}
  2510. (Short: \p{InMiscArrows}) (256)
  2511. X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
  2512. Miscellaneous_Symbols_And_Pictographs}
  2513. (Short: \p{InMiscPictographs}) (768)
  2514. X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
  2515. (Short: \p{InMiscTechnical}) (256)
  2516. \p{Mlym} \p{Malayalam} (= \p{Script=Malayalam})
  2517. (NOT \p{Block=Malayalam}) (100)
  2518. \p{Mn} \p{Nonspacing_Mark} (=
  2519. \p{General_Category=Nonspacing_Mark})
  2520. (1567)
  2521. \p{Modi} \p{Script=Modi} (NOT \p{Block=Modi}) (79)
  2522. \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
  2523. (Short: \p{Lm}) (248)
  2524. X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
  2525. Spacing_Modifier_Letters}) (80)
  2526. \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
  2527. (Short: \p{Sk}) (121)
  2528. X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  2529. \p{Mong} \p{Mongolian} (= \p{Script=Mongolian})
  2530. (NOT \p{Block=Mongolian}) (153)
  2531. \p{Mongolian} \p{Script=Mongolian} (Short: \p{Mong}; NOT
  2532. \p{Block=Mongolian}) (153)
  2533. \p{Mro} \p{Script=Mro} (NOT \p{Block=Mro}) (43)
  2534. \p{Mroo} \p{Mro} (= \p{Script=Mro}) (NOT \p{Block=
  2535. Mro}) (43)
  2536. \p{Mtei} \p{Meetei_Mayek} (= \p{Script=
  2537. Meetei_Mayek}) (NOT \p{Block=
  2538. Meetei_Mayek}) (79)
  2539. \p{Mult} \p{Multani} (= \p{Script=Multani}) (NOT
  2540. \p{Block=Multani}) (38)
  2541. \p{Multani} \p{Script=Multani} (Short: \p{Mult}; NOT
  2542. \p{Block=Multani}) (38)
  2543. X \p{Music} \p{Musical_Symbols} (= \p{Block=
  2544. Musical_Symbols}) (256)
  2545. X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
  2546. \p{InMusic}) (256)
  2547. \p{Myanmar} \p{Script=Myanmar} (Short: \p{Mymr}; NOT
  2548. \p{Block=Myanmar}) (223)
  2549. X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
  2550. Myanmar_Extended_A}) (32)
  2551. X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
  2552. Myanmar_Extended_B}) (32)
  2553. X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
  2554. \p{InMyanmarExtA}) (32)
  2555. X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
  2556. \p{InMyanmarExtB}) (32)
  2557. \p{Mymr} \p{Myanmar} (= \p{Script=Myanmar}) (NOT
  2558. \p{Block=Myanmar}) (223)
  2559. \p{N} \pN \p{Number} (= \p{General_Category=Number})
  2560. (1433)
  2561. \p{Nabataean} \p{Script=Nabataean} (Short: \p{Nbat}; NOT
  2562. \p{Block=Nabataean}) (40)
  2563. \p{Narb} \p{Old_North_Arabian} (= \p{Script=
  2564. Old_North_Arabian}) (32)
  2565. X \p{NB} \p{No_Block} (= \p{Block=No_Block})
  2566. (849_856 plus all above-Unicode code
  2567. points)
  2568. \p{Nbat} \p{Nabataean} (= \p{Script=Nabataean})
  2569. (NOT \p{Block=Nabataean}) (40)
  2570. \p{NChar} \p{Noncharacter_Code_Point} (=
  2571. \p{Noncharacter_Code_Point=Y}) (66)
  2572. \p{NChar: *} \p{Noncharacter_Code_Point: *}
  2573. \p{Nd} \p{XPosixDigit} (= \p{General_Category=
  2574. Decimal_Number}) (550)
  2575. \p{New_Tai_Lue} \p{Script=New_Tai_Lue} (Short: \p{Talu};
  2576. NOT \p{Block=New_Tai_Lue}) (83)
  2577. \p{NFC_QC: *} \p{NFC_Quick_Check: *}
  2578. \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
  2579. \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110)
  2580. \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
  2581. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2582. (1120)
  2583. \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
  2584. \P{NFC_Quick_Check} NOR \P{NFC_QC})
  2585. (1120)
  2586. \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
  2587. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2588. (1_112_882 plus all above-Unicode code
  2589. points)
  2590. \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
  2591. \p{NFC_Quick_Check} NOR \p{NFC_QC})
  2592. (1_112_882 plus all above-Unicode code
  2593. points)
  2594. \p{NFD_QC: *} \p{NFD_Quick_Check: *}
  2595. \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
  2596. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2597. (13_232)
  2598. \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
  2599. \P{NFD_Quick_Check} NOR \P{NFD_QC})
  2600. (13_232)
  2601. \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
  2602. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2603. (1_100_880 plus all above-Unicode code
  2604. points)
  2605. \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
  2606. \p{NFD_Quick_Check} NOR \p{NFD_QC})
  2607. (1_100_880 plus all above-Unicode code
  2608. points)
  2609. \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
  2610. \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
  2611. \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110)
  2612. \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
  2613. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2614. (4793)
  2615. \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
  2616. \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
  2617. (4793)
  2618. \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
  2619. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2620. (1_109_209 plus all above-Unicode code
  2621. points)
  2622. \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
  2623. \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
  2624. (1_109_209 plus all above-Unicode code
  2625. points)
  2626. \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
  2627. \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
  2628. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2629. (16_893)
  2630. \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
  2631. \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
  2632. (16_893)
  2633. \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
  2634. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2635. (1_097_219 plus all above-Unicode code
  2636. points)
  2637. \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
  2638. \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
  2639. (1_097_219 plus all above-Unicode code
  2640. points)
  2641. \p{Nko} \p{Script=Nko} (NOT \p{NKo}) (59)
  2642. \p{Nkoo} \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
  2643. (59)
  2644. \p{Nl} \p{Letter_Number} (= \p{General_Category=
  2645. Letter_Number}) (236)
  2646. \p{No} \p{Other_Number} (= \p{General_Category=
  2647. Other_Number}) (647)
  2648. X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
  2649. (849_856 plus all above-Unicode code
  2650. points)
  2651. \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
  2652. \p{NChar}) (66)
  2653. \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
  2654. (1_114_046 plus all above-Unicode code
  2655. points)
  2656. \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
  2657. (66)
  2658. \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
  2659. (Short: \p{Mn}) (1567)
  2660. \p{Nt: *} \p{Numeric_Type: *}
  2661. \p{Number} \p{General_Category=Number} (Short: \p{N})
  2662. (1433)
  2663. X \p{Number_Forms} \p{Block=Number_Forms} (64)
  2664. \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (550)
  2665. \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (550)
  2666. \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
  2667. \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
  2668. \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_598 plus all
  2669. above-Unicode code points)
  2670. \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (836)
  2671. \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (836)
  2672. T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
  2673. T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (71)
  2674. T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
  2675. T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1)
  2676. T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
  2677. T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
  2678. T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (5)
  2679. T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
  2680. T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (3)
  2681. T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
  2682. T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (1)
  2683. T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (12)
  2684. T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6)
  2685. T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
  2686. T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
  2687. T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1)
  2688. T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (13)
  2689. T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1)
  2690. T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
  2691. T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
  2692. T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7)
  2693. T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (7)
  2694. T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
  2695. T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3)
  2696. T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
  2697. T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1)
  2698. T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (117)
  2699. T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
  2700. T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (117)
  2701. T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
  2702. T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (119)
  2703. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
  2704. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (111)
  2705. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
  2706. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (109)
  2707. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
  2708. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (96)
  2709. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
  2710. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (95)
  2711. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
  2712. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (91)
  2713. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
  2714. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (95)
  2715. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (53)
  2716. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
  2717. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
  2718. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
  2719. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
  2720. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
  2721. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
  2722. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
  2723. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
  2724. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
  2725. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (30)
  2726. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
  2727. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
  2728. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
  2729. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
  2730. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
  2731. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
  2732. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
  2733. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
  2734. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
  2735. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (15)
  2736. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
  2737. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
  2738. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
  2739. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
  2740. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
  2741. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
  2742. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
  2743. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
  2744. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
  2745. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (15)
  2746. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
  2747. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
  2748. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
  2749. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
  2750. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
  2751. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
  2752. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
  2753. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
  2754. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
  2755. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (26)
  2756. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (10)
  2757. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (10)
  2758. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (9)
  2759. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (9)
  2760. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (29)
  2761. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (4)
  2762. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (5)
  2763. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (4)
  2764. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (14)
  2765. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (4)
  2766. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (4)
  2767. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (4)
  2768. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (5)
  2769. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20)
  2770. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2)
  2771. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2)
  2772. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2)
  2773. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6)
  2774. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2)
  2775. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2)
  2776. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2)
  2777. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2)
  2778. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9)
  2779. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2)
  2780. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2)
  2781. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2)
  2782. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5)
  2783. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2)
  2784. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2)
  2785. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2)
  2786. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2)
  2787. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2)
  2788. T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1)
  2789. T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
  2790. T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1)
  2791. T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1)
  2792. T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
  2793. T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1)
  2794. T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1)
  2795. T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1)
  2796. T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1)
  2797. T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1)
  2798. T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1)
  2799. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  2800. (3)
  2801. T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
  2802. 10000000000}) (1)
  2803. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
  2804. 1000000000000}) (2)
  2805. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_598 plus all
  2806. above-Unicode code points)
  2807. \p{Nv: *} \p{Numeric_Value: *}
  2808. X \p{OCR} \p{Optical_Character_Recognition} (=
  2809. \p{Block=Optical_Character_Recognition})
  2810. (32)
  2811. \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
  2812. \p{Block=Ogham}) (29)
  2813. \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
  2814. \p{Block=Ogham}) (29)
  2815. \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
  2816. \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
  2817. \p{Old_Hungarian} \p{Script=Old_Hungarian} (Short: \p{Hung};
  2818. NOT \p{Block=Old_Hungarian}) (108)
  2819. \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
  2820. NOT \p{Block=Old_Italic}) (36)
  2821. \p{Old_North_Arabian} \p{Script=Old_North_Arabian} (Short:
  2822. \p{Narb}) (32)
  2823. \p{Old_Permic} \p{Script=Old_Permic} (Short: \p{Perm};
  2824. NOT \p{Block=Old_Permic}) (43)
  2825. \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
  2826. NOT \p{Block=Old_Persian}) (50)
  2827. \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
  2828. \p{Sarb}) (32)
  2829. \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
  2830. NOT \p{Block=Old_Turkic}) (73)
  2831. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  2832. (Short: \p{Ps}) (75)
  2833. X \p{Optical_Character_Recognition} \p{Block=
  2834. Optical_Character_Recognition} (Short:
  2835. \p{InOCR}) (32)
  2836. \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
  2837. \p{Block=Oriya}) (90)
  2838. \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
  2839. (NOT \p{Block=Old_Turkic}) (73)
  2840. X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
  2841. \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
  2842. \p{Block=Oriya}) (90)
  2843. \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
  2844. \p{Block=Osmanya}) (40)
  2845. \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
  2846. \p{Block=Osmanya}) (40)
  2847. \p{Other} \p{General_Category=Other} (Short: \p{C})
  2848. (993_590 plus all above-Unicode code
  2849. points)
  2850. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  2851. \p{Lo}) (105_697)
  2852. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  2853. \p{No}) (647)
  2854. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  2855. (Short: \p{Po}) (513)
  2856. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  2857. \p{So}) (5677)
  2858. \p{P} \pP \p{Punct} (= \p{General_Category=
  2859. Punctuation}) (NOT
  2860. \p{General_Punctuation}) (717)
  2861. \p{Pahawh_Hmong} \p{Script=Pahawh_Hmong} (Short: \p{Hmng};
  2862. NOT \p{Block=Pahawh_Hmong}) (127)
  2863. \p{Palm} \p{Palmyrene} (= \p{Script=Palmyrene}) (32)
  2864. \p{Palmyrene} \p{Script=Palmyrene} (Short: \p{Palm}) (32)
  2865. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  2866. (Short: \p{Zp}) (1)
  2867. \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
  2868. Y}) (2760)
  2869. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  2870. \p{Pat_WS} \p{Pattern_White_Space} (=
  2871. \p{Pattern_White_Space=Y}) (11)
  2872. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  2873. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  2874. (2760)
  2875. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  2876. (1_111_352 plus all above-Unicode code
  2877. points)
  2878. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
  2879. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  2880. \p{PatWS}) (11)
  2881. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  2882. (1_114_101 plus all above-Unicode code
  2883. points)
  2884. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
  2885. \p{Pau_Cin_Hau} \p{Script=Pau_Cin_Hau} (Short: \p{Pauc};
  2886. NOT \p{Block=Pau_Cin_Hau}) (57)
  2887. \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script=Pau_Cin_Hau})
  2888. (NOT \p{Block=Pau_Cin_Hau}) (57)
  2889. \p{Pc} \p{Connector_Punctuation} (=
  2890. \p{General_Category=
  2891. Connector_Punctuation}) (10)
  2892. \p{Pd} \p{Dash_Punctuation} (=
  2893. \p{General_Category=Dash_Punctuation})
  2894. (24)
  2895. \p{Pe} \p{Close_Punctuation} (=
  2896. \p{General_Category=Close_Punctuation})
  2897. (73)
  2898. \p{PerlSpace} \p{PosixSpace} (6)
  2899. \p{PerlWord} \p{PosixWord} (63)
  2900. \p{Perm} \p{Old_Permic} (= \p{Script=Old_Permic})
  2901. (NOT \p{Block=Old_Permic}) (43)
  2902. \p{Pf} \p{Final_Punctuation} (=
  2903. \p{General_Category=Final_Punctuation})
  2904. (10)
  2905. \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
  2906. \p{Block=Phags_Pa}) (56)
  2907. \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
  2908. \p{Block=Phags_Pa}) (56)
  2909. X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
  2910. Phaistos_Disc}) (48)
  2911. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
  2912. \p{InPhaistos}) (48)
  2913. \p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script=
  2914. Inscriptional_Pahlavi}) (NOT \p{Block=
  2915. Inscriptional_Pahlavi}) (27)
  2916. \p{Phlp} \p{Psalter_Pahlavi} (= \p{Script=
  2917. Psalter_Pahlavi}) (NOT \p{Block=
  2918. Psalter_Pahlavi}) (29)
  2919. \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
  2920. (NOT \p{Block=Phoenician}) (29)
  2921. \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
  2922. NOT \p{Block=Phoenician}) (29)
  2923. X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
  2924. Phonetic_Extensions}) (128)
  2925. X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
  2926. \p{Block=
  2927. Phonetic_Extensions_Supplement}) (64)
  2928. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
  2929. \p{InPhoneticExt}) (128)
  2930. X \p{Phonetic_Extensions_Supplement} \p{Block=
  2931. Phonetic_Extensions_Supplement} (Short:
  2932. \p{InPhoneticExtSup}) (64)
  2933. \p{Pi} \p{Initial_Punctuation} (=
  2934. \p{General_Category=
  2935. Initial_Punctuation}) (12)
  2936. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  2937. \p{Plrd} \p{Miao} (= \p{Script=Miao}) (NOT
  2938. \p{Block=Miao}) (133)
  2939. \p{Po} \p{Other_Punctuation} (=
  2940. \p{General_Category=Other_Punctuation})
  2941. (513)
  2942. \p{PosixAlnum} [A-Za-z0-9] (62)
  2943. \p{PosixAlpha} [A-Za-z] (52)
  2944. \p{PosixBlank} \t and ' ' (2)
  2945. \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
  2946. ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
  2947. FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
  2948. NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
  2949. GS, RS, US, and DEL (33)
  2950. \p{PosixDigit} [0-9] (10)
  2951. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-
  2952. z] (94)
  2953. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  2954. \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=
  2955. >?@[\\]^_`{|}~] (95)
  2956. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  2957. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  2958. vertical tab) (Short: \p{PerlSpace}) (6)
  2959. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  2960. \p{PosixWord} \w, restricted to ASCII = [A-Za-z0-9_]
  2961. (Short: \p{PerlWord}) (63)
  2962. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
  2963. \p{AHex}) (22)
  2964. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  2965. extension) (33_979)
  2966. T \p{Present_In: 2.0} Code point's usage introduced in version
  2967. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  2968. extension) (178_500)
  2969. T \p{Present_In: 2.1} Code point's usage introduced in version
  2970. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  2971. extension) (178_502)
  2972. T \p{Present_In: 3.0} Code point's usage introduced in version
  2973. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  2974. extension) (188_809)
  2975. T \p{Present_In: 3.1} Code point's usage introduced in version
  2976. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  2977. extension) (233_787)
  2978. T \p{Present_In: 3.2} Code point's usage introduced in version
  2979. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  2980. extension) (234_803)
  2981. T \p{Present_In: 4.0} Code point's usage introduced in version
  2982. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  2983. extension) (236_029)
  2984. T \p{Present_In: 4.1} Code point's usage introduced in version
  2985. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  2986. extension) (237_302)
  2987. T \p{Present_In: 5.0} Code point's usage introduced in version
  2988. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  2989. extension) (238_671)
  2990. T \p{Present_In: 5.1} Code point's usage introduced in version
  2991. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  2992. extension) (240_295)
  2993. T \p{Present_In: 5.2} Code point's usage introduced in version
  2994. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  2995. extension) (246_943)
  2996. T \p{Present_In: 6.0} Code point's usage introduced in version
  2997. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  2998. extension) (249_031)
  2999. T \p{Present_In: 6.1} Code point's usage introduced in version
  3000. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  3001. extension) (249_763)
  3002. T \p{Present_In: 6.2} Code point's usage introduced in version
  3003. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  3004. extension) (249_764)
  3005. T \p{Present_In: 6.3} Code point's usage introduced in version
  3006. 6.3 or earlier (Short: \p{In=6.3}) (Perl
  3007. extension) (249_769)
  3008. T \p{Present_In: 7.0} Code point's usage introduced in version
  3009. 7.0 or earlier (Short: \p{In=7.0}) (Perl
  3010. extension) (252_603)
  3011. T \p{Present_In: 8.0} Code point's usage introduced in version
  3012. 8.0 or earlier (Short: \p{In=8.0}) (Perl
  3013. extension) (260_319)
  3014. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  3015. Unassigned}) (Perl extension) (853_793
  3016. plus all above-Unicode code points)
  3017. \p{Print} \p{XPosixPrint} (258_138)
  3018. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  3019. \p{Co}; NOT \p{Private_Use_Area})
  3020. (137_468)
  3021. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  3022. \p{InPUA}) (6400)
  3023. \p{Prti} \p{Inscriptional_Parthian} (= \p{Script=
  3024. Inscriptional_Parthian}) (NOT \p{Block=
  3025. Inscriptional_Parthian}) (30)
  3026. \p{Ps} \p{Open_Punctuation} (=
  3027. \p{General_Category=Open_Punctuation})
  3028. (75)
  3029. \p{Psalter_Pahlavi} \p{Script=Psalter_Pahlavi} (Short:
  3030. \p{Phlp}; NOT \p{Block=Psalter_Pahlavi})
  3031. (29)
  3032. X \p{PUA} \p{Private_Use_Area} (= \p{Block=
  3033. Private_Use_Area}) (6400)
  3034. \p{Punct} \p{General_Category=Punctuation} (Short:
  3035. \p{P}; NOT \p{General_Punctuation}) (717)
  3036. \p{Punctuation} \p{Punct} (= \p{General_Category=
  3037. Punctuation}) (NOT
  3038. \p{General_Punctuation}) (717)
  3039. \p{Qaac} \p{Coptic} (= \p{Script=Coptic}) (NOT
  3040. \p{Block=Coptic}) (137)
  3041. \p{Qaai} \p{Inherited} (= \p{Script=Inherited})
  3042. (563)
  3043. \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
  3044. Y}) (30)
  3045. \p{QMark: *} \p{Quotation_Mark: *}
  3046. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  3047. (30)
  3048. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
  3049. plus all above-Unicode code points)
  3050. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30)
  3051. \p{Radical} \p{Radical=Y} (329)
  3052. \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
  3053. above-Unicode code points)
  3054. \p{Radical: Y*} (Single: \p{Radical}) (329)
  3055. \p{Rejang} \p{Script=Rejang} (Short: \p{Rjng}; NOT
  3056. \p{Block=Rejang}) (37)
  3057. \p{Rjng} \p{Rejang} (= \p{Script=Rejang}) (NOT
  3058. \p{Block=Rejang}) (37)
  3059. X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
  3060. Rumi_Numeral_Symbols}) (32)
  3061. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
  3062. \p{InRumi}) (32)
  3063. \p{Runic} \p{Script=Runic} (Short: \p{Runr}; NOT
  3064. \p{Block=Runic}) (86)
  3065. \p{Runr} \p{Runic} (= \p{Script=Runic}) (NOT
  3066. \p{Block=Runic}) (86)
  3067. \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
  3068. (6799)
  3069. \p{Samaritan} \p{Script=Samaritan} (Short: \p{Samr}; NOT
  3070. \p{Block=Samaritan}) (61)
  3071. \p{Samr} \p{Samaritan} (= \p{Script=Samaritan})
  3072. (NOT \p{Block=Samaritan}) (61)
  3073. \p{Sarb} \p{Old_South_Arabian} (= \p{Script=
  3074. Old_South_Arabian}) (32)
  3075. \p{Saur} \p{Saurashtra} (= \p{Script=Saurashtra})
  3076. (NOT \p{Block=Saurashtra}) (81)
  3077. \p{Saurashtra} \p{Script=Saurashtra} (Short: \p{Saur};
  3078. NOT \p{Block=Saurashtra}) (81)
  3079. \p{SB: *} \p{Sentence_Break: *}
  3080. \p{Sc} \p{Currency_Symbol} (=
  3081. \p{General_Category=Currency_Symbol})
  3082. (53)
  3083. \p{Sc: *} \p{Script: *}
  3084. \p{Script: Aghb} \p{Script=Caucasian_Albanian} (53)
  3085. \p{Script: Ahom} (Short: \p{Sc=Ahom}, \p{Ahom}) (57)
  3086. \p{Script: Anatolian_Hieroglyphs} (Short: \p{Sc=Hluw}, \p{Hluw})
  3087. (583)
  3088. \p{Script: Arab} \p{Script=Arabic} (1257)
  3089. \p{Script: Arabic} (Short: \p{Sc=Arab}, \p{Arab}) (1257)
  3090. \p{Script: Armenian} (Short: \p{Sc=Armn}, \p{Armn}) (93)
  3091. \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
  3092. \p{Script: Armn} \p{Script=Armenian} (93)
  3093. \p{Script: Avestan} (Short: \p{Sc=Avst}, \p{Avst}) (61)
  3094. \p{Script: Avst} \p{Script=Avestan} (61)
  3095. \p{Script: Bali} \p{Script=Balinese} (121)
  3096. \p{Script: Balinese} (Short: \p{Sc=Bali}, \p{Bali}) (121)
  3097. \p{Script: Bamu} \p{Script=Bamum} (657)
  3098. \p{Script: Bamum} (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
  3099. \p{Script: Bass} \p{Script=Bassa_Vah} (36)
  3100. \p{Script: Bassa_Vah} (Short: \p{Sc=Bass}, \p{Bass}) (36)
  3101. \p{Script: Batak} (Short: \p{Sc=Batk}, \p{Batk}) (56)
  3102. \p{Script: Batk} \p{Script=Batak} (56)
  3103. \p{Script: Beng} \p{Script=Bengali} (93)
  3104. \p{Script: Bengali} (Short: \p{Sc=Beng}, \p{Beng}) (93)
  3105. \p{Script: Bopo} \p{Script=Bopomofo} (70)
  3106. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
  3107. \p{Script: Brah} \p{Script=Brahmi} (109)
  3108. \p{Script: Brahmi} (Short: \p{Sc=Brah}, \p{Brah}) (109)
  3109. \p{Script: Brai} \p{Script=Braille} (256)
  3110. \p{Script: Braille} (Short: \p{Sc=Brai}, \p{Brai}) (256)
  3111. \p{Script: Bugi} \p{Script=Buginese} (30)
  3112. \p{Script: Buginese} (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
  3113. \p{Script: Buhd} \p{Script=Buhid} (20)
  3114. \p{Script: Buhid} (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
  3115. \p{Script: Cakm} \p{Script=Chakma} (67)
  3116. \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
  3117. (710)
  3118. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
  3119. \p{Script: Cari} \p{Script=Carian} (49)
  3120. \p{Script: Carian} (Short: \p{Sc=Cari}, \p{Cari}) (49)
  3121. \p{Script: Caucasian_Albanian} (Short: \p{Sc=Aghb}, \p{Aghb}) (53)
  3122. \p{Script: Chakma} (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
  3123. \p{Script: Cham} (Short: \p{Sc=Cham}, \p{Cham}) (83)
  3124. \p{Script: Cher} \p{Script=Cherokee} (172)
  3125. \p{Script: Cherokee} (Short: \p{Sc=Cher}, \p{Cher}) (172)
  3126. \p{Script: Common} (Short: \p{Sc=Zyyy}, \p{Zyyy}) (7179)
  3127. \p{Script: Copt} \p{Script=Coptic} (137)
  3128. \p{Script: Coptic} (Short: \p{Sc=Copt}, \p{Copt}) (137)
  3129. \p{Script: Cprt} \p{Script=Cypriot} (55)
  3130. \p{Script: Cuneiform} (Short: \p{Sc=Xsux}, \p{Xsux}) (1234)
  3131. \p{Script: Cypriot} (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
  3132. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}, \p{Cyrl}) (434)
  3133. \p{Script: Cyrl} \p{Script=Cyrillic} (434)
  3134. \p{Script: Deseret} (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
  3135. \p{Script: Deva} \p{Script=Devanagari} (154)
  3136. \p{Script: Devanagari} (Short: \p{Sc=Deva}, \p{Deva}) (154)
  3137. \p{Script: Dsrt} \p{Script=Deseret} (80)
  3138. \p{Script: Dupl} \p{Script=Duployan} (143)
  3139. \p{Script: Duployan} (Short: \p{Sc=Dupl}, \p{Dupl}) (143)
  3140. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
  3141. \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
  3142. (1071)
  3143. \p{Script: Elba} \p{Script=Elbasan} (40)
  3144. \p{Script: Elbasan} (Short: \p{Sc=Elba}, \p{Elba}) (40)
  3145. \p{Script: Ethi} \p{Script=Ethiopic} (495)
  3146. \p{Script: Ethiopic} (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
  3147. \p{Script: Geor} \p{Script=Georgian} (127)
  3148. \p{Script: Georgian} (Short: \p{Sc=Geor}, \p{Geor}) (127)
  3149. \p{Script: Glag} \p{Script=Glagolitic} (94)
  3150. \p{Script: Glagolitic} (Short: \p{Sc=Glag}, \p{Glag}) (94)
  3151. \p{Script: Goth} \p{Script=Gothic} (27)
  3152. \p{Script: Gothic} (Short: \p{Sc=Goth}, \p{Goth}) (27)
  3153. \p{Script: Gran} \p{Script=Grantha} (85)
  3154. \p{Script: Grantha} (Short: \p{Sc=Gran}, \p{Gran}) (85)
  3155. \p{Script: Greek} (Short: \p{Sc=Grek}, \p{Grek}) (516)
  3156. \p{Script: Grek} \p{Script=Greek} (516)
  3157. \p{Script: Gujarati} (Short: \p{Sc=Gujr}, \p{Gujr}) (85)
  3158. \p{Script: Gujr} \p{Script=Gujarati} (85)
  3159. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}, \p{Guru}) (79)
  3160. \p{Script: Guru} \p{Script=Gurmukhi} (79)
  3161. \p{Script: Han} (Short: \p{Sc=Han}, \p{Han}) (81_734)
  3162. \p{Script: Hang} \p{Script=Hangul} (11_739)
  3163. \p{Script: Hangul} (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
  3164. \p{Script: Hani} \p{Script=Han} (81_734)
  3165. \p{Script: Hano} \p{Script=Hanunoo} (21)
  3166. \p{Script: Hanunoo} (Short: \p{Sc=Hano}, \p{Hano}) (21)
  3167. \p{Script: Hatr} \p{Script=Hatran} (26)
  3168. \p{Script: Hatran} (Short: \p{Sc=Hatr}, \p{Hatr}) (26)
  3169. \p{Script: Hebr} \p{Script=Hebrew} (133)
  3170. \p{Script: Hebrew} (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
  3171. \p{Script: Hira} \p{Script=Hiragana} (91)
  3172. \p{Script: Hiragana} (Short: \p{Sc=Hira}, \p{Hira}) (91)
  3173. \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (583)
  3174. \p{Script: Hmng} \p{Script=Pahawh_Hmong} (127)
  3175. \p{Script: Hung} \p{Script=Old_Hungarian} (108)
  3176. \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
  3177. \p{Script: Inherited} (Short: \p{Sc=Zinh}, \p{Zinh}) (563)
  3178. \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
  3179. (27)
  3180. \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
  3181. (30)
  3182. \p{Script: Ital} \p{Script=Old_Italic} (36)
  3183. \p{Script: Java} \p{Script=Javanese} (90)
  3184. \p{Script: Javanese} (Short: \p{Sc=Java}, \p{Java}) (90)
  3185. \p{Script: Kaithi} (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
  3186. \p{Script: Kali} \p{Script=Kayah_Li} (47)
  3187. \p{Script: Kana} \p{Script=Katakana} (300)
  3188. \p{Script: Kannada} (Short: \p{Sc=Knda}, \p{Knda}) (87)
  3189. \p{Script: Katakana} (Short: \p{Sc=Kana}, \p{Kana}) (300)
  3190. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}, \p{Kali}) (47)
  3191. \p{Script: Khar} \p{Script=Kharoshthi} (65)
  3192. \p{Script: Kharoshthi} (Short: \p{Sc=Khar}, \p{Khar}) (65)
  3193. \p{Script: Khmer} (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
  3194. \p{Script: Khmr} \p{Script=Khmer} (146)
  3195. \p{Script: Khoj} \p{Script=Khojki} (61)
  3196. \p{Script: Khojki} (Short: \p{Sc=Khoj}, \p{Khoj}) (61)
  3197. \p{Script: Khudawadi} (Short: \p{Sc=Sind}, \p{Sind}) (69)
  3198. \p{Script: Knda} \p{Script=Kannada} (87)
  3199. \p{Script: Kthi} \p{Script=Kaithi} (66)
  3200. \p{Script: Lana} \p{Script=Tai_Tham} (127)
  3201. \p{Script: Lao} (Short: \p{Sc=Lao}, \p{Lao}) (67)
  3202. \p{Script: Laoo} \p{Script=Lao} (67)
  3203. \p{Script: Latin} (Short: \p{Sc=Latn}, \p{Latn}) (1349)
  3204. \p{Script: Latn} \p{Script=Latin} (1349)
  3205. \p{Script: Lepc} \p{Script=Lepcha} (74)
  3206. \p{Script: Lepcha} (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
  3207. \p{Script: Limb} \p{Script=Limbu} (68)
  3208. \p{Script: Limbu} (Short: \p{Sc=Limb}, \p{Limb}) (68)
  3209. \p{Script: Lina} \p{Script=Linear_A} (341)
  3210. \p{Script: Linb} \p{Script=Linear_B} (211)
  3211. \p{Script: Linear_A} (Short: \p{Sc=Lina}, \p{Lina}) (341)
  3212. \p{Script: Linear_B} (Short: \p{Sc=Linb}, \p{Linb}) (211)
  3213. \p{Script: Lisu} (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
  3214. \p{Script: Lyci} \p{Script=Lycian} (29)
  3215. \p{Script: Lycian} (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
  3216. \p{Script: Lydi} \p{Script=Lydian} (27)
  3217. \p{Script: Lydian} (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
  3218. \p{Script: Mahajani} (Short: \p{Sc=Mahj}, \p{Mahj}) (39)
  3219. \p{Script: Mahj} \p{Script=Mahajani} (39)
  3220. \p{Script: Malayalam} (Short: \p{Sc=Mlym}, \p{Mlym}) (100)
  3221. \p{Script: Mand} \p{Script=Mandaic} (29)
  3222. \p{Script: Mandaic} (Short: \p{Sc=Mand}, \p{Mand}) (29)
  3223. \p{Script: Mani} \p{Script=Manichaean} (51)
  3224. \p{Script: Manichaean} (Short: \p{Sc=Mani}, \p{Mani}) (51)
  3225. \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
  3226. \p{Script: Mend} \p{Script=Mende_Kikakui} (213)
  3227. \p{Script: Mende_Kikakui} (Short: \p{Sc=Mend}, \p{Mend}) (213)
  3228. \p{Script: Merc} \p{Script=Meroitic_Cursive} (90)
  3229. \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
  3230. \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}, \p{Merc}) (90)
  3231. \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}, \p{Mero})
  3232. (32)
  3233. \p{Script: Miao} (Short: \p{Sc=Miao}, \p{Miao}) (133)
  3234. \p{Script: Mlym} \p{Script=Malayalam} (100)
  3235. \p{Script: Modi} (Short: \p{Sc=Modi}, \p{Modi}) (79)
  3236. \p{Script: Mong} \p{Script=Mongolian} (153)
  3237. \p{Script: Mongolian} (Short: \p{Sc=Mong}, \p{Mong}) (153)
  3238. \p{Script: Mro} (Short: \p{Sc=Mro}, \p{Mro}) (43)
  3239. \p{Script: Mroo} \p{Script=Mro} (43)
  3240. \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
  3241. \p{Script: Mult} \p{Script=Multani} (38)
  3242. \p{Script: Multani} (Short: \p{Sc=Mult}, \p{Mult}) (38)
  3243. \p{Script: Myanmar} (Short: \p{Sc=Mymr}, \p{Mymr}) (223)
  3244. \p{Script: Mymr} \p{Script=Myanmar} (223)
  3245. \p{Script: Nabataean} (Short: \p{Sc=Nbat}, \p{Nbat}) (40)
  3246. \p{Script: Narb} \p{Script=Old_North_Arabian} (32)
  3247. \p{Script: Nbat} \p{Script=Nabataean} (40)
  3248. \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
  3249. \p{Script: Nko} (Short: \p{Sc=Nko}, \p{Nko}) (59)
  3250. \p{Script: Nkoo} \p{Script=Nko} (59)
  3251. \p{Script: Ogam} \p{Script=Ogham} (29)
  3252. \p{Script: Ogham} (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
  3253. \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}, \p{Olck}) (48)
  3254. \p{Script: Olck} \p{Script=Ol_Chiki} (48)
  3255. \p{Script: Old_Hungarian} (Short: \p{Sc=Hung}, \p{Hung}) (108)
  3256. \p{Script: Old_Italic} (Short: \p{Sc=Ital}, \p{Ital}) (36)
  3257. \p{Script: Old_North_Arabian} (Short: \p{Sc=Narb}, \p{Narb}) (32)
  3258. \p{Script: Old_Permic} (Short: \p{Sc=Perm}, \p{Perm}) (43)
  3259. \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
  3260. \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
  3261. \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
  3262. \p{Script: Oriya} (Short: \p{Sc=Orya}, \p{Orya}) (90)
  3263. \p{Script: Orkh} \p{Script=Old_Turkic} (73)
  3264. \p{Script: Orya} \p{Script=Oriya} (90)
  3265. \p{Script: Osma} \p{Script=Osmanya} (40)
  3266. \p{Script: Osmanya} (Short: \p{Sc=Osma}, \p{Osma}) (40)
  3267. \p{Script: Pahawh_Hmong} (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
  3268. \p{Script: Palm} \p{Script=Palmyrene} (32)
  3269. \p{Script: Palmyrene} (Short: \p{Sc=Palm}, \p{Palm}) (32)
  3270. \p{Script: Pau_Cin_Hau} (Short: \p{Sc=Pauc}, \p{Pauc}) (57)
  3271. \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (57)
  3272. \p{Script: Perm} \p{Script=Old_Permic} (43)
  3273. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  3274. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}, \p{Phag}) (56)
  3275. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
  3276. \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
  3277. \p{Script: Phnx} \p{Script=Phoenician} (29)
  3278. \p{Script: Phoenician} (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
  3279. \p{Script: Plrd} \p{Script=Miao} (133)
  3280. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
  3281. \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}, \p{Phlp}) (29)
  3282. \p{Script: Qaac} \p{Script=Coptic} (137)
  3283. \p{Script: Qaai} \p{Script=Inherited} (563)
  3284. \p{Script: Rejang} (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
  3285. \p{Script: Rjng} \p{Script=Rejang} (37)
  3286. \p{Script: Runic} (Short: \p{Sc=Runr}, \p{Runr}) (86)
  3287. \p{Script: Runr} \p{Script=Runic} (86)
  3288. \p{Script: Samaritan} (Short: \p{Sc=Samr}, \p{Samr}) (61)
  3289. \p{Script: Samr} \p{Script=Samaritan} (61)
  3290. \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
  3291. \p{Script: Saur} \p{Script=Saurashtra} (81)
  3292. \p{Script: Saurashtra} (Short: \p{Sc=Saur}, \p{Saur}) (81)
  3293. \p{Script: Sgnw} \p{Script=SignWriting} (672)
  3294. \p{Script: Sharada} (Short: \p{Sc=Shrd}, \p{Shrd}) (94)
  3295. \p{Script: Shavian} (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
  3296. \p{Script: Shaw} \p{Script=Shavian} (48)
  3297. \p{Script: Shrd} \p{Script=Sharada} (94)
  3298. \p{Script: Sidd} \p{Script=Siddham} (92)
  3299. \p{Script: Siddham} (Short: \p{Sc=Sidd}, \p{Sidd}) (92)
  3300. \p{Script: SignWriting} (Short: \p{Sc=Sgnw}, \p{Sgnw}) (672)
  3301. \p{Script: Sind} \p{Script=Khudawadi} (69)
  3302. \p{Script: Sinh} \p{Script=Sinhala} (110)
  3303. \p{Script: Sinhala} (Short: \p{Sc=Sinh}, \p{Sinh}) (110)
  3304. \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
  3305. \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}, \p{Sora}) (35)
  3306. \p{Script: Sund} \p{Script=Sundanese} (72)
  3307. \p{Script: Sundanese} (Short: \p{Sc=Sund}, \p{Sund}) (72)
  3308. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  3309. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
  3310. \p{Script: Syrc} \p{Script=Syriac} (77)
  3311. \p{Script: Syriac} (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
  3312. \p{Script: Tagalog} (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
  3313. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  3314. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
  3315. \p{Script: Tai_Le} (Short: \p{Sc=Tale}, \p{Tale}) (35)
  3316. \p{Script: Tai_Tham} (Short: \p{Sc=Lana}, \p{Lana}) (127)
  3317. \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
  3318. \p{Script: Takr} \p{Script=Takri} (66)
  3319. \p{Script: Takri} (Short: \p{Sc=Takr}, \p{Takr}) (66)
  3320. \p{Script: Tale} \p{Script=Tai_Le} (35)
  3321. \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
  3322. \p{Script: Tamil} (Short: \p{Sc=Taml}, \p{Taml}) (72)
  3323. \p{Script: Taml} \p{Script=Tamil} (72)
  3324. \p{Script: Tavt} \p{Script=Tai_Viet} (72)
  3325. \p{Script: Telu} \p{Script=Telugu} (96)
  3326. \p{Script: Telugu} (Short: \p{Sc=Telu}, \p{Telu}) (96)
  3327. \p{Script: Tfng} \p{Script=Tifinagh} (59)
  3328. \p{Script: Tglg} \p{Script=Tagalog} (20)
  3329. \p{Script: Thaa} \p{Script=Thaana} (50)
  3330. \p{Script: Thaana} (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
  3331. \p{Script: Thai} (Short: \p{Sc=Thai}, \p{Thai}) (86)
  3332. \p{Script: Tibetan} (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
  3333. \p{Script: Tibt} \p{Script=Tibetan} (207)
  3334. \p{Script: Tifinagh} (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
  3335. \p{Script: Tirh} \p{Script=Tirhuta} (82)
  3336. \p{Script: Tirhuta} (Short: \p{Sc=Tirh}, \p{Tirh}) (82)
  3337. \p{Script: Ugar} \p{Script=Ugaritic} (31)
  3338. \p{Script: Ugaritic} (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
  3339. \p{Script: Unknown} (Short: \p{Sc=Zzzz}, \p{Zzzz}) (993_375
  3340. plus all above-Unicode code points)
  3341. \p{Script: Vai} (Short: \p{Sc=Vai}, \p{Vai}) (300)
  3342. \p{Script: Vaii} \p{Script=Vai} (300)
  3343. \p{Script: Wara} \p{Script=Warang_Citi} (84)
  3344. \p{Script: Warang_Citi} (Short: \p{Sc=Wara}, \p{Wara}) (84)
  3345. \p{Script: Xpeo} \p{Script=Old_Persian} (50)
  3346. \p{Script: Xsux} \p{Script=Cuneiform} (1234)
  3347. \p{Script: Yi} (Short: \p{Sc=Yi}, \p{Yi}) (1220)
  3348. \p{Script: Yiii} \p{Script=Yi} (1220)
  3349. \p{Script: Zinh} \p{Script=Inherited} (563)
  3350. \p{Script: Zyyy} \p{Script=Common} (7179)
  3351. \p{Script: Zzzz} \p{Script=Unknown} (993_375 plus all
  3352. above-Unicode code points)
  3353. \p{Script_Extensions: Aghb} \p{Script_Extensions=
  3354. Caucasian_Albanian} (53)
  3355. \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}) (57)
  3356. \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw})
  3357. (583)
  3358. \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1301)
  3359. \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}) (1301)
  3360. \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}) (94)
  3361. \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
  3362. (31)
  3363. \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
  3364. \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}) (61)
  3365. \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  3366. \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  3367. \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}) (121)
  3368. \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  3369. \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}) (657)
  3370. \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
  3371. \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}) (36)
  3372. \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}) (56)
  3373. \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  3374. \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (97)
  3375. \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}) (97)
  3376. \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (306)
  3377. \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}) (306)
  3378. \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
  3379. \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}) (109)
  3380. \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  3381. \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}) (256)
  3382. \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  3383. \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}) (31)
  3384. \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  3385. \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}) (22)
  3386. \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
  3387. \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans})
  3388. (710)
  3389. \p{Script_Extensions: Cans} \p{Script_Extensions=
  3390. Canadian_Aboriginal} (710)
  3391. \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  3392. \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}) (49)
  3393. \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb})
  3394. (53)
  3395. \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}) (87)
  3396. \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}) (83)
  3397. \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
  3398. \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}) (172)
  3399. \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}) (6772)
  3400. \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
  3401. \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}) (165)
  3402. \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  3403. \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}) (1234)
  3404. \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}) (112)
  3405. \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}) (436)
  3406. \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (436)
  3407. \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}) (80)
  3408. \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
  3409. \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}) (210)
  3410. \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  3411. \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
  3412. \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}) (147)
  3413. \p{Script_Extensions: Egyp} \p{Script_Extensions=
  3414. Egyptian_Hieroglyphs} (1071)
  3415. \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp})
  3416. (1071)
  3417. \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
  3418. \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}) (40)
  3419. \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  3420. \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}) (495)
  3421. \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (128)
  3422. \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}) (128)
  3423. \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (96)
  3424. \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}) (96)
  3425. \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  3426. \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}) (27)
  3427. \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
  3428. \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}) (113)
  3429. \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}) (520)
  3430. \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (520)
  3431. \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}) (99)
  3432. \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (99)
  3433. \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}) (93)
  3434. \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
  3435. \p{Script_Extensions: Han} (Short: \p{Scx=Han}) (82_007)
  3436. \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_971)
  3437. \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}) (11_971)
  3438. \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (82_007)
  3439. \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  3440. \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}) (23)
  3441. \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
  3442. \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}) (26)
  3443. \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
  3444. \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}) (133)
  3445. \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (356)
  3446. \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}) (356)
  3447. \p{Script_Extensions: Hluw} \p{Script_Extensions=
  3448. Anatolian_Hieroglyphs} (583)
  3449. \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
  3450. (127)
  3451. \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
  3452. (108)
  3453. \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}) (31)
  3454. \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}) (495)
  3455. \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli})
  3456. (27)
  3457. \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
  3458. Prti}) (30)
  3459. \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (36)
  3460. \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  3461. \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}) (91)
  3462. \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}) (86)
  3463. \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  3464. \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (565)
  3465. \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}) (93)
  3466. \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}) (565)
  3467. \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}) (48)
  3468. \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
  3469. \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}) (65)
  3470. \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}) (146)
  3471. \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  3472. \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (71)
  3473. \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}) (71)
  3474. \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}) (81)
  3475. \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (93)
  3476. \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
  3477. \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  3478. \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}) (67)
  3479. \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
  3480. \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}) (1368)
  3481. \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1368)
  3482. \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  3483. \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}) (74)
  3484. \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
  3485. \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}) (69)
  3486. \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (341)
  3487. \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  3488. \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}) (341)
  3489. \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}) (268)
  3490. \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}) (48)
  3491. \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  3492. \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}) (29)
  3493. \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  3494. \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}) (27)
  3495. \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}) (61)
  3496. \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
  3497. \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}) (105)
  3498. \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  3499. \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}) (30)
  3500. \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
  3501. \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}) (52)
  3502. \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}) (79)
  3503. \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
  3504. (213)
  3505. \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend}) (213)
  3506. \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
  3507. (90)
  3508. \p{Script_Extensions: Mero} \p{Script_Extensions=
  3509. Meroitic_Hieroglyphs} (32)
  3510. \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}) (90)
  3511. \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero})
  3512. (32)
  3513. \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}) (133)
  3514. \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (105)
  3515. \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}) (89)
  3516. \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (156)
  3517. \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}) (156)
  3518. \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}) (43)
  3519. \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
  3520. \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  3521. \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
  3522. \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}) (48)
  3523. \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}) (224)
  3524. \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
  3525. \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}) (40)
  3526. \p{Script_Extensions: Narb} \p{Script_Extensions=
  3527. Old_North_Arabian} (32)
  3528. \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
  3529. \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}) (83)
  3530. \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}) (59)
  3531. \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
  3532. \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  3533. \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}) (29)
  3534. \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}) (48)
  3535. \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  3536. \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung}) (108)
  3537. \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}) (36)
  3538. \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb}) (32)
  3539. \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}) (44)
  3540. \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}) (50)
  3541. \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}) (32)
  3542. \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}) (73)
  3543. \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}) (94)
  3544. \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  3545. \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
  3546. \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  3547. \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}) (40)
  3548. \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng}) (127)
  3549. \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
  3550. \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}) (32)
  3551. \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}) (57)
  3552. \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
  3553. \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
  3554. \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  3555. \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}) (59)
  3556. \p{Script_Extensions: Phli} \p{Script_Extensions=
  3557. Inscriptional_Pahlavi} (27)
  3558. \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
  3559. (30)
  3560. \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  3561. \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}) (29)
  3562. \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
  3563. \p{Script_Extensions: Prti} \p{Script_Extensions=
  3564. Inscriptional_Parthian} (30)
  3565. \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp}) (30)
  3566. \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
  3567. \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (495)
  3568. \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}) (37)
  3569. \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  3570. \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}) (86)
  3571. \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
  3572. \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}) (61)
  3573. \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  3574. \p{Script_Extensions: Sarb} \p{Script_Extensions=
  3575. Old_South_Arabian} (32)
  3576. \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (81)
  3577. \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}) (81)
  3578. \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
  3579. \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}) (100)
  3580. \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}) (48)
  3581. \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  3582. \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
  3583. \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
  3584. \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}) (92)
  3585. \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}) (672)
  3586. \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
  3587. \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
  3588. \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}) (112)
  3589. \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  3590. \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}) (35)
  3591. \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  3592. \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}) (72)
  3593. \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
  3594. \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}) (56)
  3595. \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
  3596. \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}) (93)
  3597. \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}) (22)
  3598. \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  3599. \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}) (20)
  3600. \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}) (45)
  3601. \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}) (127)
  3602. \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}) (72)
  3603. \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
  3604. \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}) (78)
  3605. \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  3606. \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  3607. \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}) (78)
  3608. \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (78)
  3609. \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  3610. \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
  3611. \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}) (101)
  3612. \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  3613. \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  3614. \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
  3615. \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}) (65)
  3616. \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}) (86)
  3617. \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}) (207)
  3618. \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  3619. \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}) (59)
  3620. \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
  3621. \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}) (94)
  3622. \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  3623. \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}) (31)
  3624. \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}) (993_375 plus
  3625. all above-Unicode code points)
  3626. \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}) (300)
  3627. \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  3628. \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
  3629. \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}) (84)
  3630. \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  3631. \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
  3632. \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}) (1246)
  3633. \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  3634. \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (495)
  3635. \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6772)
  3636. \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (993_375
  3637. plus all above-Unicode code points)
  3638. \p{Scx: *} \p{Script_Extensions: *}
  3639. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  3640. \p{SD: *} \p{Soft_Dotted: *}
  3641. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  3642. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  3643. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
  3644. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187)
  3645. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  3646. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1967)
  3647. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1967)
  3648. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (148)
  3649. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (148)
  3650. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (106_002)
  3651. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  3652. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2172)
  3653. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2172)
  3654. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (542)
  3655. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (542)
  3656. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (106_002)
  3657. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_001_140 plus all
  3658. above-Unicode code points)
  3659. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  3660. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  3661. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  3662. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  3663. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20)
  3664. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (117)
  3665. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (117)
  3666. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1782)
  3667. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1782)
  3668. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_001_140 plus
  3669. all above-Unicode code points)
  3670. \p{Separator} \p{General_Category=Separator} (Short:
  3671. \p{Z}) (19)
  3672. \p{Sgnw} \p{SignWriting} (= \p{Script=SignWriting})
  3673. (672)
  3674. \p{Sharada} \p{Script=Sharada} (Short: \p{Shrd}; NOT
  3675. \p{Block=Sharada}) (94)
  3676. \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
  3677. \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
  3678. X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
  3679. (16)
  3680. \p{Shrd} \p{Sharada} (= \p{Script=Sharada}) (NOT
  3681. \p{Block=Sharada}) (94)
  3682. \p{Sidd} \p{Siddham} (= \p{Script=Siddham}) (NOT
  3683. \p{Block=Siddham}) (92)
  3684. \p{Siddham} \p{Script=Siddham} (Short: \p{Sidd}; NOT
  3685. \p{Block=Siddham}) (92)
  3686. \p{SignWriting} \p{Script=SignWriting} (Short: \p{Sgnw})
  3687. (672)
  3688. \p{Sind} \p{Khudawadi} (= \p{Script=Khudawadi})
  3689. (NOT \p{Block=Khudawadi}) (69)
  3690. \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
  3691. \p{Block=Sinhala}) (110)
  3692. \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
  3693. \p{Block=Sinhala}) (110)
  3694. X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
  3695. \p{Sk} \p{Modifier_Symbol} (=
  3696. \p{General_Category=Modifier_Symbol})
  3697. (121)
  3698. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  3699. Math_Symbol}) (948)
  3700. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  3701. \p{InSmallForms}) (32)
  3702. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  3703. Small_Form_Variants}) (32)
  3704. \p{So} \p{Other_Symbol} (= \p{General_Category=
  3705. Other_Symbol}) (5677)
  3706. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  3707. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
  3708. all above-Unicode code points)
  3709. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  3710. \p{Sora} \p{Sora_Sompeng} (= \p{Script=
  3711. Sora_Sompeng}) (NOT \p{Block=
  3712. Sora_Sompeng}) (35)
  3713. \p{Sora_Sompeng} \p{Script=Sora_Sompeng} (Short: \p{Sora};
  3714. NOT \p{Block=Sora_Sompeng}) (35)
  3715. \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
  3716. \p{Space: *} \p{White_Space: *}
  3717. \p{Space_Separator} \p{General_Category=Space_Separator}
  3718. (Short: \p{Zs}) (17)
  3719. \p{SpacePerl} \p{XPosixSpace} (25)
  3720. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  3721. \p{Mc}) (383)
  3722. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  3723. (Short: \p{InModifierLetters}) (80)
  3724. X \p{Specials} \p{Block=Specials} (16)
  3725. \p{STerm} \p{STerm=Y} (120)
  3726. \p{STerm: N*} (Single: \P{STerm}) (1_113_992 plus all
  3727. above-Unicode code points)
  3728. \p{STerm: Y*} (Single: \p{STerm}) (120)
  3729. \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
  3730. (NOT \p{Block=Sundanese}) (72)
  3731. \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
  3732. \p{Block=Sundanese}) (72)
  3733. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  3734. Sundanese_Supplement}) (16)
  3735. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  3736. \p{InSundaneseSup}) (16)
  3737. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  3738. Supplemental_Arrows_A}) (16)
  3739. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  3740. Supplemental_Arrows_B}) (128)
  3741. X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
  3742. Supplemental_Arrows_C}) (256)
  3743. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  3744. \p{Block=
  3745. Supplemental_Mathematical_Operators})
  3746. (256)
  3747. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  3748. \p{Block=
  3749. Supplementary_Private_Use_Area_A})
  3750. (65_536)
  3751. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  3752. \p{Block=
  3753. Supplementary_Private_Use_Area_B})
  3754. (65_536)
  3755. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  3756. Supplemental_Punctuation}) (128)
  3757. X \p{Sup_Symbols_And_Pictographs}
  3758. \p{Supplemental_Symbols_And_Pictographs}
  3759. (= \p{Block=
  3760. Supplemental_Symbols_And_Pictographs})
  3761. (256)
  3762. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  3763. \p{Block=Superscripts_And_Subscripts})
  3764. (48)
  3765. X \p{Superscripts_And_Subscripts} \p{Block=
  3766. Superscripts_And_Subscripts} (Short:
  3767. \p{InSuperAndSub}) (48)
  3768. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  3769. \p{InSupArrowsA}) (16)
  3770. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  3771. \p{InSupArrowsB}) (128)
  3772. X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
  3773. \p{InSupArrowsC}) (256)
  3774. X \p{Supplemental_Mathematical_Operators} \p{Block=
  3775. Supplemental_Mathematical_Operators}
  3776. (Short: \p{InSupMathOperators}) (256)
  3777. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  3778. (Short: \p{InSupPunctuation}) (128)
  3779. X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
  3780. Supplemental_Symbols_And_Pictographs}
  3781. (Short: \p{InSupSymbolsAndPictographs})
  3782. (256)
  3783. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  3784. Supplementary_Private_Use_Area_A}
  3785. (Short: \p{InSupPUAA}) (65_536)
  3786. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  3787. Supplementary_Private_Use_Area_B}
  3788. (Short: \p{InSupPUAB}) (65_536)
  3789. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  3790. \p{Cs}) (2048)
  3791. X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
  3792. \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
  3793. Syloti_Nagri}) (NOT \p{Block=
  3794. Syloti_Nagri}) (44)
  3795. \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
  3796. NOT \p{Block=Syloti_Nagri}) (44)
  3797. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  3798. (6799)
  3799. \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
  3800. \p{Block=Syriac}) (77)
  3801. \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
  3802. \p{Block=Syriac}) (77)
  3803. \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
  3804. \p{Block=Tagalog}) (20)
  3805. \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
  3806. \p{Block=Tagbanwa}) (18)
  3807. \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
  3808. \p{Block=Tagbanwa}) (18)
  3809. X \p{Tags} \p{Block=Tags} (128)
  3810. \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
  3811. \p{Block=Tai_Le}) (35)
  3812. \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
  3813. \p{Block=Tai_Tham}) (127)
  3814. \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
  3815. \p{Block=Tai_Viet}) (72)
  3816. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  3817. Tai_Xuan_Jing_Symbols}) (96)
  3818. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  3819. \p{InTaiXuanJing}) (96)
  3820. \p{Takr} \p{Takri} (= \p{Script=Takri}) (NOT
  3821. \p{Block=Takri}) (66)
  3822. \p{Takri} \p{Script=Takri} (Short: \p{Takr}; NOT
  3823. \p{Block=Takri}) (66)
  3824. \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
  3825. \p{Block=Tai_Le}) (35)
  3826. \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
  3827. (NOT \p{Block=New_Tai_Lue}) (83)
  3828. \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
  3829. \p{Block=Tamil}) (72)
  3830. \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
  3831. \p{Block=Tamil}) (72)
  3832. \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
  3833. \p{Block=Tai_Viet}) (72)
  3834. \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
  3835. \p{Block=Telugu}) (96)
  3836. \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
  3837. \p{Block=Telugu}) (96)
  3838. \p{Term} \p{Terminal_Punctuation} (=
  3839. \p{Terminal_Punctuation=Y}) (238)
  3840. \p{Term: *} \p{Terminal_Punctuation: *}
  3841. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  3842. \p{Term}) (238)
  3843. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  3844. (1_113_874 plus all above-Unicode code
  3845. points)
  3846. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (238)
  3847. \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
  3848. \p{Block=Tifinagh}) (59)
  3849. \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
  3850. \p{Block=Tagalog}) (20)
  3851. \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
  3852. \p{Block=Thaana}) (50)
  3853. \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
  3854. \p{Block=Thaana}) (50)
  3855. \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
  3856. \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
  3857. \p{Block=Tibetan}) (207)
  3858. \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
  3859. \p{Block=Tibetan}) (207)
  3860. \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
  3861. \p{Block=Tifinagh}) (59)
  3862. \p{Tirh} \p{Tirhuta} (= \p{Script=Tirhuta}) (NOT
  3863. \p{Block=Tirhuta}) (82)
  3864. \p{Tirhuta} \p{Script=Tirhuta} (Short: \p{Tirh}; NOT
  3865. \p{Block=Tirhuta}) (82)
  3866. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  3867. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  3868. Cased=Yes) (31)
  3869. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  3870. (Short: \p{Lt}; /i= General_Category=
  3871. Cased_Letter) (31)
  3872. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  3873. Transport_And_Map_Symbols}) (128)
  3874. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  3875. (Short: \p{InTransportAndMap}) (128)
  3876. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  3877. (= \p{Block=
  3878. Unified_Canadian_Aboriginal_Syllabics})
  3879. (640)
  3880. X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
  3881. Extended} (= \p{Block=
  3882. Unified_Canadian_Aboriginal_Syllabics_-
  3883. Extended}) (80)
  3884. \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
  3885. \p{Block=Ugaritic}) (31)
  3886. \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
  3887. \p{Block=Ugaritic}) (31)
  3888. \p{UIdeo} \p{Unified_Ideograph} (=
  3889. \p{Unified_Ideograph=Y}) (80_388)
  3890. \p{UIdeo: *} \p{Unified_Ideograph: *}
  3891. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  3892. \p{Cn}) (853_859 plus all above-Unicode
  3893. code points)
  3894. \p{Unicode} \p{Any} (1_114_112)
  3895. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  3896. Unified_Canadian_Aboriginal_Syllabics}
  3897. (Short: \p{InUCAS}) (640)
  3898. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  3899. Unified_Canadian_Aboriginal_Syllabics_-
  3900. Extended} (Short: \p{InUCASExt}) (80)
  3901. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  3902. (80_388)
  3903. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  3904. (1_033_724 plus all above-Unicode code
  3905. points)
  3906. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (80_388)
  3907. \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
  3908. (993_375 plus all above-Unicode code
  3909. points)
  3910. \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  3911. Cased=Yes) (1751)
  3912. \p{Upper: *} \p{Uppercase: *}
  3913. \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  3914. Cased=Yes) (1751)
  3915. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  3916. No) (1_112_361 plus all above-Unicode
  3917. code points)
  3918. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  3919. Yes) (1751)
  3920. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  3921. (Short: \p{Lu}; /i= General_Category=
  3922. Cased_Letter) (1631)
  3923. \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
  3924. \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
  3925. Vai}) (300)
  3926. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  3927. NOT \p{Variation_Selectors}) (259)
  3928. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
  3929. plus all above-Unicode code points)
  3930. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  3931. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  3932. \p{InVS}) (16)
  3933. X \p{Variation_Selectors_Supplement} \p{Block=
  3934. Variation_Selectors_Supplement} (Short:
  3935. \p{InVSSup}) (240)
  3936. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  3937. Vedic_Extensions}) (48)
  3938. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  3939. \p{InVedicExt}) (48)
  3940. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  3941. \p{VertSpace} \v (7)
  3942. \p{VS} \p{Variation_Selector} (=
  3943. \p{Variation_Selector=Y}) (NOT
  3944. \p{Variation_Selectors}) (259)
  3945. \p{VS: *} \p{Variation_Selector: *}
  3946. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  3947. \p{Block=
  3948. Variation_Selectors_Supplement}) (240)
  3949. \p{Wara} \p{Warang_Citi} (= \p{Script=Warang_Citi})
  3950. (NOT \p{Block=Warang_Citi}) (84)
  3951. \p{Warang_Citi} \p{Script=Warang_Citi} (Short: \p{Wara};
  3952. NOT \p{Block=Warang_Citi}) (84)
  3953. \p{WB: *} \p{Word_Break: *}
  3954. \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
  3955. \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
  3956. plus all above-Unicode code points)
  3957. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25)
  3958. \p{Word} \p{XPosixWord} (112_482)
  3959. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (27_697)
  3960. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  3961. \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
  3962. \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
  3963. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
  3964. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1967)
  3965. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
  3966. \p{Word_Break: FO} \p{Word_Break=Format} (147)
  3967. \p{Word_Break: Format} (Short: \p{WB=FO}) (147)
  3968. \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
  3969. \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
  3970. \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  3971. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  3972. \p{Word_Break: LE} \p{Word_Break=ALetter} (27_697)
  3973. \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  3974. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
  3975. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
  3976. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  3977. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
  3978. \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
  3979. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  3980. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  3981. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  3982. \p{Word_Break: NU} \p{Word_Break=Numeric} (541)
  3983. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (541)
  3984. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_083_300 plus all
  3985. above-Unicode code points)
  3986. \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
  3987. \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  3988. \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
  3989. \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
  3990. \p{Word_Break: XX} \p{Word_Break=Other} (1_083_300 plus all
  3991. above-Unicode code points)
  3992. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
  3993. \p{WSpace: *} \p{White_Space: *}
  3994. \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  3995. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  3996. (112_333)
  3997. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_001_779
  3998. plus all above-Unicode code points)
  3999. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (112_333)
  4000. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (109_807)
  4001. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_004_305
  4002. plus all above-Unicode code points)
  4003. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (109_807)
  4004. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  4005. (112_333)
  4006. \p{XIDC: *} \p{XID_Continue: *}
  4007. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (109_807)
  4008. \p{XIDS: *} \p{XID_Start: *}
  4009. \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
  4010. (NOT \p{Block=Old_Persian}) (50)
  4011. \p{XPerlSpace} \p{XPosixSpace} (25)
  4012. \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
  4013. \p{Alnum}) (111_493)
  4014. \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
  4015. (110_943)
  4016. \p{XPosixBlank} \h, Horizontal white space (Short:
  4017. \p{Blank}) (18)
  4018. \p{XPosixCntrl} \p{General_Category=Control} Control
  4019. characters (Short: \p{Cc}) (65)
  4020. \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
  4021. + all other decimal digits (Short:
  4022. \p{Nd}) (550)
  4023. \p{XPosixGraph} Characters that are graphical (Short:
  4024. \p{Graph}) (258_121)
  4025. \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
  4026. Cased=Yes) (2173)
  4027. \p{XPosixPrint} Characters that are graphical plus space
  4028. characters (but no controls) (Short:
  4029. \p{Print}) (258_138)
  4030. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (726)
  4031. \p{XPosixSpace} \s including beyond ASCII and vertical tab
  4032. (Short: \p{SpacePerl}) (25)
  4033. \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
  4034. Cased=Yes) (1751)
  4035. \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
  4036. \pM + \p{Pc} (Short: \p{Word}) (112_482)
  4037. \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  4038. \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
  4039. (NOT \p{Block=Cuneiform}) (1234)
  4040. \p{Yi} \p{Script=Yi} (1220)
  4041. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  4042. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  4043. \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
  4044. X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
  4045. Yijing_Hexagram_Symbols}) (64)
  4046. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
  4047. (Short: \p{InYijing}) (64)
  4048. \p{Z} \pZ \p{Separator} (= \p{General_Category=
  4049. Separator}) (19)
  4050. \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
  4051. (563)
  4052. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  4053. Line_Separator}) (1)
  4054. \p{Zp} \p{Paragraph_Separator} (=
  4055. \p{General_Category=
  4056. Paragraph_Separator}) (1)
  4057. \p{Zs} \p{Space_Separator} (=
  4058. \p{General_Category=Space_Separator})
  4059. (17)
  4060. \p{Zyyy} \p{Common} (= \p{Script=Common}) (7179)
  4061. \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
  4062. (993_375 plus all above-Unicode code
  4063. points)
  4064. TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
  4065. stable) (= \p{Soft_Dotted=Y}) (46)
  4066. TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
  4067. stable) (= \p{Case_Ignorable=Y}) (2115)
  4068. TX\p{_CombAbove} (For internal use by Perl, not necessarily
  4069. stable) (= \p{Canonical_Combining_Class=
  4070. Above}) (402)

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This happens generally either because they are obsolete, or they exist for symmetry with other forms, but no language has yet been encoded that uses them. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
  • \p{Grapheme_Cluster_Break=Prepend}

Properties accessible through Unicode::UCD

The value of any Unicode (not including Perl extensions) character property mentioned above for any single code point is available through charprop() in Unicode::UCD. charprops_all() in Unicode::UCD returns the values of all the Unicode properties for a given code point.

Besides these, all the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.

Due to their nature, not all Unicode character properties are suitable for regular expression matches, nor prop_invlist() . The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. BUT, there are some ambiguities between some Perl extensions and the Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single form equivalents of Unicode properties, such as \p{sc} being a single-form equivalent of \p{gc=sc} , which is treated by prop_invmap() as the Script property, whose short name is sc . The table indicates the current ambiguities in the INFO column, beginning with the word "NOT" .

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are in Other Properties in perlunicode;

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. If a property is obsolete, etc, the entry will be flagged with the same characters used in the table in the section above, like D or S.

  1. NAME INFO
  2. Age
  3. AHex ASCII_Hex_Digit
  4. All (Perl extension). All code points,
  5. including those above Unicode. Same as
  6. qr/./s
  7. Alnum XPosixAlnum. (Perl extension)
  8. Alpha Alphabetic
  9. Alphabetic (Short: Alpha)
  10. Any (Perl extension). All Unicode code
  11. points: [\x{0000}-\x{10FFFF}]
  12. ASCII Block=ASCII. (Perl extension).
  13. [[:ASCII:]]
  14. ASCII_Hex_Digit (Short: AHex)
  15. Assigned (Perl extension). All assigned code points
  16. Bc Bidi_Class
  17. Bidi_C Bidi_Control
  18. Bidi_Class (Short: bc)
  19. Bidi_Control (Short: Bidi_C)
  20. Bidi_M Bidi_Mirrored
  21. Bidi_Mirrored (Short: Bidi_M)
  22. Bidi_Mirroring_Glyph (Short: bmg)
  23. Bidi_Paired_Bracket (Short: bpb)
  24. Bidi_Paired_Bracket_Type (Short: bpt)
  25. Blank XPosixBlank. (Perl extension)
  26. Blk Block
  27. Block (Short: blk)
  28. Bmg Bidi_Mirroring_Glyph
  29. Bpb Bidi_Paired_Bracket
  30. Bpt Bidi_Paired_Bracket_Type
  31. Canonical_Combining_Class (Short: ccc)
  32. Case_Folding (Short: cf)
  33. Case_Ignorable (Short: CI)
  34. Cased
  35. Category General_Category
  36. Ccc Canonical_Combining_Class
  37. CE Composition_Exclusion
  38. Cf Case_Folding; NOT 'cf' meaning
  39. 'General_Category=Format'
  40. Changes_When_Casefolded (Short: CWCF)
  41. Changes_When_Casemapped (Short: CWCM)
  42. Changes_When_Lowercased (Short: CWL)
  43. Changes_When_NFKC_Casefolded (Short: CWKCF)
  44. Changes_When_Titlecased (Short: CWT)
  45. Changes_When_Uppercased (Short: CWU)
  46. CI Case_Ignorable
  47. Cntrl General_Category=XPosixCntrl. (Perl
  48. extension)
  49. Comp_Ex Full_Composition_Exclusion
  50. Composition_Exclusion (Short: CE)
  51. CWCF Changes_When_Casefolded
  52. CWCM Changes_When_Casemapped
  53. CWKCF Changes_When_NFKC_Casefolded
  54. CWL Changes_When_Lowercased
  55. CWT Changes_When_Titlecased
  56. CWU Changes_When_Uppercased
  57. Dash
  58. Decomposition_Mapping (Short: dm)
  59. Decomposition_Type (Short: dt)
  60. Default_Ignorable_Code_Point (Short: DI)
  61. Dep Deprecated
  62. Deprecated (Short: Dep)
  63. DI Default_Ignorable_Code_Point
  64. Dia Diacritic
  65. Diacritic (Short: Dia)
  66. Digit General_Category=XPosixDigit. (Perl
  67. extension)
  68. Dm Decomposition_Mapping
  69. Dt Decomposition_Type
  70. Ea East_Asian_Width
  71. East_Asian_Width (Short: ea)
  72. Ext Extender
  73. Extender (Short: Ext)
  74. Full_Composition_Exclusion (Short: Comp_Ex)
  75. Gc General_Category
  76. GCB Grapheme_Cluster_Break
  77. General_Category (Short: gc)
  78. Gr_Base Grapheme_Base
  79. Gr_Ext Grapheme_Extend
  80. Graph XPosixGraph. (Perl extension)
  81. Grapheme_Base (Short: Gr_Base)
  82. Grapheme_Cluster_Break (Short: GCB)
  83. Grapheme_Extend (Short: Gr_Ext)
  84. Hangul_Syllable_Type (Short: hst)
  85. Hex Hex_Digit
  86. Hex_Digit (Short: Hex)
  87. HorizSpace XPosixBlank. (Perl extension)
  88. Hst Hangul_Syllable_Type
  89. D Hyphen Supplanted by Line_Break property values;
  90. see www.unicode.org/reports/tr14
  91. ID_Continue (Short: IDC)
  92. ID_Start (Short: IDS)
  93. IDC ID_Continue
  94. Ideo Ideographic
  95. Ideographic (Short: Ideo)
  96. IDS ID_Start
  97. IDS_Binary_Operator (Short: IDSB)
  98. IDS_Trinary_Operator (Short: IDST)
  99. IDSB IDS_Binary_Operator
  100. IDST IDS_Trinary_Operator
  101. In Present_In. (Perl extension)
  102. Indic_Positional_Category (Short: InPC)
  103. Indic_Syllabic_Category (Short: InSC)
  104. InPC Indic_Positional_Category
  105. InSC Indic_Syllabic_Category
  106. Isc ISO_Comment; NOT 'isc' meaning
  107. 'General_Category=Other'
  108. ISO_Comment (Short: isc)
  109. Jg Joining_Group
  110. Join_C Join_Control
  111. Join_Control (Short: Join_C)
  112. Joining_Group (Short: jg)
  113. Joining_Type (Short: jt)
  114. Jt Joining_Type
  115. Lb Line_Break
  116. Lc Lowercase_Mapping; NOT 'lc' meaning
  117. 'General_Category=Cased_Letter'
  118. Line_Break (Short: lb)
  119. LOE Logical_Order_Exception
  120. Logical_Order_Exception (Short: LOE)
  121. Lower Lowercase
  122. Lowercase (Short: Lower)
  123. Lowercase_Mapping (Short: lc)
  124. Math
  125. Na Name
  126. Na1 Unicode_1_Name
  127. Name (Short: na)
  128. Name_Alias
  129. NChar Noncharacter_Code_Point
  130. NFC_QC NFC_Quick_Check
  131. NFC_Quick_Check (Short: NFC_QC)
  132. NFD_QC NFD_Quick_Check
  133. NFD_Quick_Check (Short: NFD_QC)
  134. NFKC_Casefold (Short: NFKC_CF)
  135. NFKC_CF NFKC_Casefold
  136. NFKC_QC NFKC_Quick_Check
  137. NFKC_Quick_Check (Short: NFKC_QC)
  138. NFKD_QC NFKD_Quick_Check
  139. NFKD_Quick_Check (Short: NFKD_QC)
  140. Noncharacter_Code_Point (Short: NChar)
  141. Nt Numeric_Type
  142. Numeric_Type (Short: nt)
  143. Numeric_Value (Short: nv)
  144. Nv Numeric_Value
  145. Pat_Syn Pattern_Syntax
  146. Pat_WS Pattern_White_Space
  147. Pattern_Syntax (Short: Pat_Syn)
  148. Pattern_White_Space (Short: Pat_WS)
  149. Perl_Decimal_Digit (Perl extension)
  150. PerlSpace PosixSpace. (Perl extension)
  151. PerlWord PosixWord. (Perl extension)
  152. PosixAlnum (Perl extension). [A-Za-z0-9]
  153. PosixAlpha (Perl extension). [A-Za-z]
  154. PosixBlank (Perl extension). \t and ' '
  155. PosixCntrl (Perl extension). ASCII control
  156. characters: NUL, SOH, STX, ETX, EOT, ENQ,
  157. ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
  158. DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
  159. CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
  160. PosixDigit (Perl extension). [0-9]
  161. PosixGraph (Perl extension). [-!"#$%&'()*+,./:;<=
  162. >?@[\\]^_`{|}~0-9A-Za-z]
  163. PosixLower (Perl extension). [a-z]
  164. PosixPrint (Perl extension). [- 0-9A-Za-
  165. z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  166. PosixPunct (Perl extension). [-!"#$%&'()*+,./:;<=
  167. >?@[\\]^_`{|}~]
  168. PosixSpace (Perl extension). \t, \n, \cK, \f, \r,
  169. and ' '. (\cK is vertical tab)
  170. PosixUpper (Perl extension). [A-Z]
  171. PosixWord (Perl extension). \w, restricted to ASCII
  172. = [A-Za-z0-9_]
  173. PosixXDigit (Perl extension). [0-9A-Fa-f]
  174. Present_In (Short: In). (Perl extension)
  175. Print XPosixPrint. (Perl extension)
  176. Punct General_Category=Punct. (Perl extension)
  177. QMark Quotation_Mark
  178. Quotation_Mark (Short: QMark)
  179. Radical
  180. SB Sentence_Break
  181. Sc Script; NOT 'sc' meaning
  182. 'General_Category=Currency_Symbol'
  183. Scf Simple_Case_Folding
  184. Script (Short: sc)
  185. Script_Extensions (Short: scx)
  186. Scx Script_Extensions
  187. SD Soft_Dotted
  188. Sentence_Break (Short: SB)
  189. Sfc Simple_Case_Folding
  190. Simple_Case_Folding (Short: scf)
  191. Simple_Lowercase_Mapping (Short: slc)
  192. Simple_Titlecase_Mapping (Short: stc)
  193. Simple_Uppercase_Mapping (Short: suc)
  194. Slc Simple_Lowercase_Mapping
  195. Soft_Dotted (Short: SD)
  196. Space White_Space
  197. SpacePerl XPosixSpace. (Perl extension)
  198. Stc Simple_Titlecase_Mapping
  199. STerm
  200. Suc Simple_Uppercase_Mapping
  201. Tc Titlecase_Mapping
  202. Term Terminal_Punctuation
  203. Terminal_Punctuation (Short: Term)
  204. Title Titlecase. (Perl extension)
  205. Titlecase (Short: Title). (Perl extension). (=
  206. \p{Gc=Lt})
  207. Titlecase_Mapping (Short: tc)
  208. Uc Uppercase_Mapping
  209. UIdeo Unified_Ideograph
  210. Unicode Any. (Perl extension)
  211. Unicode_1_Name (Short: na1)
  212. Unified_Ideograph (Short: UIdeo)
  213. Upper Uppercase
  214. Uppercase (Short: Upper)
  215. Uppercase_Mapping (Short: uc)
  216. Variation_Selector (Short: VS)
  217. VertSpace (Perl extension). \v
  218. VS Variation_Selector
  219. WB Word_Break
  220. White_Space (Short: WSpace)
  221. Word XPosixWord. (Perl extension)
  222. Word_Break (Short: WB)
  223. WSpace White_Space
  224. XDigit XPosixXDigit. (Perl extension)
  225. XID_Continue (Short: XIDC)
  226. XID_Start (Short: XIDS)
  227. XIDC XID_Continue
  228. XIDS XID_Start
  229. XPerlSpace XPosixSpace. (Perl extension)
  230. XPosixAlnum (Short: Alnum). (Perl extension).
  231. Alphabetic and (decimal) Numeric
  232. XPosixAlpha (Perl extension)
  233. XPosixBlank (Short: Blank). (Perl extension). \h,
  234. Horizontal white space
  235. XPosixCntrl General_Category=XPosixCntrl (Short:
  236. Cntrl). (Perl extension). Control
  237. characters
  238. XPosixDigit General_Category=XPosixDigit (Short:
  239. Digit). (Perl extension). [0-9] + all
  240. other decimal digits
  241. XPosixGraph (Short: Graph). (Perl extension).
  242. Characters that are graphical
  243. XPosixLower (Perl extension)
  244. XPosixPrint (Short: Print). (Perl extension).
  245. Characters that are graphical plus space
  246. characters (but no controls)
  247. XPosixPunct (Perl extension). \p{Punct} + ASCII-range
  248. \p{Symbol}
  249. XPosixSpace (Perl extension). \s including beyond
  250. ASCII and vertical tab
  251. XPosixUpper (Perl extension)
  252. XPosixWord (Short: Word). (Perl extension). \w,
  253. including beyond ASCII; = \p{Alnum} + \pM
  254. + \p{Pc}
  255. XPosixXDigit (Short: XDigit). (Perl extension)

Properties accessible through other means

Certain properties are accessible also via core function calls. These are:

  1. Lowercase_Mapping lc() and lcfirst()
  2. Titlecase_Mapping ucfirst()
  3. Uppercase_Mapping uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F transliteration escape, and the fc operator.

And, the Name and Name_Aliases properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions; and functions charnames::viacode() , charnames::vianame() , and charnames::string_vianame() (which require a use charnames (); to be specified.

Finally, most properties related to decomposition are accessible via Unicode::Normalize.

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses). As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib} /unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib} /unicore/mktables and then re-compiling and installing. (%Config is available from the Config module).

Also, perl can be recompiled to operate on an earlier version of the Unicode standard. Further information is at $Config{privlib} /unicore/README.perl.

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation Tests

  • BidiCharacterTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • Index.txt

    Alphabetical index of Unicode characters

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.html

    Describes the format and contents of NamesList.txt

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • ReadMe.txt

    Documentation

  • StandardizedVariants.html

    Provides a visual display of the standard variant sequences derived from StandardizedVariants.txt.

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base http://www.unicode.org/ivd

  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

  • USourceGlyphs.pdf

    Pictures of the characters in USourceData.txt

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode