perluniprops

Perl 5 version 18.0 documentation

NAME

perluniprops - Index of Unicode Version 6.2.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is, the portion that is defined on single code points. (The section "Other information in the Unicode data base" below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See the section "Unicode character properties that are NOT accepted by Perl".)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section Properties accessible through Unicode::UCD.
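As a rough illustration of the non-regex route, the sketch below uses charinfo() from the standard Unicode::UCD module to look up a few properties of one code point. The choice of U+03B1 and of the hash keys printed is only for illustration; the module documents many more fields.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);

# Look up the Unicode database record for U+03B1.
my $info = charinfo(0x3B1);

# charinfo() returns a hash reference; these are a few of its keys.
printf "name:     %s\n", $info->{name};      # character name
printf "category: %s\n", $info->{category};  # General_Category (short value)
printf "script:   %s\n", $info->{script};    # Script
printf "block:    %s\n", $info->{block};     # Block
```

The exact spelling of the script and block values can vary between Perl releases, so code that depends on them should be checked against the installed Unicode::UCD.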

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see Other Properties in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see User-Defined Character Properties in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against. For example, \p{Script: Greek} and \p{Script=Greek} both mean to match characters whose Script property is Greek.

Single forms, like \p{Greek}, are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is just a shortcut for \p{Script=Greek}.) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word}. These are also listed in the table.
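The following minimal sketch exercises the compound and single forms just described. The test characters (U+03B1 and a Latin "a") are chosen only for illustration.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA
my $latin = "a";         # LATIN SMALL LETTER A

# Compound forms: property name, then '=' or ':', then the value.
my $eq_form    = $alpha =~ /\p{Script=Greek}/  ? 1 : 0;
my $colon_form = $alpha =~ /\p{Script: Greek}/ ? 1 : 0;

# Single form: a Perl-defined shortcut.
my $single_form = $alpha =~ /\p{Greek}/ ? 1 : 0;

# A Latin letter is not Greek.
my $latin_is_greek = $latin =~ /\p{Greek}/ ? 1 : 0;

# \p{Word} is a single form that is not a shortcut for a compound form.
my $word = $latin =~ /\p{Word}/ ? 1 : 0;

print "eq=$eq_form colon=$colon_form single=$single_form ",
      "latin_is_greek=$latin_is_greek word=$word\n";
# prints: eq=1 colon=1 single=1 latin_is_greek=0 word=1
```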

In parsing these constructs, Perl always ignores upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek}. But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{}) to "doesn't match" (for \P{}). Casing in this document is for improved legibility.
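A short sketch of both points, using U+03B1 as an arbitrary Greek test character:

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored: these three patterns are the same test.
my @same = map { $alpha =~ $_ ? 1 : 0 }
           (qr/\p{Greek}/, qr/\p{greek}/, qr/\p{GREEK}/);

# Case of the leading p/P is significant: \P{} is the complement of \p{}.
my $matches_p = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $matches_P = $alpha =~ /\P{Greek}/ ? 1 : 0;

print "same: @same\n";                       # prints: same: 1 1 1
print "\\p: $matches_p, \\P: $matches_P\n";  # prints: \p: 1, \P: 0
```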

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But a 'T' at the beginning of an entry in the table below means that tighter (stricter) rules are used for that entry:

  • Single form (\p{name}) tighter rules:

    White space, hyphens, and underscores ARE significant except for:

    • white space adjacent to a non-word character
    • underscores separating digits in numbers

    That means, for example, that you can freely add or remove white space adjacent to (but within) the braces without affecting the meaning.

  • Compound form (\p{name=value} or \p{name:value}) tighter rules:

    The tighter rules given above for the single form apply to everything to the right of the colon or equals; the looser rules still apply to everything to the left.

    That means, for example, that you can freely add or remove white space adjacent to (but within) the braces and the colon or equal sign.
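The looser and tighter rules above can be sketched as follows. The Old_Italic spellings are illustrative variations permitted by the loose rules; the Age spellings are limited to the ones the table itself lists for version 1.1, since Age values containing a period are flagged 'T'.

```perl
use strict;
use warnings;

my $old_italic_a = "\x{10300}";   # OLD ITALIC LETTER A

# Loose rules: white space, hyphens, and underscores inside the braces
# are ignored, so each of these names the same property.
my @loose = map { $old_italic_a =~ $_ ? 1 : 0 } (
    qr/\p{Old_Italic}/,
    qr/\p{OldItalic}/,
    qr/\p{Old-Italic}/,
    qr/\p{ Old Italic }/,
);

# Tighter rules: white space next to the colon is still insignificant,
# and the V1_1 spelling is a separate, loosely matched synonym.
my @tight = map { "A" =~ $_ ? 1 : 0 } (
    qr/\p{Age=1.1}/,
    qr/\p{Age: 1.1}/,
    qr/\p{Age=V1_1}/,
);

print "loose: @loose\n";
print "tight: @tight\n";
```

Every pattern here is expected to match its test character; a spelling that breaks the tighter rules would instead be rejected as an unknown property.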

Some properties are considered obsolete by Unicode, but are still available. There are several varieties of obsolescence:

  • Stabilized

    A property may be stabilized. Such a determination does not indicate that the property should or should not be used; instead it is a declaration that the property will not be maintained nor extended for newly encoded characters. Such properties are marked with an 'S' in the table.

  • Deprecated

    A property may be deprecated, perhaps because its original intent has been replaced by another property, or because its specification was somehow defective. This means that its use is strongly discouraged, so much so that a warning will be issued if it is used, unless the regular expression is in the scope of a no warnings 'deprecated' statement. A 'D' flags each such entry in the table, and the entry there for the longest, most descriptive version of the property will give the reason it is deprecated, and perhaps advice. Perl may issue such a warning even for properties that aren't officially deprecated by Unicode, when they used to match characters or code points but no longer do. This is to warn you that your program may not work as it did on earlier Unicode releases.

    Deprecated properties may be made unavailable in a future Perl version, so it is best to move away from them.

    A deprecated property may also be stabilized, but this fact is not shown.

  • Obsolete

    Properties marked with an 'O' in the table are considered (plain) obsolete. Generally this designation is given to properties that Unicode once used for internal purposes (but not any longer).

Some Perl extensions are present for backwards compatibility; their use is discouraged, but they are not obsolete. An 'X' flags each such entry in the table. Future Unicode versions may force some of these extensions to be removed without warning, replaced by another property with the same name that means something different. Use the equivalent shown instead.

Matches in the Block property have shortcuts that begin with "In_". For example, \p{Block=Latin1} can be written as \p{In_Latin1}. For backward compatibility, if there is no conflict with another shortcut, these may also be written as \p{Latin1} or \p{Is_Latin1}. But, N.B., there are numerous such conflicting shortcuts. Use of these forms for Block is discouraged, and they are flagged as such, not only because of the potential confusion as to what is meant, but also because a later release of Unicode may preempt the shortcut, and your program would no longer be correct. Use the "In_" form instead to avoid this, or even more clearly, use the compound form, e.g., \p{blk:latin1}. See Blocks in perlunicode for more information about this.
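The sketch below tries the three recommended Block spellings from the paragraph above, then shows why a bare name is ambiguous: a Block and a Script can share a name while covering different code points. The three test characters are chosen only for illustration.

```perl
use strict;
use warnings;

my $e_acute = "\x{E9}";     # LATIN SMALL LETTER E WITH ACUTE
my $shei    = "\x{3E2}";    # COPTIC CAPITAL LETTER SHEI
my $psili   = "\x{1F00}";   # GREEK SMALL LETTER ALPHA WITH PSILI

# Three spellings of the same Block test, as given in the text.
my @latin1 = map { $e_acute =~ $_ ? 1 : 0 }
             (qr/\p{Block=Latin1}/, qr/\p{In_Latin1}/, qr/\p{blk:latin1}/);

# U+03E2 lies in the Greek_And_Coptic block but is a Coptic letter;
# U+1F00 is a Greek letter but lies in the Greek_Extended block.
my %result = (
    shei_in_block   => ($shei  =~ /\p{InGreek}/ ? 1 : 0),
    shei_in_script  => ($shei  =~ /\p{Greek}/   ? 1 : 0),
    psili_in_block  => ($psili =~ /\p{InGreek}/ ? 1 : 0),
    psili_in_script => ($psili =~ /\p{Greek}/   ? 1 : 0),
);

print "latin1: @latin1\n";
print "$_=$result{$_}\n" for sort keys %result;
```

Here \p{InGreek} and \p{Greek} disagree on both characters, which is the kind of confusion the "In_" prefix and the compound form are meant to avoid.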

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. It shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.
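One way to check a parenthesized count is to test every code point, as in the sketch below. The helper count_matches is a hypothetical name introduced for this example. The two properties counted are ones whose membership is limited to ASCII, so their totals should not depend on the Unicode version; counts for most other properties will differ between releases.

```perl
use strict;
use warnings;
no warnings 'utf8';   # precaution: the loop visits surrogates and noncharacters

# Count the code points in 0..0x10FFFF that match a compiled pattern.
sub count_matches {
    my ($re) = @_;
    my $count = 0;
    for my $cp (0 .. 0x10FFFF) {
        $count++ if chr($cp) =~ $re;
    }
    return $count;
}

my $ahex  = count_matches(qr/\p{AHex}/);
my $ascii = count_matches(qr/\p{ASCII}/);

print "AHex:  $ahex\n";    # the table lists (22)
print "ASCII: $ascii\n";   # the table lists (128)
```

Each call walks more than a million code points, so this is suited to one-off checks rather than production code.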

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation

  (/i= other_property)

in the second column. Under case-insensitive matching they match the same code points as the property "other_property".
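A minimal sketch of the effect, assuming that \p{Lu} is one of the affected properties and that under /i it matches the cased letters. That entry lies outside this part of the table, so treat the choice of property as an assumption to confirm against the full listing.

```perl
use strict;
use warnings;

my %result = (
    # Without /i, a lowercase letter is not an Uppercase_Letter.
    lower_plain => ("a" =~ /\p{Lu}/     ? 1 : 0),
    # With /i, the property is widened to all cased letters.
    lower_ci    => ("a" =~ /\p{Lu}/i    ? 1 : 0),
    # Characters without case, such as digits, still do not match.
    digit_ci    => ("1" =~ /\p{Lu}/i    ? 1 : 0),
    # Most properties, such as a script, are unaffected by /i.
    greek_ci    => ("a" =~ /\p{Greek}/i ? 1 : 0),
);

print "$_=$result{$_}\n" for sort keys %result;
```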

There is no description given for most non-Perl defined properties (see http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

  \p{Gc: *}  \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

  \p{Is_*}  \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar}. "*" here is restricted to something not beginning with an underscore.
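The two wildcard entries above can be tried out as follows. This sketch covers the property-name synonym and the single-form "Is" prefix only; the test characters are arbitrary.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

# 'Gc' is a synonym for 'General_Category' ('L' is the Letter value).
my @gc = map { $alpha =~ $_ ? 1 : 0 }
         (qr/\p{Gc: L}/, qr/\p{General_Category: L}/);

# Because \p{Greek} exists, the "Is" spellings are valid too.
my @is = map { $alpha =~ $_ ? 1 : 0 }
         (qr/\p{Greek}/, qr/\p{Is_Greek}/, qr/\p{IsGreek}/);

print "gc: @gc\n";   # prints: gc: 1 1
print "is: @is\n";   # prints: is: 1 1 1
```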

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary. Those that are not binary have all their values spelled out without using this wild card, and have a NOT clause in their description that highlights their not being binary. They also require the compound form to match them, whereas true binary properties have both single and compound forms available.
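As a quick sketch of the 'Y*' and 'N*' notation, the binary property Alphabetic accepts all of the following spellings. The letter "a" and the digit "1" serve as arbitrary test characters.

```perl
use strict;
use warnings;

# Every "yes" spelling matches an alphabetic character.
my @yes = map { "a" =~ $_ ? 1 : 0 } (
    qr/\p{Alpha}/,
    qr/\p{Alphabetic=Y}/,
    qr/\p{Alphabetic=Yes}/,
    qr/\p{Alpha=T}/,
    qr/\p{Alpha=True}/,
);

# Every "no" spelling matches a non-alphabetic character.
my @no = map { "1" =~ $_ ? 1 : 0 } (
    qr/\P{Alpha}/,
    qr/\p{Alphabetic=N}/,
    qr/\p{Alphabetic=No}/,
    qr/\p{Alpha=F}/,
    qr/\p{Alpha=False}/,
);

print "yes: @yes\n";   # prints: yes: 1 1 1 1 1
print "no:  @no\n";    # prints: no:  1 1 1 1 1
```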

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card
  • (\d+) in the info column gives the number of code points matched by this property.
  • D means this is deprecated.
  • O means this is obsolete.
  • S means this is stabilized.
  • T means tighter (stricter) name matching applies.
  • X means use of this form is discouraged, and may not be stable.

    NAME  INFO
  X \p{Aegean_Numbers}  \p{Block=Aegean_Numbers} (64)
  T \p{Age: 1.1}  \p{Age=V1_1} (33_979)
  T \p{Age: 2.0}  \p{Age=V2_0} (144_521)
  T \p{Age: 2.1}  \p{Age=V2_1} (2)
  T \p{Age: 3.0}  \p{Age=V3_0} (10_307)
  T \p{Age: 3.1}  \p{Age=V3_1} (44_978)
  T \p{Age: 3.2}  \p{Age=V3_2} (1016)
  T \p{Age: 4.0}  \p{Age=V4_0} (1226)
  T \p{Age: 4.1}  \p{Age=V4_1} (1273)
  T \p{Age: 5.0}  \p{Age=V5_0} (1369)
  T \p{Age: 5.1}  \p{Age=V5_1} (1624)
  T \p{Age: 5.2}  \p{Age=V5_2} (6648)
  T \p{Age: 6.0}  \p{Age=V6_0} (2088)
  T \p{Age: 6.1}  \p{Age=V6_1} (732)
  T \p{Age: 6.2}  \p{Age=V6_2} (1)
    \p{Age: NA}  \p{Age=Unassigned} (864_348)
    \p{Age: Unassigned}  Code point's usage has not been assigned
        in any Unicode release thus far. (Short:
        \p{Age=NA}) (864_348)
    \p{Age: V1_1}  Code point's usage introduced in
        version 1.1 (33_979)
    \p{Age: V2_0}  Code point's usage was introduced in
        version 2.0; See also Property
        'Present_In' (144_521)
    \p{Age: V2_1}  Code point's usage was introduced in
        version 2.1; See also Property
        'Present_In' (2)
    \p{Age: V3_0}  Code point's usage was introduced in
        version 3.0; See also Property
        'Present_In' (10_307)
    \p{Age: V3_1}  Code point's usage was introduced in
        version 3.1; See also Property
        'Present_In' (44_978)
    \p{Age: V3_2}  Code point's usage was introduced in
        version 3.2; See also Property
        'Present_In' (1016)
    \p{Age: V4_0}  Code point's usage was introduced in
        version 4.0; See also Property
        'Present_In' (1226)
    \p{Age: V4_1}  Code point's usage was introduced in
        version 4.1; See also Property
        'Present_In' (1273)
    \p{Age: V5_0}  Code point's usage was introduced in
        version 5.0; See also Property
        'Present_In' (1369)
    \p{Age: V5_1}  Code point's usage was introduced in
        version 5.1; See also Property
        'Present_In' (1624)
    \p{Age: V5_2}  Code point's usage was introduced in
        version 5.2; See also Property
        'Present_In' (6648)
    \p{Age: V6_0}  Code point's usage was introduced in
        version 6.0; See also Property
        'Present_In' (2088)
    \p{Age: V6_1}  Code point's usage was introduced in
        version 6.1; See also Property
        'Present_In' (732)
    \p{Age: V6_2}  Code point's usage was introduced in
        version 6.2; See also Property
        'Present_In' (1)
    \p{AHex}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
        (22)
    \p{AHex: *}  \p{ASCII_Hex_Digit: *}
  X \p{Alchemical}  \p{Alchemical_Symbols} (= \p{Block=
        Alchemical_Symbols}) (128)
  X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
        \p{InAlchemical}) (128)
    \p{All}  \p{Any} (1_114_112)
    \p{Alnum}  Alphabetic and (decimal) Numeric (102_619)
    \p{Alpha}  \p{Alphabetic=Y} (102_159)
    \p{Alpha: *}  \p{Alphabetic: *}
    \p{Alphabetic}  \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
    \p{Alphabetic: N*}  (Short: \p{Alpha=N}, \P{Alpha}) (1_011_953)
    \p{Alphabetic: Y*}  (Short: \p{Alpha=Y}, \p{Alpha}) (102_159)
  X \p{Alphabetic_PF}  \p{Alphabetic_Presentation_Forms} (=
        \p{Block=Alphabetic_Presentation_Forms})
        (80)
  X \p{Alphabetic_Presentation_Forms}  \p{Block=
        Alphabetic_Presentation_Forms} (Short:
        \p{InAlphabeticPF}) (80)
  X \p{Ancient_Greek_Music}  \p{Ancient_Greek_Musical_Notation} (=
        \p{Block=
        Ancient_Greek_Musical_Notation}) (80)
  X \p{Ancient_Greek_Musical_Notation}  \p{Block=
        Ancient_Greek_Musical_Notation} (Short:
        \p{InAncientGreekMusic}) (80)
  X \p{Ancient_Greek_Numbers}  \p{Block=Ancient_Greek_Numbers} (80)
  X \p{Ancient_Symbols}  \p{Block=Ancient_Symbols} (64)
    \p{Any}  [\x{0000}-\x{10FFFF}] (1_114_112)
    \p{Arab}  \p{Arabic} (= \p{Script=Arabic}) (NOT
        \p{Block=Arabic}) (1235)
    \p{Arabic}  \p{Script=Arabic} (Short: \p{Arab}; NOT
        \p{Block=Arabic}) (1235)
  X \p{Arabic_Ext_A}  \p{Arabic_Extended_A} (= \p{Block=
        Arabic_Extended_A}) (96)
  X \p{Arabic_Extended_A}  \p{Block=Arabic_Extended_A} (Short:
        \p{InArabicExtA}) (96)
  X \p{Arabic_Math}  \p{Arabic_Mathematical_Alphabetic_Symbols}
        (= \p{Block=
        Arabic_Mathematical_Alphabetic_Symbols})
        (256)
  X \p{Arabic_Mathematical_Alphabetic_Symbols}  \p{Block=
        Arabic_Mathematical_Alphabetic_Symbols}
        (Short: \p{InArabicMath}) (256)
  X \p{Arabic_PF_A}  \p{Arabic_Presentation_Forms_A} (=
        \p{Block=Arabic_Presentation_Forms_A})
        (688)
  X \p{Arabic_PF_B}  \p{Arabic_Presentation_Forms_B} (=
        \p{Block=Arabic_Presentation_Forms_B})
        (144)
  X \p{Arabic_Presentation_Forms_A}  \p{Block=
        Arabic_Presentation_Forms_A} (Short:
        \p{InArabicPFA}) (688)
  X \p{Arabic_Presentation_Forms_B}  \p{Block=
        Arabic_Presentation_Forms_B} (Short:
        \p{InArabicPFB}) (144)
  X \p{Arabic_Sup}  \p{Arabic_Supplement} (= \p{Block=
        Arabic_Supplement}) (48)
  X \p{Arabic_Supplement}  \p{Block=Arabic_Supplement} (Short:
        \p{InArabicSup}) (48)
    \p{Armenian}  \p{Script=Armenian} (Short: \p{Armn}; NOT
        \p{Block=Armenian}) (91)
    \p{Armi}  \p{Imperial_Aramaic} (= \p{Script=
        Imperial_Aramaic}) (NOT \p{Block=
        Imperial_Aramaic}) (31)
    \p{Armn}  \p{Armenian} (= \p{Script=Armenian}) (NOT
        \p{Block=Armenian}) (91)
  X \p{Arrows}  \p{Block=Arrows} (112)
    \p{ASCII}  \p{Block=Basic_Latin} [[:ASCII:]] (128)
    \p{ASCII_Hex_Digit}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
        (22)
    \p{ASCII_Hex_Digit: N*}  (Short: \p{AHex=N}, \P{AHex}) (1_114_090)
    \p{ASCII_Hex_Digit: Y*}  (Short: \p{AHex=Y}, \p{AHex}) (22)
    \p{Assigned}  All assigned code points (249_698)
    \p{Avestan}  \p{Script=Avestan} (Short: \p{Avst}; NOT
        \p{Block=Avestan}) (61)
    \p{Avst}  \p{Avestan} (= \p{Script=Avestan}) (NOT
        \p{Block=Avestan}) (61)
    \p{Bali}  \p{Balinese} (= \p{Script=Balinese}) (NOT
        \p{Block=Balinese}) (121)
    \p{Balinese}  \p{Script=Balinese} (Short: \p{Bali}; NOT
        \p{Block=Balinese}) (121)
    \p{Bamu}  \p{Bamum} (= \p{Script=Bamum}) (NOT
        \p{Block=Bamum}) (657)
    \p{Bamum}  \p{Script=Bamum} (Short: \p{Bamu}; NOT
        \p{Block=Bamum}) (657)
  X \p{Bamum_Sup}  \p{Bamum_Supplement} (= \p{Block=
        Bamum_Supplement}) (576)
  X \p{Bamum_Supplement}  \p{Block=Bamum_Supplement} (Short:
        \p{InBamumSup}) (576)
  X \p{Basic_Latin}  \p{ASCII} (= \p{Block=Basic_Latin}) (128)
    \p{Batak}  \p{Script=Batak} (Short: \p{Batk}; NOT
        \p{Block=Batak}) (56)
    \p{Batk}  \p{Batak} (= \p{Script=Batak}) (NOT
        \p{Block=Batak}) (56)
    \p{Bc: *}  \p{Bidi_Class: *}
    \p{Beng}  \p{Bengali} (= \p{Script=Bengali}) (NOT
        \p{Block=Bengali}) (92)
    \p{Bengali}  \p{Script=Bengali} (Short: \p{Beng}; NOT
        \p{Block=Bengali}) (92)
    \p{Bidi_C}  \p{Bidi_Control} (= \p{Bidi_Control=Y}) (7)
    \p{Bidi_C: *}  \p{Bidi_Control: *}
    \p{Bidi_Class: AL}  \p{Bidi_Class=Arabic_Letter} (1438)
    \p{Bidi_Class: AN}  \p{Bidi_Class=Arabic_Number} (49)
    \p{Bidi_Class: Arabic_Letter}  (Short: \p{Bc=AL}) (1438)
    \p{Bidi_Class: Arabic_Number}  (Short: \p{Bc=AN}) (49)
    \p{Bidi_Class: B}  \p{Bidi_Class=Paragraph_Separator} (7)
    \p{Bidi_Class: BN}  \p{Bidi_Class=Boundary_Neutral} (4015)
    \p{Bidi_Class: Boundary_Neutral}  (Short: \p{Bc=BN}) (4015)
    \p{Bidi_Class: Common_Separator}  (Short: \p{Bc=CS}) (15)
    \p{Bidi_Class: CS}  \p{Bidi_Class=Common_Separator} (15)
    \p{Bidi_Class: EN}  \p{Bidi_Class=European_Number} (131)
    \p{Bidi_Class: ES}  \p{Bidi_Class=European_Separator} (12)
    \p{Bidi_Class: ET}  \p{Bidi_Class=European_Terminator} (66)
    \p{Bidi_Class: European_Number}  (Short: \p{Bc=EN}) (131)
    \p{Bidi_Class: European_Separator}  (Short: \p{Bc=ES}) (12)
    \p{Bidi_Class: European_Terminator}  (Short: \p{Bc=ET}) (66)
    \p{Bidi_Class: L}  \p{Bidi_Class=Left_To_Right} (1_098_530)
    \p{Bidi_Class: Left_To_Right}  (Short: \p{Bc=L}) (1_098_530)
    \p{Bidi_Class: Left_To_Right_Embedding}  (Short: \p{Bc=LRE}) (1)
    \p{Bidi_Class: Left_To_Right_Override}  (Short: \p{Bc=LRO}) (1)
    \p{Bidi_Class: LRE}  \p{Bidi_Class=Left_To_Right_Embedding} (1)
    \p{Bidi_Class: LRO}  \p{Bidi_Class=Left_To_Right_Override} (1)
    \p{Bidi_Class: Nonspacing_Mark}  (Short: \p{Bc=NSM}) (1290)
    \p{Bidi_Class: NSM}  \p{Bidi_Class=Nonspacing_Mark} (1290)
    \p{Bidi_Class: ON}  \p{Bidi_Class=Other_Neutral} (4447)
    \p{Bidi_Class: Other_Neutral}  (Short: \p{Bc=ON}) (4447)
    \p{Bidi_Class: Paragraph_Separator}  (Short: \p{Bc=B}) (7)
    \p{Bidi_Class: PDF}  \p{Bidi_Class=Pop_Directional_Format} (1)
    \p{Bidi_Class: Pop_Directional_Format}  (Short: \p{Bc=PDF}) (1)
    \p{Bidi_Class: R}  \p{Bidi_Class=Right_To_Left} (4086)
    \p{Bidi_Class: Right_To_Left}  (Short: \p{Bc=R}) (4086)
    \p{Bidi_Class: Right_To_Left_Embedding}  (Short: \p{Bc=RLE}) (1)
    \p{Bidi_Class: Right_To_Left_Override}  (Short: \p{Bc=RLO}) (1)
    \p{Bidi_Class: RLE}  \p{Bidi_Class=Right_To_Left_Embedding} (1)
    \p{Bidi_Class: RLO}  \p{Bidi_Class=Right_To_Left_Override} (1)
    \p{Bidi_Class: S}  \p{Bidi_Class=Segment_Separator} (3)
    \p{Bidi_Class: Segment_Separator}  (Short: \p{Bc=S}) (3)
    \p{Bidi_Class: White_Space}  (Short: \p{Bc=WS}) (18)
    \p{Bidi_Class: WS}  \p{Bidi_Class=White_Space} (18)
    \p{Bidi_Control}  \p{Bidi_Control=Y} (Short: \p{BidiC}) (7)
    \p{Bidi_Control: N*}  (Short: \p{BidiC=N}, \P{BidiC}) (1_114_105)
    \p{Bidi_Control: Y*}  (Short: \p{BidiC=Y}, \p{BidiC}) (7)
    \p{Bidi_M}  \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
        (545)
    \p{Bidi_M: *}  \p{Bidi_Mirrored: *}
    \p{Bidi_Mirrored}  \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
        (545)
    \p{Bidi_Mirrored: N*}  (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567)
    \p{Bidi_Mirrored: Y*}  (Short: \p{BidiM=Y}, \p{BidiM}) (545)
    \p{Blank}  \h, Horizontal white space (19)
    \p{Blk: *}  \p{Block: *}
    \p{Block: Aegean_Numbers}  (Single: \p{InAegeanNumbers}) (64)
    \p{Block: Alchemical}  \p{Block=Alchemical_Symbols} (128)
    \p{Block: Alchemical_Symbols}  (Short: \p{Blk=Alchemical},
        \p{InAlchemical}) (128)
    \p{Block: Alphabetic_PF}  \p{Block=Alphabetic_Presentation_Forms}
        (80)
    \p{Block: Alphabetic_Presentation_Forms}  (Short: \p{Blk=
        AlphabeticPF}, \p{InAlphabeticPF}) (80)
    \p{Block: Ancient_Greek_Music}  \p{Block=
        Ancient_Greek_Musical_Notation} (80)
    \p{Block: Ancient_Greek_Musical_Notation}  (Short: \p{Blk=
        AncientGreekMusic},
        \p{InAncientGreekMusic}) (80)
    \p{Block: Ancient_Greek_Numbers}  (Single:
        \p{InAncientGreekNumbers}) (80)
    \p{Block: Ancient_Symbols}  (Single: \p{InAncientSymbols}) (64)
    \p{Block: Arabic}  (Single: \p{InArabic}; NOT \p{Arabic} NOR
        \p{Is_Arabic}) (256)
    \p{Block: Arabic_Ext_A}  \p{Block=Arabic_Extended_A} (96)
    \p{Block: Arabic_Extended_A}  (Short: \p{Blk=ArabicExtA},
        \p{InArabicExtA}) (96)
    \p{Block: Arabic_Math}  \p{Block=
        Arabic_Mathematical_Alphabetic_Symbols}
        (256)
    \p{Block: Arabic_Mathematical_Alphabetic_Symbols}  (Short: \p{Blk=
        ArabicMath}, \p{InArabicMath}) (256)
    \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
    \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
    \p{Block: Arabic_Presentation_Forms_A}  (Short: \p{Blk=ArabicPFA},
        \p{InArabicPFA}) (688)
    \p{Block: Arabic_Presentation_Forms_B}  (Short: \p{Blk=ArabicPFB},
        \p{InArabicPFB}) (144)
    \p{Block: Arabic_Sup}  \p{Block=Arabic_Supplement} (48)
    \p{Block: Arabic_Supplement}  (Short: \p{Blk=ArabicSup},
        \p{InArabicSup}) (48)
    \p{Block: Armenian}  (Single: \p{InArmenian}; NOT \p{Armenian}
        NOR \p{Is_Armenian}) (96)
    \p{Block: Arrows}  (Single: \p{InArrows}) (112)
    \p{Block: ASCII}  \p{Block=Basic_Latin} (128)
    \p{Block: Avestan}  (Single: \p{InAvestan}; NOT \p{Avestan}
        NOR \p{Is_Avestan}) (64)
    \p{Block: Balinese}  (Single: \p{InBalinese}; NOT \p{Balinese}
        NOR \p{Is_Balinese}) (128)
    \p{Block: Bamum}  (Single: \p{InBamum}; NOT \p{Bamum} NOR
        \p{Is_Bamum}) (96)
    \p{Block: Bamum_Sup}  \p{Block=Bamum_Supplement} (576)
    \p{Block: Bamum_Supplement}  (Short: \p{Blk=BamumSup},
        \p{InBamumSup}) (576)
    \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
    \p{Block: Batak}  (Single: \p{InBatak}; NOT \p{Batak} NOR
        \p{Is_Batak}) (64)
    \p{Block: Bengali}  (Single: \p{InBengali}; NOT \p{Bengali}
        NOR \p{Is_Bengali}) (128)
    \p{Block: Block_Elements}  (Single: \p{InBlockElements}) (32)
    \p{Block: Bopomofo}  (Single: \p{InBopomofo}; NOT \p{Bopomofo}
        NOR \p{Is_Bopomofo}) (48)
    \p{Block: Bopomofo_Ext}  \p{Block=Bopomofo_Extended} (32)
    \p{Block: Bopomofo_Extended}  (Short: \p{Blk=BopomofoExt},
        \p{InBopomofoExt}) (32)
    \p{Block: Box_Drawing}  (Single: \p{InBoxDrawing}) (128)
    \p{Block: Brahmi}  (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR
        \p{Is_Brahmi}) (128)
    \p{Block: Braille}  \p{Block=Braille_Patterns} (256)
    \p{Block: Braille_Patterns}  (Short: \p{Blk=Braille},
        \p{InBraille}) (256)
    \p{Block: Buginese}  (Single: \p{InBuginese}; NOT \p{Buginese}
        NOR \p{Is_Buginese}) (32)
    \p{Block: Buhid}  (Single: \p{InBuhid}; NOT \p{Buhid} NOR
        \p{Is_Buhid}) (32)
    \p{Block: Byzantine_Music}  \p{Block=Byzantine_Musical_Symbols}
        (256)
    \p{Block: Byzantine_Musical_Symbols}  (Short: \p{Blk=
        ByzantineMusic}, \p{InByzantineMusic})
        (256)
    \p{Block: Canadian_Syllabics}  \p{Block=
        Unified_Canadian_Aboriginal_Syllabics}
        (640)
    \p{Block: Carian}  (Single: \p{InCarian}; NOT \p{Carian} NOR
        \p{Is_Carian}) (64)
    \p{Block: Chakma}  (Single: \p{InChakma}; NOT \p{Chakma} NOR
        \p{Is_Chakma}) (80)
    \p{Block: Cham}  (Single: \p{InCham}; NOT \p{Cham} NOR
        \p{Is_Cham}) (96)
    \p{Block: Cherokee}  (Single: \p{InCherokee}; NOT \p{Cherokee}
        NOR \p{Is_Cherokee}) (96)
    \p{Block: CJK}  \p{Block=CJK_Unified_Ideographs} (20_992)
    \p{Block: CJK_Compat}  \p{Block=CJK_Compatibility} (256)
    \p{Block: CJK_Compat_Forms}  \p{Block=CJK_Compatibility_Forms} (32)
    \p{Block: CJK_Compat_Ideographs}  \p{Block=
        CJK_Compatibility_Ideographs} (512)
    \p{Block: CJK_Compat_Ideographs_Sup}  \p{Block=
        CJK_Compatibility_Ideographs_Supplement}
        (544)
    \p{Block: CJK_Compatibility}  (Short: \p{Blk=CJKCompat},
        \p{InCJKCompat}) (256)
    \p{Block: CJK_Compatibility_Forms}  (Short: \p{Blk=CJKCompatForms},
        \p{InCJKCompatForms}) (32)
    \p{Block: CJK_Compatibility_Ideographs}  (Short: \p{Blk=
        CJKCompatIdeographs},
        \p{InCJKCompatIdeographs}) (512)
    \p{Block: CJK_Compatibility_Ideographs_Supplement}  (Short: \p{Blk=
        CJKCompatIdeographsSup},
        \p{InCJKCompatIdeographsSup}) (544)
    \p{Block: CJK_Ext_A}  \p{Block=
        CJK_Unified_Ideographs_Extension_A}
        (6592)
    \p{Block: CJK_Ext_B}  \p{Block=
        CJK_Unified_Ideographs_Extension_B}
        (42_720)
    \p{Block: CJK_Ext_C}  \p{Block=
        CJK_Unified_Ideographs_Extension_C}
        (4160)
    \p{Block: CJK_Ext_D}  \p{Block=
        CJK_Unified_Ideographs_Extension_D} (224)
    \p{Block: CJK_Radicals_Sup}  \p{Block=CJK_Radicals_Supplement} (128)
    \p{Block: CJK_Radicals_Supplement}  (Short: \p{Blk=CJKRadicalsSup},
        \p{InCJKRadicalsSup}) (128)
    \p{Block: CJK_Strokes}  (Single: \p{InCJKStrokes}) (48)
    \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
    \p{Block: CJK_Symbols_And_Punctuation}  (Short: \p{Blk=CJKSymbols},
        \p{InCJKSymbols}) (64)
    \p{Block: CJK_Unified_Ideographs}  (Short: \p{Blk=CJK}, \p{InCJK})
        (20_992)
    \p{Block: CJK_Unified_Ideographs_Extension_A}  (Short: \p{Blk=
        CJKExtA}, \p{InCJKExtA}) (6592)
    \p{Block: CJK_Unified_Ideographs_Extension_B}  (Short: \p{Blk=
        CJKExtB}, \p{InCJKExtB}) (42_720)
    \p{Block: CJK_Unified_Ideographs_Extension_C}  (Short: \p{Blk=
        CJKExtC}, \p{InCJKExtC}) (4160)
    \p{Block: CJK_Unified_Ideographs_Extension_D}  (Short: \p{Blk=
        CJKExtD}, \p{InCJKExtD}) (224)
    \p{Block: Combining_Diacritical_Marks}  (Short: \p{Blk=
        Diacriticals}, \p{InDiacriticals}) (112)
    \p{Block: Combining_Diacritical_Marks_For_Symbols}  (Short: \p{Blk=
        DiacriticalsForSymbols},
        \p{InDiacriticalsForSymbols}) (48)
    \p{Block: Combining_Diacritical_Marks_Supplement}  (Short: \p{Blk=
        DiacriticalsSup}, \p{InDiacriticalsSup})
        (64)
    \p{Block: Combining_Half_Marks}  (Short: \p{Blk=HalfMarks},
        \p{InHalfMarks}) (16)
    \p{Block: Combining_Marks_For_Symbols}  \p{Block=
        Combining_Diacritical_Marks_For_Symbols}
        (48)
    \p{Block: Common_Indic_Number_Forms}  (Short: \p{Blk=
        IndicNumberForms},
        \p{InIndicNumberForms}) (16)
    \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
    \p{Block: Control_Pictures}  (Single: \p{InControlPictures}) (64)
    \p{Block: Coptic}  (Single: \p{InCoptic}; NOT \p{Coptic} NOR
        \p{Is_Coptic}) (128)
    \p{Block: Counting_Rod}  \p{Block=Counting_Rod_Numerals} (32)
    \p{Block: Counting_Rod_Numerals}  (Short: \p{Blk=CountingRod},
        \p{InCountingRod}) (32)
    \p{Block: Cuneiform}  (Single: \p{InCuneiform}; NOT
        \p{Cuneiform} NOR \p{Is_Cuneiform})
        (1024)
    \p{Block: Cuneiform_Numbers}  \p{Block=
        Cuneiform_Numbers_And_Punctuation} (128)
    \p{Block: Cuneiform_Numbers_And_Punctuation}  (Short: \p{Blk=
        CuneiformNumbers},
        \p{InCuneiformNumbers}) (128)
    \p{Block: Currency_Symbols}  (Single: \p{InCurrencySymbols}) (48)
    \p{Block: Cypriot_Syllabary}  (Single: \p{InCypriotSyllabary}) (64)
    \p{Block: Cyrillic}  (Single: \p{InCyrillic}; NOT \p{Cyrillic}
        NOR \p{Is_Cyrillic}) (256)
    \p{Block: Cyrillic_Ext_A}  \p{Block=Cyrillic_Extended_A} (32)
    \p{Block: Cyrillic_Ext_B}  \p{Block=Cyrillic_Extended_B} (96)
    \p{Block: Cyrillic_Extended_A}  (Short: \p{Blk=CyrillicExtA},
        \p{InCyrillicExtA}) (32)
    \p{Block: Cyrillic_Extended_B}  (Short: \p{Blk=CyrillicExtB},
        \p{InCyrillicExtB}) (96)
    \p{Block: Cyrillic_Sup}  \p{Block=Cyrillic_Supplement} (48)
    \p{Block: Cyrillic_Supplement}  (Short: \p{Blk=CyrillicSup},
        \p{InCyrillicSup}) (48)
    \p{Block: Cyrillic_Supplementary}  \p{Block=Cyrillic_Supplement}
        (48)
    \p{Block: Deseret}  (Single: \p{InDeseret}) (80)
    \p{Block: Devanagari}  (Single: \p{InDevanagari}; NOT
        \p{Devanagari} NOR \p{Is_Devanagari})
        (128)
    \p{Block: Devanagari_Ext}  \p{Block=Devanagari_Extended} (32)
    \p{Block: Devanagari_Extended}  (Short: \p{Blk=DevanagariExt},
        \p{InDevanagariExt}) (32)
    \p{Block: Diacriticals}  \p{Block=Combining_Diacritical_Marks} (112)
    \p{Block: Diacriticals_For_Symbols}  \p{Block=
        Combining_Diacritical_Marks_For_Symbols}
        (48)
    \p{Block: Diacriticals_Sup}  \p{Block=
        Combining_Diacritical_Marks_Supplement}
        (64)
    \p{Block: Dingbats}  (Single: \p{InDingbats}) (192)
    \p{Block: Domino}  \p{Block=Domino_Tiles} (112)
    \p{Block: Domino_Tiles}  (Short: \p{Blk=Domino}, \p{InDomino}) (112)
    \p{Block: Egyptian_Hieroglyphs}  (Single:
        \p{InEgyptianHieroglyphs}; NOT
        \p{Egyptian_Hieroglyphs} NOR
        \p{Is_Egyptian_Hieroglyphs}) (1072)
    \p{Block: Emoticons}  (Single: \p{InEmoticons}) (80)
    \p{Block: Enclosed_Alphanum}  \p{Block=Enclosed_Alphanumerics} (160)
    \p{Block: Enclosed_Alphanum_Sup}  \p{Block=
        Enclosed_Alphanumeric_Supplement} (256)
    \p{Block: Enclosed_Alphanumeric_Supplement}  (Short: \p{Blk=
        EnclosedAlphanumSup},
        \p{InEnclosedAlphanumSup}) (256)
    \p{Block: Enclosed_Alphanumerics}  (Short: \p{Blk=
        EnclosedAlphanum},
        \p{InEnclosedAlphanum}) (160)
    \p{Block: Enclosed_CJK}  \p{Block=Enclosed_CJK_Letters_And_Months}
        (256)
    \p{Block: Enclosed_CJK_Letters_And_Months}  (Short: \p{Blk=
        EnclosedCJK}, \p{InEnclosedCJK}) (256)
    \p{Block: Enclosed_Ideographic_Sup}  \p{Block=
        Enclosed_Ideographic_Supplement} (256)
    \p{Block: Enclosed_Ideographic_Supplement}  (Short: \p{Blk=
        EnclosedIdeographicSup},
        \p{InEnclosedIdeographicSup}) (256)
    \p{Block: Ethiopic}  (Single: \p{InEthiopic}; NOT \p{Ethiopic}
        NOR \p{Is_Ethiopic}) (384)
    \p{Block: Ethiopic_Ext}  \p{Block=Ethiopic_Extended} (96)
    \p{Block: Ethiopic_Ext_A}  \p{Block=Ethiopic_Extended_A} (48)
    \p{Block: Ethiopic_Extended}  (Short: \p{Blk=EthiopicExt},
        \p{InEthiopicExt}) (96)
    \p{Block: Ethiopic_Extended_A}  (Short: \p{Blk=EthiopicExtA},
        \p{InEthiopicExtA}) (48)
    \p{Block: Ethiopic_Sup}  \p{Block=Ethiopic_Supplement} (32)
    \p{Block: Ethiopic_Supplement}  (Short: \p{Blk=EthiopicSup},
        \p{InEthiopicSup}) (32)
    \p{Block: General_Punctuation}  (Short: \p{Blk=Punctuation},
        \p{InPunctuation}; NOT \p{Punct} NOR
        \p{Is_Punctuation}) (112)
    \p{Block: Geometric_Shapes}  (Single: \p{InGeometricShapes}) (96)
    \p{Block: Georgian}  (Single: \p{InGeorgian}; NOT \p{Georgian}
        NOR \p{Is_Georgian}) (96)
    \p{Block: Georgian_Sup}  \p{Block=Georgian_Supplement} (48)
    \p{Block: Georgian_Supplement}  (Short: \p{Blk=GeorgianSup},
        \p{InGeorgianSup}) (48)
    \p{Block: Glagolitic}  (Single: \p{InGlagolitic}; NOT
        \p{Glagolitic} NOR \p{Is_Glagolitic})
        (96)
    \p{Block: Gothic}  (Single: \p{InGothic}; NOT \p{Gothic} NOR
        \p{Is_Gothic}) (32)
    \p{Block: Greek}  \p{Block=Greek_And_Coptic} (NOT \p{Greek}
        NOR \p{Is_Greek}) (144)
    \p{Block: Greek_And_Coptic}  (Short: \p{Blk=Greek}, \p{InGreek};
        NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_Ext}  \p{Block=Greek_Extended} (256)
    \p{Block: Greek_Extended}  (Short: \p{Blk=GreekExt},
        \p{InGreekExt}) (256)
    \p{Block: Gujarati}  (Single: \p{InGujarati}; NOT \p{Gujarati}
        NOR \p{Is_Gujarati}) (128)
    \p{Block: Gurmukhi}  (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
        NOR \p{Is_Gurmukhi}) (128)
    \p{Block: Half_And_Full_Forms}  \p{Block=
        Halfwidth_And_Fullwidth_Forms} (240)
    \p{Block: Half_Marks}  \p{Block=Combining_Half_Marks} (16)
    \p{Block: Halfwidth_And_Fullwidth_Forms}  (Short: \p{Blk=
        HalfAndFullForms},
        \p{InHalfAndFullForms}) (240)
    \p{Block: Hangul}  \p{Block=Hangul_Syllables} (NOT \p{Hangul}
        NOR \p{Is_Hangul}) (11_184)
    \p{Block: Hangul_Compatibility_Jamo}  (Short: \p{Blk=CompatJamo},
        \p{InCompatJamo}) (96)
    \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}, \p{InJamo}) (256)
    \p{Block: Hangul_Jamo_Extended_A}  (Short: \p{Blk=JamoExtA},
        \p{InJamoExtA}) (32)
    \p{Block: Hangul_Jamo_Extended_B}  (Short: \p{Blk=JamoExtB},
        \p{InJamoExtB}) (80)
    \p{Block: Hangul_Syllables}  (Short: \p{Blk=Hangul}, \p{InHangul};
        NOT \p{Hangul} NOR \p{Is_Hangul})
        (11_184)
    \p{Block: Hanunoo}  (Single: \p{InHanunoo}; NOT \p{Hanunoo}
        NOR \p{Is_Hanunoo}) (32)
    \p{Block: Hebrew}  (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
        \p{Is_Hebrew}) (112)
    \p{Block: High_Private_Use_Surrogates}  (Short: \p{Blk=
  490. HighPUSurrogates},
  491. \p{InHighPUSurrogates}) (128)
  492. \p{Block: High_PU_Surrogates} \p{Block=
  493. High_Private_Use_Surrogates} (128)
  494. \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
  495. \p{Block: Hiragana} (Single: \p{InHiragana}; NOT \p{Hiragana}
  496. NOR \p{Is_Hiragana}) (96)
  497. \p{Block: IDC} \p{Block=
  498. Ideographic_Description_Characters} (NOT
  499. \p{ID_Continue} NOR \p{Is_IDC}) (16)
  500. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC},
  501. \p{InIDC}; NOT \p{ID_Continue} NOR
  502. \p{Is_IDC}) (16)
  503. \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
  504. \p{Imperial_Aramaic} NOR
  505. \p{Is_Imperial_Aramaic}) (32)
  506. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  507. (16)
  \p{Block: Inscriptional_Pahlavi} (Single:
      \p{InInscriptionalPahlavi}; NOT
      \p{Inscriptional_Pahlavi} NOR
      \p{Is_Inscriptional_Pahlavi}) (32)
  \p{Block: Inscriptional_Parthian} (Single:
      \p{InInscriptionalParthian}; NOT
      \p{Inscriptional_Parthian} NOR
      \p{Is_Inscriptional_Parthian}) (32)
  \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}, \p{InIPAExt})
      (96)
  \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  \p{Block: Javanese} (Single: \p{InJavanese}; NOT \p{Javanese}
      NOR \p{Is_Javanese}) (96)
  \p{Block: Kaithi} (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
      \p{Is_Kaithi}) (80)
  \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}, \p{InKanaSup})
      (256)
  \p{Block: Kanbun} (Single: \p{InKanbun}) (16)
  \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}, \p{InKangxi})
      (224)
  \p{Block: Kannada} (Single: \p{InKannada}; NOT \p{Kannada}
      NOR \p{Is_Kannada}) (128)
  \p{Block: Katakana} (Single: \p{InKatakana}; NOT \p{Katakana}
      NOR \p{Is_Katakana}) (96)
  \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
      KatakanaExt}, \p{InKatakanaExt}) (16)
  \p{Block: Kayah_Li} (Single: \p{InKayahLi}) (48)
  \p{Block: Kharoshthi} (Single: \p{InKharoshthi}; NOT
      \p{Kharoshthi} NOR \p{Is_Kharoshthi})
      (96)
  \p{Block: Khmer} (Single: \p{InKhmer}; NOT \p{Khmer} NOR
      \p{Is_Khmer}) (128)
  \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
  \p{Block: Lao} (Single: \p{InLao}; NOT \p{Lao} NOR
      \p{Is_Lao}) (128)
  \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
      \p{InLatin1}) (128)
  \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  \p{Block: Latin_Ext_Additional} \p{Block=
      Latin_Extended_Additional} (256)
  \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA},
      \p{InLatinExtA}) (128)
  \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
      LatinExtAdditional},
      \p{InLatinExtAdditional}) (256)
  \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB},
      \p{InLatinExtB}) (208)
  \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC},
      \p{InLatinExtC}) (32)
  \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD},
      \p{InLatinExtD}) (224)
  \p{Block: Lepcha} (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
      \p{Is_Lepcha}) (80)
  \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
      (80)
  \p{Block: Limbu} (Single: \p{InLimbu}; NOT \p{Limbu} NOR
      \p{Is_Limbu}) (80)
  \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
      (128)
  \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
      (128)
  \p{Block: Lisu} (Single: \p{InLisu}) (48)
  \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
  \p{Block: Lycian} (Single: \p{InLycian}; NOT \p{Lycian} NOR
      \p{Is_Lycian}) (32)
  \p{Block: Lydian} (Single: \p{InLydian}; NOT \p{Lydian} NOR
      \p{Is_Lydian}) (32)
  \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}, \p{InMahjong})
      (48)
  \p{Block: Malayalam} (Single: \p{InMalayalam}; NOT
      \p{Malayalam} NOR \p{Is_Malayalam}) (128)
  \p{Block: Mandaic} (Single: \p{InMandaic}; NOT \p{Mandaic}
      NOR \p{Is_Mandaic}) (32)
  \p{Block: Math_Alphanum} \p{Block=
      Mathematical_Alphanumeric_Symbols} (1024)
  \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
      MathAlphanum}, \p{InMathAlphanum}) (1024)
  \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators},
      \p{InMathOperators}) (256)
  \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
      \p{Meetei_Mayek} NOR
      \p{Is_Meetei_Mayek}) (64)
  \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt},
      \p{InMeeteiMayekExt}) (32)
  \p{Block: Meroitic_Cursive} (Single: \p{InMeroiticCursive}; NOT
      \p{Meroitic_Cursive} NOR
      \p{Is_Meroitic_Cursive}) (96)
  \p{Block: Meroitic_Hieroglyphs} (Single:
      \p{InMeroiticHieroglyphs}) (32)
  \p{Block: Miao} (Single: \p{InMiao}; NOT \p{Miao} NOR
      \p{Is_Miao}) (160)
  \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
      (256)
  \p{Block: Misc_Math_Symbols_A} \p{Block=
      Miscellaneous_Mathematical_Symbols_A}
      (48)
  \p{Block: Misc_Math_Symbols_B} \p{Block=
      Miscellaneous_Mathematical_Symbols_B}
      (128)
  \p{Block: Misc_Pictographs} \p{Block=
      Miscellaneous_Symbols_And_Pictographs}
      (768)
  \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
      MiscMathSymbolsA},
      \p{InMiscMathSymbolsA}) (48)
  \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
      MiscMathSymbolsB},
      \p{InMiscMathSymbolsB}) (128)
  \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols},
      \p{InMiscSymbols}) (256)
  \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
      MiscArrows}, \p{InMiscArrows}) (256)
  \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
      MiscPictographs}, \p{InMiscPictographs})
      (768)
  \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical},
      \p{InMiscTechnical}) (256)
  \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  \p{Block: Modifier_Tone_Letters} (Single:
      \p{InModifierToneLetters}) (32)
  \p{Block: Mongolian} (Single: \p{InMongolian}; NOT
      \p{Mongolian} NOR \p{Is_Mongolian}) (176)
  \p{Block: Music} \p{Block=Musical_Symbols} (256)
  \p{Block: Musical_Symbols} (Short: \p{Blk=Music}, \p{InMusic})
      (256)
  \p{Block: Myanmar} (Single: \p{InMyanmar}; NOT \p{Myanmar}
      NOR \p{Is_Myanmar}) (160)
  \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA},
      \p{InMyanmarExtA}) (32)
  \p{Block: NB} \p{Block=No_Block} (860_672)
  \p{Block: New_Tai_Lue} (Single: \p{InNewTaiLue}; NOT
      \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
      (96)
  \p{Block: NKo} (Single: \p{InNKo}; NOT \p{Nko} NOR
      \p{Is_NKo}) (64)
  \p{Block: No_Block} (Short: \p{Blk=NB}, \p{InNB}) (860_672)
  \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
  \p{Block: OCR} \p{Block=Optical_Character_Recognition}
      (32)
  \p{Block: Ogham} (Single: \p{InOgham}; NOT \p{Ogham} NOR
      \p{Is_Ogham}) (32)
  \p{Block: Ol_Chiki} (Single: \p{InOlChiki}) (48)
  \p{Block: Old_Italic} (Single: \p{InOldItalic}; NOT
      \p{Old_Italic} NOR \p{Is_Old_Italic})
      (48)
  \p{Block: Old_Persian} (Single: \p{InOldPersian}; NOT
      \p{Old_Persian} NOR \p{Is_Old_Persian})
      (64)
  \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
  \p{Block: Old_Turkic} (Single: \p{InOldTurkic}; NOT
      \p{Old_Turkic} NOR \p{Is_Old_Turkic})
      (80)
  \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR},
      \p{InOCR}) (32)
  \p{Block: Oriya} (Single: \p{InOriya}; NOT \p{Oriya} NOR
      \p{Is_Oriya}) (128)
  \p{Block: Osmanya} (Single: \p{InOsmanya}; NOT \p{Osmanya}
      NOR \p{Is_Osmanya}) (48)
  \p{Block: Phags_Pa} (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
      NOR \p{Is_Phags_Pa}) (64)
  \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}, \p{InPhaistos})
      (48)
  \p{Block: Phoenician} (Single: \p{InPhoenician}; NOT
      \p{Phoenician} NOR \p{Is_Phoenician})
      (32)
  \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  \p{Block: Phonetic_Ext_Sup} \p{Block=
      Phonetic_Extensions_Supplement} (64)
  \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt},
      \p{InPhoneticExt}) (128)
  \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
      PhoneticExtSup}, \p{InPhoneticExtSup})
      (64)
  \p{Block: Playing_Cards} (Single: \p{InPlayingCards}) (96)
  \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
      \p{Private_Use} NOR \p{Is_Private_Use})
      (6400)
  \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}, \p{InPUA}; NOT
      \p{Private_Use} NOR \p{Is_Private_Use})
      (6400)
  \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
      \p{Private_Use} NOR \p{Is_Private_Use})
      (6400)
  \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
      \p{Punct} NOR \p{Is_Punctuation}) (112)
  \p{Block: Rejang} (Single: \p{InRejang}; NOT \p{Rejang} NOR
      \p{Is_Rejang}) (48)
  \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}, \p{InRumi})
      (32)
  \p{Block: Runic} (Single: \p{InRunic}; NOT \p{Runic} NOR
      \p{Is_Runic}) (96)
  \p{Block: Samaritan} (Single: \p{InSamaritan}; NOT
      \p{Samaritan} NOR \p{Is_Samaritan}) (64)
  \p{Block: Saurashtra} (Single: \p{InSaurashtra}; NOT
      \p{Saurashtra} NOR \p{Is_Saurashtra})
      (96)
  \p{Block: Sharada} (Single: \p{InSharada}; NOT \p{Sharada}
      NOR \p{Is_Sharada}) (96)
  \p{Block: Shavian} (Single: \p{InShavian}) (48)
  \p{Block: Sinhala} (Single: \p{InSinhala}; NOT \p{Sinhala}
      NOR \p{Is_Sinhala}) (128)
  \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms},
      \p{InSmallForms}) (32)
  \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  \p{Block: Sora_Sompeng} (Single: \p{InSoraSompeng}; NOT
      \p{Sora_Sompeng} NOR
      \p{Is_Sora_Sompeng}) (48)
  \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
      ModifierLetters}, \p{InModifierLetters})
      (80)
  \p{Block: Specials} (Single: \p{InSpecials}) (16)
  \p{Block: Sundanese} (Single: \p{InSundanese}; NOT
      \p{Sundanese} NOR \p{Is_Sundanese}) (64)
  \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup},
      \p{InSundaneseSup}) (16)
  \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  \p{Block: Sup_Math_Operators} \p{Block=
      Supplemental_Mathematical_Operators}
      (256)
  \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
      (65_536)
  \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
      (65_536)
  \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
      SuperAndSub}, \p{InSuperAndSub}) (48)
  \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA},
      \p{InSupArrowsA}) (16)
  \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB},
      \p{InSupArrowsB}) (128)
  \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
      SupMathOperators},
      \p{InSupMathOperators}) (256)
  \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
      SupPunctuation}, \p{InSupPunctuation})
      (128)
  \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
      SupPUAA}, \p{InSupPUAA}) (65_536)
  \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
      SupPUAB}, \p{InSupPUAB}) (65_536)
  \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
      \p{Syloti_Nagri} NOR
      \p{Is_Syloti_Nagri}) (48)
  \p{Block: Syriac} (Single: \p{InSyriac}; NOT \p{Syriac} NOR
      \p{Is_Syriac}) (80)
  \p{Block: Tagalog} (Single: \p{InTagalog}; NOT \p{Tagalog}
      NOR \p{Is_Tagalog}) (32)
  \p{Block: Tagbanwa} (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
      NOR \p{Is_Tagbanwa}) (32)
  \p{Block: Tags} (Single: \p{InTags}) (128)
  \p{Block: Tai_Le} (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
      \p{Is_Tai_Le}) (48)
  \p{Block: Tai_Tham} (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
      NOR \p{Is_Tai_Tham}) (144)
  \p{Block: Tai_Viet} (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
      NOR \p{Is_Tai_Viet}) (96)
  \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing},
      \p{InTaiXuanJing}) (96)
  \p{Block: Takri} (Single: \p{InTakri}; NOT \p{Takri} NOR
      \p{Is_Takri}) (80)
  \p{Block: Tamil} (Single: \p{InTamil}; NOT \p{Tamil} NOR
      \p{Is_Tamil}) (128)
  \p{Block: Telugu} (Single: \p{InTelugu}; NOT \p{Telugu} NOR
      \p{Is_Telugu}) (128)
  \p{Block: Thaana} (Single: \p{InThaana}; NOT \p{Thaana} NOR
      \p{Is_Thaana}) (64)
  \p{Block: Thai} (Single: \p{InThai}; NOT \p{Thai} NOR
      \p{Is_Thai}) (128)
  \p{Block: Tibetan} (Single: \p{InTibetan}; NOT \p{Tibetan}
      NOR \p{Is_Tibetan}) (256)
  \p{Block: Tifinagh} (Single: \p{InTifinagh}; NOT \p{Tifinagh}
      NOR \p{Is_Tifinagh}) (80)
  \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
      (128)
  \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
      TransportAndMap}, \p{InTransportAndMap})
      (128)
  \p{Block: UCAS} \p{Block=
      Unified_Canadian_Aboriginal_Syllabics}
      (640)
  \p{Block: UCAS_Ext} \p{Block=
      Unified_Canadian_Aboriginal_Syllabics_Extended} (80)
  \p{Block: Ugaritic} (Single: \p{InUgaritic}; NOT \p{Ugaritic}
      NOR \p{Is_Ugaritic}) (32)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
      UCAS}, \p{InUCAS}) (640)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
      \p{Blk=UCASExt}, \p{InUCASExt}) (80)
  \p{Block: Vai} (Single: \p{InVai}; NOT \p{Vai} NOR
      \p{Is_Vai}) (320)
  \p{Block: Variation_Selectors} (Short: \p{Blk=VS}, \p{InVS}; NOT
      \p{Variation_Selector} NOR \p{Is_VS})
      (16)
  \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup},
      \p{InVSSup}) (240)
  \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt},
      \p{InVedicExt}) (48)
  \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
  \p{Block: VS} \p{Block=Variation_Selectors} (NOT
      \p{Variation_Selector} NOR \p{Is_VS})
      (16)
  \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
      (240)
  \p{Block: Yi_Radicals} (Single: \p{InYiRadicals}) (64)
  \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
  \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing},
      \p{InYijing}) (64)
X \p{Block_Elements} \p{Block=Block_Elements} (32)
  \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
      \p{Block=Bopomofo}) (70)
  \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
      \p{Block=Bopomofo}) (70)
X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
      Bopomofo_Extended}) (32)
X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
      \p{InBopomofoExt}) (32)
X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
      \p{Block=Brahmi}) (108)
  \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
      \p{Block=Brahmi}) (108)
  \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
  \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
      \p{InBraille}) (256)
  \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
      \p{Block=Buginese}) (30)
  \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
      \p{Block=Buginese}) (30)
  \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
      \p{Block=Buhid}) (20)
  \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
      \p{Block=Buhid}) (20)
X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
      Byzantine_Musical_Symbols}) (256)
X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
      (Short: \p{InByzantineMusic}) (256)
  \p{C} \p{Other} (= \p{General_Category=Other})
      (1_004_134)
  \p{Cakm} \p{Chakma} (= \p{Script=Chakma}) (NOT
      \p{Block=Chakma}) (67)
  \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
      \p{Cans}) (710)
X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
      (= \p{Block=
      Unified_Canadian_Aboriginal_Syllabics})
      (640)
T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
      Not_Reordered} (1_113_459)
T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
      Overlay} (26)
T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
      Nukta} (13)
T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
      Kana_Voicing} (2)
T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
      Virama} (37)
T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
      CCC10} (1)
T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
      CCC11} (1)
T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
      CCC12} (1)
T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
      CCC13} (1)
T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
      CCC14} (1)
T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
      CCC15} (1)
T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
      CCC16} (1)
T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
      CCC17} (1)
T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
      CCC18} (2)
T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
      CCC19} (2)
T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
      CCC20} (1)
T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
      CCC21} (1)
T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
      CCC22} (1)
T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
      CCC23} (1)
T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
      CCC24} (1)
T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
      CCC25} (1)
T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
      CCC26} (1)
T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
      CCC27} (2)
T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
      CCC28} (2)
T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
      CCC29} (2)
T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
      CCC30} (2)
T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
      CCC31} (2)
T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
      CCC32} (2)
T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
      CCC33} (1)
T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
      CCC34} (1)
T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
      CCC35} (1)
T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
      CCC36} (1)
T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
      CCC84} (1)
T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
      CCC91} (1)
T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
      CCC103} (2)
T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
      CCC107} (4)
T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
      CCC118} (2)
T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
      CCC122} (4)
T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
      CCC129} (1)
T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
      CCC130} (6)
T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
      CCC132} (1)
T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
      CCC133} (0)
T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
      Attached_Below_Left} (0)
T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
      Attached_Below} (5)
T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
      Attached_Above} (1)
T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
      Attached_Above_Right} (9)
T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
      Below_Left} (1)
T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
      Below} (129)
T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
      Below_Right} (4)
T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
      Left} (2)
T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
      Right} (1)
T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
      Above_Left} (3)
T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
      Above} (349)
T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
      Above_Right} (4)
T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
      Double_Below} (4)
T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
      Double_Above} (5)
T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
      Iota_Subscript} (1)
  \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
      Above} (349)
  \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (349)
  \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
      Above_Left} (3)
  \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
      Above_Right} (4)
  \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
      Attached_Above} (1)
  \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
      Attached_Above_Right} (9)
  \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
      Attached_Below} (5)
  \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
      Attached_Below_Left} (0)
  \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
      (1)
  \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
      \p{Ccc=ATAR}) (9)
  \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
      (5)
  \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
      ATBL}) (0)
  \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
      Below} (129)
  \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (129)
  \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
      Below_Left} (1)
  \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
      Below_Right} (4)
  \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
  \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
  \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
  \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
  \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
  \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
  \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
  \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
  \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
  \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
  \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
  \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
  \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
  \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
  \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
  \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
  \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
  \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
  \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
  \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
  \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
  \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
  \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
  \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
  \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
  \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
  \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
  \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
  \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
  \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
  \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
  \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
  \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
  \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
  \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
  \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
  \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
      Double_Above} (5)
  \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
      Double_Below} (4)
  \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
      (1)
  \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
      Iota_Subscript} (1)
  \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
      Kana_Voicing} (2)
  \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
      Left} (2)
  \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
      Nukta} (13)
  \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
      (1_113_459)
  \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
      Not_Reordered} (1_113_459)
  \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (13)
  \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
      Overlay} (26)
  \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
  \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
      Right} (1)
  \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (37)
  \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
      Virama} (37)
  1098. \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
  1099. Canadian_Aboriginal}) (710)
  1100. \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
  1101. \p{Block=Carian}) (49)
  1102. \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
  1103. \p{Block=Carian}) (49)
  1104. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (1799)
  1105. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_112_313)
  1106. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (1799)
  1107. \p{Cased} \p{Cased=Y} (3448)
  1108. \p{Cased: N*} (Single: \P{Cased}) (1_110_664)
  1109. \p{Cased: Y*} (Single: \p{Cased}) (3448)
  1110. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1111. \p{LC}) (3223)
  1112. \p{Category: *} \p{General_Category: *}
  1113. \p{Cc} \p{Cntrl} (= \p{General_Category=Control})
  1114. (65)
  1115. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1116. \p{CE} \p{Composition_Exclusion} (=
  1117. \p{Composition_Exclusion=Y}) (81)
  1118. \p{CE: *} \p{Composition_Exclusion: *}
  1119. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1120. (139)
  1121. \p{Chakma} \p{Script=Chakma} (Short: \p{Cakm}; NOT
  1122. \p{Block=Chakma}) (67)
  1123. \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
  1124. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1125. \p{CWCF}) (1107)
  1126. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1127. (1_113_005)
  1128. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1129. (1107)
  1130. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1131. \p{CWCM}) (2138)
  1132. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1133. (1_111_974)
  1134. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1135. (2138)
  1136. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1137. \p{CWL}) (1043)
  1138. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1139. (1_113_069)
  1140. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1043)
  1141. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1142. Y} (Short: \p{CWKCF}) (9944)
  1143. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1144. \P{CWKCF}) (1_104_168)
  1145. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1146. \p{CWKCF}) (9944)
  1147. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1148. \p{CWT}) (1099)
  1149. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1150. (1_113_013)
  1151. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1099)
  1152. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1153. \p{CWU}) (1126)
  1154. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1155. (1_112_986)
  1156. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1126)
  1157. \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
  1158. \p{Block=Cherokee}) (85)
  1159. \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
  1160. \p{Block=Cherokee}) (85)
  1161. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1162. Y}) (1799)
  1163. \p{CI: *} \p{Case_Ignorable: *}
  1164. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1165. CJK_Unified_Ideographs}) (20_992)
  1166. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1167. CJK_Compatibility}) (256)
  1168. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1169. CJK_Compatibility_Forms}) (32)
  1170. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1171. \p{Block=CJK_Compatibility_Ideographs})
  1172. (512)
  1173. X \p{CJK_Compat_Ideographs_Sup}
  1174. \p{CJK_Compatibility_Ideographs_Supplement}
  1175. (= \p{Block=
  1176. CJK_Compatibility_Ideographs_Supplement})
  1177. (544)
  1178. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1179. \p{InCJKCompat}) (256)
  1180. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1181. (Short: \p{InCJKCompatForms}) (32)
  1182. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1183. CJK_Compatibility_Ideographs} (Short:
  1184. \p{InCJKCompatIdeographs}) (512)
  1185. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1186. CJK_Compatibility_Ideographs_Supplement}
  1187. (Short: \p{InCJKCompatIdeographsSup})
  1188. (544)
  1189. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1190. \p{Block=
  1191. CJK_Unified_Ideographs_Extension_A})
  1192. (6592)
  1193. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1194. \p{Block=
  1195. CJK_Unified_Ideographs_Extension_B})
  1196. (42_720)
  1197. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1198. \p{Block=
  1199. CJK_Unified_Ideographs_Extension_C})
  1200. (4160)
  1201. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1202. \p{Block=
  1203. CJK_Unified_Ideographs_Extension_D})
  1204. (224)
  1205. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1206. CJK_Radicals_Supplement}) (128)
  1207. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1208. (Short: \p{InCJKRadicalsSup}) (128)
  1209. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1210. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1211. \p{Block=CJK_Symbols_And_Punctuation})
  1212. (64)
  1213. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1214. CJK_Symbols_And_Punctuation} (Short:
  1215. \p{InCJKSymbols}) (64)
  1216. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1217. (Short: \p{InCJK}) (20_992)
  1218. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1219. CJK_Unified_Ideographs_Extension_A}
  1220. (Short: \p{InCJKExtA}) (6592)
  1221. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1222. CJK_Unified_Ideographs_Extension_B}
  1223. (Short: \p{InCJKExtB}) (42_720)
  1224. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1225. CJK_Unified_Ideographs_Extension_C}
  1226. (Short: \p{InCJKExtC}) (4160)
  1227. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1228. CJK_Unified_Ideographs_Extension_D}
  1229. (Short: \p{InCJKExtD}) (224)
  1230. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1231. (Short: \p{Pe}) (71)
  1232. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1233. Unassigned}) (864_414)
  1234. \p{Cntrl} \p{General_Category=Control} Control
  1235. characters (Short: \p{Cc}) (65)
  1236. \p{Co} \p{Private_Use} (= \p{General_Category=
  1237. Private_Use}) (NOT \p{Private_Use_Area})
  1238. (137_468)
  1239. X \p{Combining_Diacritical_Marks} \p{Block=
  1240. Combining_Diacritical_Marks} (Short:
  1241. \p{InDiacriticals}) (112)
  1242. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1243. Combining_Diacritical_Marks_For_Symbols}
  1244. (Short: \p{InDiacriticalsForSymbols})
  1245. (48)
  1246. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1247. Combining_Diacritical_Marks_Supplement}
  1248. (Short: \p{InDiacriticalsSup}) (64)
  1249. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1250. \p{InHalfMarks}) (16)
  1251. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1252. (1645)
  1253. X \p{Combining_Marks_For_Symbols}
  1254. \p{Combining_Diacritical_Marks_For_Symbols}
  1255. (= \p{Block=
  1256. Combining_Diacritical_Marks_For_Symbols})
  1257. (48)
  1258. \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (6413)
  1259. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1260. (Short: \p{InIndicNumberForms}) (16)
  1261. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1262. \p{Full_Composition_Exclusion=Y}) (1120)
  1263. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1264. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1265. Hangul_Compatibility_Jamo}) (96)
  1266. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1267. \p{CE}) (81)
  1268. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031)
  1269. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  1270. \p{Connector_Punctuation} \p{General_Category=
  1271. Connector_Punctuation} (Short: \p{Pc})
  1272. (10)
  1273. \p{Control} \p{Cntrl} (= \p{General_Category=Control})
  1274. (65)
  1275. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1276. \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
  1277. \p{Block=Coptic}) (137)
  1278. \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
  1279. \p{Block=Coptic}) (137)
  1280. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1281. Counting_Rod_Numerals}) (32)
  1282. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1283. \p{InCountingRod}) (32)
  1284. \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
  1285. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1286. Surrogate}) (2048)
  1287. \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
  1288. \p{Block=Cuneiform}) (982)
  1289. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1290. \p{Block=
  1291. Cuneiform_Numbers_And_Punctuation}) (128)
  1292. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1293. Cuneiform_Numbers_And_Punctuation}
  1294. (Short: \p{InCuneiformNumbers}) (128)
  1295. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1296. (Short: \p{Sc}) (49)
  1297. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1298. \p{CWCF} \p{Changes_When_Casefolded} (=
  1299. \p{Changes_When_Casefolded=Y}) (1107)
  1300. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1301. \p{CWCM} \p{Changes_When_Casemapped} (=
  1302. \p{Changes_When_Casemapped=Y}) (2138)
  1303. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1304. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1305. \p{Changes_When_NFKC_Casefolded=Y})
  1306. (9944)
  1307. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1308. \p{CWL} \p{Changes_When_Lowercased} (=
  1309. \p{Changes_When_Lowercased=Y}) (1043)
  1310. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1311. \p{CWT} \p{Changes_When_Titlecased} (=
  1312. \p{Changes_When_Titlecased=Y}) (1099)
  1313. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1314. \p{CWU} \p{Changes_When_Uppercased} (=
  1315. \p{Changes_When_Uppercased=Y}) (1126)
  1316. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1317. \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
  1318. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1319. \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
  1320. \p{Block=Cyrillic}) (417)
  1321. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1322. Cyrillic_Extended_A}) (32)
  1323. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1324. Cyrillic_Extended_B}) (96)
  1325. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1326. \p{InCyrillicExtA}) (32)
  1327. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1328. \p{InCyrillicExtB}) (96)
  1329. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1330. Cyrillic_Supplement}) (48)
  1331. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1332. \p{InCyrillicSup}) (48)
  1333. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1334. Cyrillic_Supplement}) (48)
  1335. \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
  1336. \p{Block=Cyrillic}) (417)
  1337. \p{Dash} \p{Dash=Y} (27)
  1338. \p{Dash: N*} (Single: \P{Dash}) (1_114_085)
  1339. \p{Dash: Y*} (Single: \p{Dash}) (27)
  1340. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1341. (Short: \p{Pd}) (23)
  1342. \p{Decimal_Number} \p{Digit} (= \p{General_Category=
  1343. Decimal_Number}) (460)
  1344. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1345. (13_225)
  1346. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_225)
  1347. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  1348. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1349. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  1350. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1351. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1352. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  1353. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
  1354. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1355. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  1356. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1357. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  1358. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1359. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  1360. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1361. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  1362. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1363. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  1364. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1365. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  1366. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1367. Non_Canonical} (Perl extension) (3655)
  1368. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1369. decompositions (Short: \p{Dt=NonCanon})
  1370. (Perl extension) (3655)
  1371. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_232)
  1372. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  1373. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1374. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
  1375. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
  1376. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  1377. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (146)
  1378. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (146)
  1379. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1380. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  1381. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  1382. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1383. Y} (Short: \p{DI}) (4167)
  1384. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1385. (1_109_945)
  1386. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1387. (4167)
  1388. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (111)
  1389. \p{Dep: *} \p{Deprecated: *}
  1390. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (111)
  1391. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_001)
  1392. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (111)
  1393. \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
  1394. \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
  1395. (NOT \p{Block=Devanagari}) (151)
  1396. \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
  1397. NOT \p{Block=Devanagari}) (151)
  1398. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  1399. Devanagari_Extended}) (32)
  1400. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  1401. \p{InDevanagariExt}) (32)
  1402. \p{DI} \p{Default_Ignorable_Code_Point} (=
  1403. \p{Default_Ignorable_Code_Point=Y})
  1404. (4167)
  1405. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  1406. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (693)
  1407. \p{Dia: *} \p{Diacritic: *}
  1408. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (693)
  1409. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_419)
  1410. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (693)
  1411. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  1412. \p{Block=Combining_Diacritical_Marks})
  1413. (112)
  1414. X \p{Diacriticals_For_Symbols}
  1415. \p{Combining_Diacritical_Marks_For_Symbols}
  1416. (= \p{Block=
  1417. Combining_Diacritical_Marks_For_Symbols})
  1418. (48)
  1419. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  1420. (= \p{Block=
  1421. Combining_Diacritical_Marks_Supplement})
  1422. (64)
  1423. \p{Digit} \p{General_Category=Decimal_Number} [0-9]
  1424. + all other decimal digits (Short:
  1425. \p{Nd}) (460)
  1426. X \p{Dingbats} \p{Block=Dingbats} (192)
  1427. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  1428. Domino_Tiles}) (112)
  1429. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  1430. \p{InDomino}) (112)
  1431. \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
  1432. \p{Dt: *} \p{Decomposition_Type: *}
  1433. \p{Ea: *} \p{East_Asian_Width: *}
  1434. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
  1435. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
  1436. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1437. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1438. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1439. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1440. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_894)
  1441. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1442. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1443. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_894)
  1444. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_134)
  1445. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_134)
  1446. \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
  1447. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1448. Egyptian_Hieroglyphs}) (1071)
  1449. \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
  1450. \p{Egyp}; NOT \p{Block=
  1451. Egyptian_Hieroglyphs}) (1071)
  1452. X \p{Emoticons} \p{Block=Emoticons} (80)
  1453. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  1454. Enclosed_Alphanumerics}) (160)
  1455. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  1456. \p{Block=
  1457. Enclosed_Alphanumeric_Supplement}) (256)
  1458. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1459. Enclosed_Alphanumeric_Supplement}
  1460. (Short: \p{InEnclosedAlphanumSup}) (256)
  1461. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  1462. (Short: \p{InEnclosedAlphanum}) (160)
  1463. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  1464. \p{Block=
  1465. Enclosed_CJK_Letters_And_Months}) (256)
  1466. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1467. Enclosed_CJK_Letters_And_Months} (Short:
  1468. \p{InEnclosedCJK}) (256)
  1469. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  1470. (= \p{Block=
  1471. Enclosed_Ideographic_Supplement}) (256)
  1472. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1473. Enclosed_Ideographic_Supplement} (Short:
  1474. \p{InEnclosedIdeographicSup}) (256)
  1475. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1476. (Short: \p{Me}) (12)
  1477. \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
  1478. \p{Block=Ethiopic}) (495)
  1479. \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
  1480. \p{Block=Ethiopic}) (495)
  1481. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  1482. Ethiopic_Extended}) (96)
  1483. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  1484. Ethiopic_Extended_A}) (48)
  1485. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  1486. \p{InEthiopicExt}) (96)
  1487. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  1488. \p{InEthiopicExtA}) (48)
  1489. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  1490. Ethiopic_Supplement}) (32)
  1491. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  1492. \p{InEthiopicSup}) (32)
  1493. \p{Ext} \p{Extender} (= \p{Extender=Y}) (31)
  1494. \p{Ext: *} \p{Extender: *}
  1495. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (31)
  1496. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_081)
  1497. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (31)
  1498. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1499. (Short: \p{Pf}) (10)
  1500. \p{Format} \p{General_Category=Format} (Short:
  1501. \p{Cf}) (139)
  1502. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1503. (Short: \p{CompEx}) (1120)
  1504. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1505. \P{CompEx}) (1_112_992)
  1506. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1507. \p{CompEx}) (1120)
  1508. \p{Gc: *} \p{General_Category: *}
  1509. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1510. \p{General_Category: C} \p{General_Category=Other} (1_004_134)
  1511. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1512. \p{Gc=LC}, \p{LC}) (3223)
  1513. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1514. \p{General_Category: Cf} \p{General_Category=Format} (139)
  1515. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1516. (71)
  1517. \p{General_Category: Cn} \p{General_Category=Unassigned} (864_414)
  1518. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1519. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1520. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  1521. (1645)
  1522. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1523. \p{Pc}) (10)
  1524. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1525. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1526. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1527. (49)
  1528. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1529. (23)
  1530. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1531. (460)
  1532. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1533. (460)
  1534. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1535. (12)
  1536. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1537. (10)
  1538. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (139)
  1539. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1540. \p{Pi}) (12)
  1541. \p{General_Category: L} \p{General_Category=Letter} (101_013)
  1542. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3223)
  1543. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  1544. the trailing '_' matters in spite of
  1545. loose matching rules. (3223)
  1546. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3223)
  1547. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (101_013)
  1548. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1549. (224)
  1550. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1551. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1552. (/i= General_Category=Cased_Letter)
  1553. (1751)
  1554. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (237)
  1555. \p{General_Category: Lo} \p{General_Category=Other_Letter} (97_553)
  1556. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1557. /i= General_Category=Cased_Letter) (1751)
  1558. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1559. (/i= General_Category=Cased_Letter) (31)
  1560. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1561. (/i= General_Category=Cased_Letter)
  1562. (1441)
  1563. \p{General_Category: M} \p{General_Category=Mark} (1645)
  1564. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1645)
  1565. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (952)
  1566. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (353)
  1567. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (12)
  1568. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1569. (1280)
  1570. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1571. (237)
  1572. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1573. (115)
  1574. \p{General_Category: N} \p{General_Category=Number} (1148)
  1575. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (460)
  1576. \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
  1577. \p{General_Category: No} \p{General_Category=Other_Number} (464)
  1578. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  1579. (1280)
  1580. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1148)
  1581. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  1582. (72)
  1583. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_004_134)
  1584. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  1585. (97_553)
  1586. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (464)
  1587. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  1588. (434)
  1589. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  1590. (4404)
  1591. \p{General_Category: P} \p{General_Category=Punctuation} (632)
  1592. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  1593. \p{Zp}) (1)
  1594. \p{General_Category: Pc} \p{General_Category=
  1595. Connector_Punctuation} (10)
  1596. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (23)
  1597. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  1598. (71)
  1599. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  1600. (10)
  1601. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  1602. (12)
  1603. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  1604. (434)
  1605. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  1606. (137_468)
  1607. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (72)
  1608. \p{General_Category: Punct} \p{General_Category=Punctuation} (632)
  1609. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (632)
  1610. \p{General_Category: S} \p{General_Category=Symbol} (5520)
  1611. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (49)
  1612. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (20)
  1613. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (115)
  1614. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (952)
  1615. \p{General_Category: So} \p{General_Category=Other_Symbol} (4404)
  1616. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  1617. (18)
  1618. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (353)
  1619. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  1620. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (5520)
  1621. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  1622. /i= General_Category=Cased_Letter) (31)
  1623. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  1624. (864_414)
  1625. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  1626. /i= General_Category=Cased_Letter) (1441)
  1627. \p{General_Category: Z} \p{General_Category=Separator} (20)
  1628. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  1629. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  1630. (1)
  1631. \p{General_Category: Zs} \p{General_Category=Space_Separator} (18)
  1632. X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
  1633. \p{InPunctuation}) (112)
  1634. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  1635. \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
  1636. \p{Block=Georgian}) (127)
  1637. \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
  1638. \p{Block=Georgian}) (127)
  1639. X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
  1640. Georgian_Supplement}) (48)
  1641. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
  1642. \p{InGeorgianSup}) (48)
  1643. \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
  1644. (NOT \p{Block=Glagolitic}) (94)
  1645. \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
  1646. NOT \p{Block=Glagolitic}) (94)
  1647. \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
  1648. \p{Block=Gothic}) (27)
  1649. \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
  1650. \p{Block=Gothic}) (27)
  1651. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  1652. (108_661)
  1653. \p{Gr_Base: *} \p{Grapheme_Base: *}
  1654. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  1655. Y}) (1317)
  1656. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  1657. \p{Graph} Characters that are graphical (247_565)
  1658. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  1659. (108_661)
  1660. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase})
  1661. (1_005_451)
  1662. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (108_661)
  1663. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  1664. (6023)
  1665. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6023)
  1666. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  1667. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  1668. (1317)
  1669. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1317)
  1670. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  1671. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  1672. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  1673. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  1674. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_924)
  1675. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  1676. (0)
  1677. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
  1678. \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
  1679. (26)
  1680. \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
  1681. Regional_Indicator} (26)
  1682. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  1683. SpacingMark} (291)
  1684. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (291)
  1685. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  1686. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  \p{Grapheme_Cluster_Break: XX}  \p{Grapheme_Cluster_Break=Other}
      (1_094_924)
  \p{Grapheme_Extend}  \p{Grapheme_Extend=Y} (Short: \p{GrExt})
      (1317)
  \p{Grapheme_Extend: N*}  (Short: \p{GrExt=N}, \P{GrExt}) (1_112_795)
  \p{Grapheme_Extend: Y*}  (Short: \p{GrExt=Y}, \p{GrExt}) (1317)
  \p{Greek}  \p{Script=Greek} (Short: \p{Grek}; NOT
      \p{Greek_And_Coptic}) (511)
X \p{Greek_And_Coptic}  \p{Block=Greek_And_Coptic} (Short:
      \p{InGreek}) (144)
X \p{Greek_Ext}  \p{Greek_Extended} (=
      \p{Block=Greek_Extended}) (256)
X \p{Greek_Extended}  \p{Block=Greek_Extended} (Short:
      \p{InGreekExt}) (256)
  \p{Grek}  \p{Greek} (= \p{Script=Greek}) (NOT
      \p{Greek_And_Coptic}) (511)
  \p{Gujarati}  \p{Script=Gujarati} (Short: \p{Gujr}; NOT
      \p{Block=Gujarati}) (84)
  \p{Gujr}  \p{Gujarati} (= \p{Script=Gujarati}) (NOT
      \p{Block=Gujarati}) (84)
  \p{Gurmukhi}  \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
      \p{Block=Gurmukhi}) (79)
  \p{Guru}  \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
      \p{Block=Gurmukhi}) (79)
X \p{Half_And_Full_Forms}  \p{Halfwidth_And_Fullwidth_Forms} (=
      \p{Block=Halfwidth_And_Fullwidth_Forms})
      (240)
X \p{Half_Marks}  \p{Combining_Half_Marks} (=
      \p{Block=Combining_Half_Marks}) (16)
X \p{Halfwidth_And_Fullwidth_Forms}
      \p{Block=Halfwidth_And_Fullwidth_Forms} (Short:
      \p{InHalfAndFullForms}) (240)
  \p{Han}  \p{Script=Han} (75_963)
  \p{Hang}  \p{Hangul} (= \p{Script=Hangul}) (NOT
      \p{Hangul_Syllables}) (11_739)
  \p{Hangul}  \p{Script=Hangul} (Short: \p{Hang}; NOT
      \p{Hangul_Syllables}) (11_739)
X \p{Hangul_Compatibility_Jamo}  \p{Block=Hangul_Compatibility_Jamo}
      (Short: \p{InCompatJamo}) (96)
X \p{Hangul_Jamo}  \p{Block=Hangul_Jamo} (Short: \p{InJamo})
      (256)
X \p{Hangul_Jamo_Extended_A}  \p{Block=Hangul_Jamo_Extended_A}
      (Short: \p{InJamoExtA}) (32)
X \p{Hangul_Jamo_Extended_B}  \p{Block=Hangul_Jamo_Extended_B}
      (Short: \p{InJamoExtB}) (80)
  \p{Hangul_Syllable_Type: L}  \p{Hangul_Syllable_Type=Leading_Jamo}
      (125)
  \p{Hangul_Syllable_Type: Leading_Jamo}  (Short: \p{Hst=L}) (125)
  \p{Hangul_Syllable_Type: LV}  \p{Hangul_Syllable_Type=LV_Syllable}
      (399)
  \p{Hangul_Syllable_Type: LV_Syllable}  (Short: \p{Hst=LV}) (399)
  \p{Hangul_Syllable_Type: LVT}
      \p{Hangul_Syllable_Type=LVT_Syllable} (10_773)
  \p{Hangul_Syllable_Type: LVT_Syllable}  (Short: \p{Hst=LVT})
      (10_773)
  \p{Hangul_Syllable_Type: NA}
      \p{Hangul_Syllable_Type=Not_Applicable} (1_102_583)
  \p{Hangul_Syllable_Type: Not_Applicable}  (Short: \p{Hst=NA})
      (1_102_583)
  \p{Hangul_Syllable_Type: T}  \p{Hangul_Syllable_Type=Trailing_Jamo}
      (137)
  \p{Hangul_Syllable_Type: Trailing_Jamo}  (Short: \p{Hst=T}) (137)
  \p{Hangul_Syllable_Type: V}  \p{Hangul_Syllable_Type=Vowel_Jamo}
      (95)
  \p{Hangul_Syllable_Type: Vowel_Jamo}  (Short: \p{Hst=V}) (95)
X \p{Hangul_Syllables}  \p{Block=Hangul_Syllables} (Short:
      \p{InHangul}) (11_184)
  \p{Hani}  \p{Han} (= \p{Script=Han}) (75_963)
  \p{Hano}  \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
      \p{Block=Hanunoo}) (21)
  \p{Hanunoo}  \p{Script=Hanunoo} (Short: \p{Hano}; NOT
      \p{Block=Hanunoo}) (21)
  \p{Hebr}  \p{Hebrew} (= \p{Script=Hebrew}) (NOT
      \p{Block=Hebrew}) (133)
  \p{Hebrew}  \p{Script=Hebrew} (Short: \p{Hebr}; NOT
      \p{Block=Hebrew}) (133)
  \p{Hex}  \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex: *}  \p{Hex_Digit: *}
  \p{Hex_Digit}  \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex_Digit: N*}  (Short: \p{Hex=N}, \P{Hex}) (1_114_068)
  \p{Hex_Digit: Y*}  (Short: \p{Hex=Y}, \p{Hex}) (44)
X \p{High_Private_Use_Surrogates}
      \p{Block=High_Private_Use_Surrogates} (Short:
      \p{InHighPUSurrogates}) (128)
X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
      \p{Block=High_Private_Use_Surrogates})
      (128)
X \p{High_Surrogates}  \p{Block=High_Surrogates} (896)
  \p{Hira}  \p{Hiragana} (= \p{Script=Hiragana}) (NOT
      \p{Block=Hiragana}) (91)
  \p{Hiragana}  \p{Script=Hiragana} (Short: \p{Hira}; NOT
      \p{Block=Hiragana}) (91)
  \p{HorizSpace}  \p{Blank} (19)
  \p{Hst: *}  \p{Hangul_Syllable_Type: *}
D \p{Hyphen}  \p{Hyphen=Y} (11)
D \p{Hyphen: N*}  Supplanted by Line_Break property values;
      see www.unicode.org/reports/tr14
      (Single: \P{Hyphen}) (1_114_101)
D \p{Hyphen: Y*}  Supplanted by Line_Break property values;
      see www.unicode.org/reports/tr14
      (Single: \p{Hyphen}) (11)
  \p{ID_Continue}  \p{ID_Continue=Y} (Short: \p{IDC}; NOT
      \p{Ideographic_Description_Characters})
      (103_355)
  \p{ID_Continue: N*}  (Short: \p{IDC=N}, \P{IDC}) (1_010_757)
  \p{ID_Continue: Y*}  (Short: \p{IDC=Y}, \p{IDC}) (103_355)
  \p{ID_Start}  \p{ID_Start=Y} (Short: \p{IDS}) (101_240)
  \p{ID_Start: N*}  (Short: \p{IDS=N}, \P{IDS}) (1_012_872)
  \p{ID_Start: Y*}  (Short: \p{IDS=Y}, \p{IDS}) (101_240)
  \p{IDC}  \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
      \p{Ideographic_Description_Characters})
      (103_355)
  \p{IDC: *}  \p{ID_Continue: *}
  \p{Ideo}  \p{Ideographic} (= \p{Ideographic=Y})
      (75_633)
  \p{Ideo: *}  \p{Ideographic: *}
  \p{Ideographic}  \p{Ideographic=Y} (Short: \p{Ideo})
      (75_633)
  \p{Ideographic: N*}  (Short: \p{Ideo=N}, \P{Ideo}) (1_038_479)
  \p{Ideographic: Y*}  (Short: \p{Ideo=Y}, \p{Ideo}) (75_633)
X \p{Ideographic_Description_Characters}
      \p{Block=Ideographic_Description_Characters}
      (Short: \p{InIDC}) (16)
  \p{IDS}  \p{ID_Start} (= \p{ID_Start=Y}) (101_240)
  \p{IDS: *}  \p{ID_Start: *}
  \p{IDS_Binary_Operator}  \p{IDS_Binary_Operator=Y} (Short:
      \p{IDSB}) (10)
  \p{IDS_Binary_Operator: N*}  (Short: \p{IDSB=N}, \P{IDSB})
      (1_114_102)
  \p{IDS_Binary_Operator: Y*}  (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  \p{IDS_Trinary_Operator}  \p{IDS_Trinary_Operator=Y} (Short:
      \p{IDST}) (2)
  \p{IDS_Trinary_Operator: N*}  (Short: \p{IDST=N}, \P{IDST})
      (1_114_110)
  \p{IDS_Trinary_Operator: Y*}  (Short: \p{IDST=Y}, \p{IDST}) (2)
  \p{IDSB}  \p{IDS_Binary_Operator} (=
      \p{IDS_Binary_Operator=Y}) (10)
  \p{IDSB: *}  \p{IDS_Binary_Operator: *}
  \p{IDST}  \p{IDS_Trinary_Operator} (=
      \p{IDS_Trinary_Operator=Y}) (2)
  \p{IDST: *}  \p{IDS_Trinary_Operator: *}
  \p{Imperial_Aramaic}  \p{Script=Imperial_Aramaic} (Short:
      \p{Armi}; NOT \p{Block=Imperial_Aramaic}) (31)
  \p{In: *}  \p{Present_In: *} (Perl extension)
  \p{In_*}  \p{Block: *}
X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (=
      \p{Block=Common_Indic_Number_Forms}) (16)
  \p{Inherited}  \p{Script=Inherited} (Short: \p{Zinh})
      (523)
  \p{Initial_Punctuation}  \p{General_Category=Initial_Punctuation}
      (Short: \p{Pi}) (12)
  \p{Inscriptional_Pahlavi}  \p{Script=Inscriptional_Pahlavi} (Short:
      \p{Phli}; NOT \p{Block=Inscriptional_Pahlavi}) (27)
  \p{Inscriptional_Parthian}  \p{Script=Inscriptional_Parthian}
      (Short: \p{Prti}; NOT \p{Block=Inscriptional_Parthian}) (30)
X \p{IPA_Ext}  \p{IPA_Extensions} (=
      \p{Block=IPA_Extensions}) (96)
X \p{IPA_Extensions}  \p{Block=IPA_Extensions} (Short:
      \p{InIPAExt}) (96)
  \p{Is_*}  \p{*} (Any exceptions are individually
      noted beginning with the word NOT.) If
      an entry has flag(s) at its beginning,
      like "D", the "Is_" form has the same
      flag(s)
  \p{Ital}  \p{Old_Italic} (= \p{Script=Old_Italic})
      (NOT \p{Block=Old_Italic}) (35)
X \p{Jamo}  \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
      (256)
X \p{Jamo_Ext_A}  \p{Hangul_Jamo_Extended_A} (=
      \p{Block=Hangul_Jamo_Extended_A}) (32)
X \p{Jamo_Ext_B}  \p{Hangul_Jamo_Extended_B} (=
      \p{Block=Hangul_Jamo_Extended_B}) (80)
  \p{Java}  \p{Javanese} (= \p{Script=Javanese}) (NOT
      \p{Block=Javanese}) (91)
  \p{Javanese}  \p{Script=Javanese} (Short: \p{Java}; NOT
      \p{Block=Javanese}) (91)
  \p{Jg: *}  \p{Joining_Group: *}
  \p{Join_C}  \p{Join_Control} (= \p{Join_Control=Y}) (2)
  \p{Join_C: *}  \p{Join_Control: *}
  \p{Join_Control}  \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  \p{Join_Control: N*}  (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110)
  \p{Join_Control: Y*}  (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (7)
  \p{Joining_Group: Alaph}  (Short: \p{Jg=Alaph}) (1)
  \p{Joining_Group: Alef}  (Short: \p{Jg=Alef}) (10)
  \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (20)
  \p{Joining_Group: Beth}  (Short: \p{Jg=Beth}) (2)
  \p{Joining_Group: Burushaski_Yeh_Barree}  (Short:
      \p{Jg=BurushaskiYehBarree}) (2)
  \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (14)
  \p{Joining_Group: Dalath_Rish}  (Short: \p{Jg=DalathRish}) (4)
  \p{Joining_Group: E}  (Short: \p{Jg=E}) (1)
  \p{Joining_Group: Farsi_Yeh}  (Short: \p{Jg=FarsiYeh}) (7)
  \p{Joining_Group: Fe}  (Short: \p{Jg=Fe}) (1)
  \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10)
  \p{Joining_Group: Final_Semkath}  (Short: \p{Jg=FinalSemkath}) (1)
  \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (13)
  \p{Joining_Group: Gamal}  (Short: \p{Jg=Gamal}) (3)
  \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18)
  \p{Joining_Group: Hamza_On_Heh_Goal}  (Short:
      \p{Jg=HamzaOnHehGoal}) (1)
  \p{Joining_Group: He}  (Short: \p{Jg=He}) (1)
  \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1)
  \p{Joining_Group: Heh_Goal}  (Short: \p{Jg=HehGoal}) (2)
  \p{Joining_Group: Heth}  (Short: \p{Jg=Heth}) (1)
  \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (5)
  \p{Joining_Group: Kaph}  (Short: \p{Jg=Kaph}) (1)
  \p{Joining_Group: Khaph}  (Short: \p{Jg=Khaph}) (1)
  \p{Joining_Group: Knotted_Heh}  (Short: \p{Jg=KnottedHeh}) (2)
  \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7)
  \p{Joining_Group: Lamadh}  (Short: \p{Jg=Lamadh}) (1)
  \p{Joining_Group: Meem}  (Short: \p{Jg=Meem}) (4)
  \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1)
  \p{Joining_Group: No_Joining_Group}  (Short: \p{Jg=NoJoiningGroup})
      (1_113_870)
  \p{Joining_Group: Noon}  (Short: \p{Jg=Noon}) (8)
  \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1)
  \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1)
  \p{Joining_Group: Pe}  (Short: \p{Jg=Pe}) (1)
  \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5)
  \p{Joining_Group: Qaph}  (Short: \p{Jg=Qaph}) (1)
  \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (17)
  \p{Joining_Group: Reversed_Pe}  (Short: \p{Jg=ReversedPe}) (1)
  \p{Joining_Group: Rohingya_Yeh}  (Short: \p{Jg=RohingyaYeh}) (1)
  \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (5)
  \p{Joining_Group: Sadhe}  (Short: \p{Jg=Sadhe}) (1)
  \p{Joining_Group: Seen}  (Short: \p{Jg=Seen}) (11)
  \p{Joining_Group: Semkath}  (Short: \p{Jg=Semkath}) (1)
  \p{Joining_Group: Shin}  (Short: \p{Jg=Shin}) (1)
  \p{Joining_Group: Swash_Kaf}  (Short: \p{Jg=SwashKaf}) (1)
  \p{Joining_Group: Syriac_Waw}  (Short: \p{Jg=SyriacWaw}) (1)
  \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4)
  \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1)
  \p{Joining_Group: Teh_Marbuta}  (Short: \p{Jg=TehMarbuta}) (3)
  \p{Joining_Group: Teh_Marbuta_Goal}
      \p{Joining_Group=Hamza_On_Heh_Goal} (1)
  \p{Joining_Group: Teth}  (Short: \p{Jg=Teth}) (2)
  \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16)
  \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (10)
  \p{Joining_Group: Yeh_Barree}  (Short: \p{Jg=YehBarree}) (2)
  \p{Joining_Group: Yeh_With_Tail}  (Short: \p{Jg=YehWithTail}) (1)
  \p{Joining_Group: Yudh}  (Short: \p{Jg=Yudh}) (1)
  \p{Joining_Group: Yudh_He}  (Short: \p{Jg=YudhHe}) (1)
  \p{Joining_Group: Zain}  (Short: \p{Jg=Zain}) (1)
  \p{Joining_Group: Zhain}  (Short: \p{Jg=Zhain}) (1)
  \p{Joining_Type: C}  \p{Joining_Type=Join_Causing} (3)
  \p{Joining_Type: D}  \p{Joining_Type=Dual_Joining} (215)
  \p{Joining_Type: Dual_Joining}  (Short: \p{Jt=D}) (215)
  \p{Joining_Type: Join_Causing}  (Short: \p{Jt=C}) (3)
  \p{Joining_Type: L}  \p{Joining_Type=Left_Joining} (0)
  \p{Joining_Type: Left_Joining}  (Short: \p{Jt=L}) (0)
  \p{Joining_Type: Non_Joining}  (Short: \p{Jt=U}) (1_112_389)
  \p{Joining_Type: R}  \p{Joining_Type=Right_Joining} (82)
  \p{Joining_Type: Right_Joining}  (Short: \p{Jt=R}) (82)
  \p{Joining_Type: T}  \p{Joining_Type=Transparent} (1423)
  \p{Joining_Type: Transparent}  (Short: \p{Jt=T}) (1423)
  \p{Joining_Type: U}  \p{Joining_Type=Non_Joining} (1_112_389)
  \p{Jt: *}  \p{Joining_Type: *}
  \p{Kaithi}  \p{Script=Kaithi} (Short: \p{Kthi}; NOT
      \p{Block=Kaithi}) (66)
  \p{Kali}  \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
  \p{Kana}  \p{Katakana} (= \p{Script=Katakana}) (NOT
      \p{Block=Katakana}) (300)
X \p{Kana_Sup}  \p{Kana_Supplement} (=
      \p{Block=Kana_Supplement}) (256)
X \p{Kana_Supplement}  \p{Block=Kana_Supplement} (Short:
      \p{InKanaSup}) (256)
X \p{Kanbun}  \p{Block=Kanbun} (16)
X \p{Kangxi}  \p{Kangxi_Radicals} (=
      \p{Block=Kangxi_Radicals}) (224)
X \p{Kangxi_Radicals}  \p{Block=Kangxi_Radicals} (Short:
      \p{InKangxi}) (224)
  \p{Kannada}  \p{Script=Kannada} (Short: \p{Knda}; NOT
      \p{Block=Kannada}) (86)
  \p{Katakana}  \p{Script=Katakana} (Short: \p{Kana}; NOT
      \p{Block=Katakana}) (300)
X \p{Katakana_Ext}  \p{Katakana_Phonetic_Extensions} (=
      \p{Block=Katakana_Phonetic_Extensions})
      (16)
X \p{Katakana_Phonetic_Extensions}
      \p{Block=Katakana_Phonetic_Extensions} (Short:
      \p{InKatakanaExt}) (16)
  \p{Kayah_Li}  \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
  \p{Khar}  \p{Kharoshthi} (= \p{Script=Kharoshthi})
      (NOT \p{Block=Kharoshthi}) (65)
  \p{Kharoshthi}  \p{Script=Kharoshthi} (Short: \p{Khar};
      NOT \p{Block=Kharoshthi}) (65)
  \p{Khmer}  \p{Script=Khmer} (Short: \p{Khmr}; NOT
      \p{Block=Khmer}) (146)
X \p{Khmer_Symbols}  \p{Block=Khmer_Symbols} (32)
  \p{Khmr}  \p{Khmer} (= \p{Script=Khmer}) (NOT
      \p{Block=Khmer}) (146)
  \p{Knda}  \p{Kannada} (= \p{Script=Kannada}) (NOT
      \p{Block=Kannada}) (86)
  \p{Kthi}  \p{Kaithi} (= \p{Script=Kaithi}) (NOT
      \p{Block=Kaithi}) (66)
  \p{L}  \p{Letter} (= \p{General_Category=Letter})
      (101_013)
X \p{L&}  \p{Cased_Letter} (=
      \p{General_Category=Cased_Letter}) (3223)
X \p{L_}  \p{Cased_Letter} (=
      \p{General_Category=Cased_Letter}) Note the trailing '_'
      matters in spite of loose matching rules. (3223)
  \p{Lana}  \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
      \p{Block=Tai_Tham}) (127)
  \p{Lao}  \p{Script=Lao} (NOT \p{Block=Lao}) (67)
  \p{Laoo}  \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=Lao}) (67)
  \p{Latin}  \p{Script=Latin} (Short: \p{Latn}) (1272)
X \p{Latin_1}  \p{Latin_1_Supplement} (=
      \p{Block=Latin_1_Supplement}) (128)
X \p{Latin_1_Sup}  \p{Latin_1_Supplement} (=
      \p{Block=Latin_1_Supplement}) (128)
X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
      \p{InLatin1}) (128)
X \p{Latin_Ext_A}  \p{Latin_Extended_A} (=
      \p{Block=Latin_Extended_A}) (128)
X \p{Latin_Ext_Additional}  \p{Latin_Extended_Additional} (=
      \p{Block=Latin_Extended_Additional})
      (256)
X \p{Latin_Ext_B}  \p{Latin_Extended_B} (=
      \p{Block=Latin_Extended_B}) (208)
X \p{Latin_Ext_C}  \p{Latin_Extended_C} (=
      \p{Block=Latin_Extended_C}) (32)
X \p{Latin_Ext_D}  \p{Latin_Extended_D} (=
      \p{Block=Latin_Extended_D}) (224)
X \p{Latin_Extended_A}  \p{Block=Latin_Extended_A} (Short:
      \p{InLatinExtA}) (128)
X \p{Latin_Extended_Additional}  \p{Block=Latin_Extended_Additional}
      (Short: \p{InLatinExtAdditional}) (256)
X \p{Latin_Extended_B}  \p{Block=Latin_Extended_B} (Short:
      \p{InLatinExtB}) (208)
X \p{Latin_Extended_C}  \p{Block=Latin_Extended_C} (Short:
      \p{InLatinExtC}) (32)
X \p{Latin_Extended_D}  \p{Block=Latin_Extended_D} (Short:
      \p{InLatinExtD}) (224)
  \p{Latn}  \p{Latin} (= \p{Script=Latin}) (1272)
  \p{Lb: *}  \p{Line_Break: *}
  \p{LC}  \p{Cased_Letter} (=
      \p{General_Category=Cased_Letter}) (3223)
  \p{Lepc}  \p{Lepcha} (= \p{Script=Lepcha}) (NOT
      \p{Block=Lepcha}) (74)
  \p{Lepcha}  \p{Script=Lepcha} (Short: \p{Lepc}; NOT
      \p{Block=Lepcha}) (74)
  \p{Letter}  \p{General_Category=Letter} (Short: \p{L})
      (101_013)
  \p{Letter_Number}  \p{General_Category=Letter_Number} (Short:
      \p{Nl}) (224)
X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
  \p{Limb}  \p{Limbu} (= \p{Script=Limbu}) (NOT
      \p{Block=Limbu}) (66)
  \p{Limbu}  \p{Script=Limbu} (Short: \p{Limb}; NOT
      \p{Block=Limbu}) (66)
  \p{Linb}  \p{Linear_B} (= \p{Script=Linear_B}) (211)
  \p{Line_Break: AI}  \p{Line_Break=Ambiguous} (687)
  \p{Line_Break: AL}  \p{Line_Break=Alphabetic} (15_355)
  \p{Line_Break: Alphabetic}  (Short: \p{Lb=AL}) (15_355)
  \p{Line_Break: Ambiguous}  (Short: \p{Lb=AI}) (687)
  \p{Line_Break: B2}  \p{Line_Break=Break_Both} (3)
  \p{Line_Break: BA}  \p{Line_Break=Break_After} (151)
  \p{Line_Break: BB}  \p{Line_Break=Break_Before} (19)
  \p{Line_Break: BK}  \p{Line_Break=Mandatory_Break} (4)
  \p{Line_Break: Break_After}  (Short: \p{Lb=BA}) (151)
  \p{Line_Break: Break_Before}  (Short: \p{Lb=BB}) (19)
  \p{Line_Break: Break_Both}  (Short: \p{Lb=B2}) (3)
  \p{Line_Break: Break_Symbols}  (Short: \p{Lb=SY}) (1)
  \p{Line_Break: Carriage_Return}  (Short: \p{Lb=CR}) (1)
  \p{Line_Break: CB}  \p{Line_Break=Contingent_Break} (1)
  \p{Line_Break: CJ}
      \p{Line_Break=Conditional_Japanese_Starter} (51)
  \p{Line_Break: CL}  \p{Line_Break=Close_Punctuation} (87)
  \p{Line_Break: Close_Parenthesis}  (Short: \p{Lb=CP}) (2)
  \p{Line_Break: Close_Punctuation}  (Short: \p{Lb=CL}) (87)
  \p{Line_Break: CM}  \p{Line_Break=Combining_Mark} (1628)
  \p{Line_Break: Combining_Mark}  (Short: \p{Lb=CM}) (1628)
  \p{Line_Break: Complex_Context}  (Short: \p{Lb=SA}) (665)
  \p{Line_Break: Conditional_Japanese_Starter}  (Short: \p{Lb=CJ})
      (51)
  \p{Line_Break: Contingent_Break}  (Short: \p{Lb=CB}) (1)
  \p{Line_Break: CP}  \p{Line_Break=Close_Parenthesis} (2)
  \p{Line_Break: CR}  \p{Line_Break=Carriage_Return} (1)
  \p{Line_Break: EX}  \p{Line_Break=Exclamation} (34)
  \p{Line_Break: Exclamation}  (Short: \p{Lb=EX}) (34)
  \p{Line_Break: GL}  \p{Line_Break=Glue} (18)
  \p{Line_Break: Glue}  (Short: \p{Lb=GL}) (18)
  \p{Line_Break: H2}  (Short: \p{Lb=H2}) (399)
  \p{Line_Break: H3}  (Short: \p{Lb=H3}) (10_773)
  \p{Line_Break: Hebrew_Letter}  (Short: \p{Lb=HL}) (74)
  \p{Line_Break: HL}  \p{Line_Break=Hebrew_Letter} (74)
  \p{Line_Break: HY}  \p{Line_Break=Hyphen} (1)
  \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1)
  \p{Line_Break: ID}  \p{Line_Break=Ideographic} (162_700)
  \p{Line_Break: Ideographic}  (Short: \p{Lb=ID}) (162_700)
  \p{Line_Break: IN}  \p{Line_Break=Inseparable} (4)
  \p{Line_Break: Infix_Numeric}  (Short: \p{Lb=IS}) (13)
  \p{Line_Break: Inseparable}  (Short: \p{Lb=IN}) (4)
  \p{Line_Break: Inseperable}  \p{Line_Break=Inseparable} (4)
  \p{Line_Break: IS}  \p{Line_Break=Infix_Numeric} (13)
  \p{Line_Break: JL}  (Short: \p{Lb=JL}) (125)
  \p{Line_Break: JT}  (Short: \p{Lb=JT}) (137)
  \p{Line_Break: JV}  (Short: \p{Lb=JV}) (95)
  \p{Line_Break: LF}  \p{Line_Break=Line_Feed} (1)
  \p{Line_Break: Line_Feed}  (Short: \p{Lb=LF}) (1)
  \p{Line_Break: Mandatory_Break}  (Short: \p{Lb=BK}) (4)
  \p{Line_Break: Next_Line}  (Short: \p{Lb=NL}) (1)
  \p{Line_Break: NL}  \p{Line_Break=Next_Line} (1)
  \p{Line_Break: Nonstarter}  (Short: \p{Lb=NS}) (26)
  \p{Line_Break: NS}  \p{Line_Break=Nonstarter} (26)
  \p{Line_Break: NU}  \p{Line_Break=Numeric} (452)
  \p{Line_Break: Numeric}  (Short: \p{Lb=NU}) (452)
  \p{Line_Break: OP}  \p{Line_Break=Open_Punctuation} (81)
  \p{Line_Break: Open_Punctuation}  (Short: \p{Lb=OP}) (81)
  \p{Line_Break: PO}  \p{Line_Break=Postfix_Numeric} (28)
  \p{Line_Break: Postfix_Numeric}  (Short: \p{Lb=PO}) (28)
  \p{Line_Break: PR}  \p{Line_Break=Prefix_Numeric} (46)
  \p{Line_Break: Prefix_Numeric}  (Short: \p{Lb=PR}) (46)
  \p{Line_Break: QU}  \p{Line_Break=Quotation} (34)
  \p{Line_Break: Quotation}  (Short: \p{Lb=QU}) (34)
  \p{Line_Break: Regional_Indicator}  (Short: \p{Lb=RI}) (26)
  \p{Line_Break: RI}  \p{Line_Break=Regional_Indicator} (26)
  \p{Line_Break: SA}  \p{Line_Break=Complex_Context} (665)
D \p{Line_Break: SG}  \p{Line_Break=Surrogate} (2048)
  \p{Line_Break: SP}  \p{Line_Break=Space} (1)
  \p{Line_Break: Space}  (Short: \p{Lb=SP}) (1)
D \p{Line_Break: Surrogate}  Deprecated by Unicode because surrogates
      should never appear in well-formed text,
      and therefore shouldn't be the basis for
      line breaking (Short: \p{Lb=SG}) (2048)
  \p{Line_Break: SY}  \p{Line_Break=Break_Symbols} (1)
  \p{Line_Break: Unknown}  (Short: \p{Lb=XX}) (918_337)
  \p{Line_Break: WJ}  \p{Line_Break=Word_Joiner} (2)
  \p{Line_Break: Word_Joiner}  (Short: \p{Lb=WJ}) (2)
  \p{Line_Break: XX}  \p{Line_Break=Unknown} (918_337)
  \p{Line_Break: ZW}  \p{Line_Break=ZWSpace} (1)
  \p{Line_Break: ZWSpace}  (Short: \p{Lb=ZW}) (1)
  \p{Line_Separator}  \p{General_Category=Line_Separator}
      (Short: \p{Zl}) (1)
  \p{Linear_B}  \p{Script=Linear_B} (Short: \p{Linb}) (211)
X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
  \p{Lisu}  \p{Script=Lisu} (48)
  \p{Ll}  \p{Lowercase_Letter} (=
      \p{General_Category=Lowercase_Letter})
      (/i= General_Category=Cased_Letter)
      (1751)
  \p{Lm}  \p{Modifier_Letter} (=
      \p{General_Category=Modifier_Letter})
      (237)
  \p{Lo}  \p{Other_Letter} (=
      \p{General_Category=Other_Letter}) (97_553)
  \p{LOE}  \p{Logical_Order_Exception} (=
      \p{Logical_Order_Exception=Y}) (15)
  \p{LOE: *}  \p{Logical_Order_Exception: *}
  \p{Logical_Order_Exception}  \p{Logical_Order_Exception=Y} (Short:
      \p{LOE}) (15)
  \p{Logical_Order_Exception: N*}  (Short: \p{LOE=N}, \P{LOE})
      (1_114_097)
  \p{Logical_Order_Exception: Y*}  (Short: \p{LOE=Y}, \p{LOE}) (15)
X \p{Low_Surrogates}  \p{Block=Low_Surrogates} (1024)
  \p{Lower}  \p{Lowercase=Y} (/i= Cased=Yes) (1934)
  \p{Lower: *}  \p{Lowercase: *}
  \p{Lowercase}  \p{Lower} (= \p{Lowercase=Y})
      (/i= Cased=Yes) (1934)
  \p{Lowercase: N*}  (Short: \p{Lower=N}, \P{Lower};
      /i= Cased=No) (1_112_178)
  \p{Lowercase: Y*}  (Short: \p{Lower=Y}, \p{Lower};
      /i= Cased=Yes) (1934)
  \p{Lowercase_Letter}  \p{General_Category=Lowercase_Letter}
      (Short: \p{Ll}; /i= General_Category=Cased_Letter) (1751)
  \p{Lt}  \p{Titlecase_Letter} (=
      \p{General_Category=Titlecase_Letter})
      (/i= General_Category=Cased_Letter) (31)
  \p{Lu}  \p{Uppercase_Letter} (=
      \p{General_Category=Uppercase_Letter})
      (/i= General_Category=Cased_Letter)
      (1441)
  \p{Lyci}  \p{Lycian} (= \p{Script=Lycian}) (NOT
      \p{Block=Lycian}) (29)
  \p{Lycian}  \p{Script=Lycian} (Short: \p{Lyci}; NOT
      \p{Block=Lycian}) (29)
  \p{Lydi}  \p{Lydian} (= \p{Script=Lydian}) (NOT
      \p{Block=Lydian}) (27)
  \p{Lydian}  \p{Script=Lydian} (Short: \p{Lydi}; NOT
      \p{Block=Lydian}) (27)
  \p{M}  \p{Mark} (= \p{General_Category=Mark})
      (1645)
X \p{Mahjong}  \p{Mahjong_Tiles} (=
      \p{Block=Mahjong_Tiles}) (48)
X \p{Mahjong_Tiles}  \p{Block=Mahjong_Tiles} (Short:
      \p{InMahjong}) (48)
  \p{Malayalam}  \p{Script=Malayalam} (Short: \p{Mlym}; NOT
      \p{Block=Malayalam}) (98)
  \p{Mand}  \p{Mandaic} (= \p{Script=Mandaic}) (NOT
      \p{Block=Mandaic}) (29)
  \p{Mandaic}  \p{Script=Mandaic} (Short: \p{Mand}; NOT
      \p{Block=Mandaic}) (29)
  \p{Mark}  \p{General_Category=Mark} (Short: \p{M})
      (1645)
  \p{Math}  \p{Math=Y} (2310)
  \p{Math: N*}  (Single: \P{Math}) (1_111_802)
  \p{Math: Y*}  (Single: \p{Math}) (2310)
X \p{Math_Alphanum}  \p{Mathematical_Alphanumeric_Symbols} (=
      \p{Block=Mathematical_Alphanumeric_Symbols})
      (1024)
X \p{Math_Operators}  \p{Mathematical_Operators} (=
      \p{Block=Mathematical_Operators}) (256)
  \p{Math_Symbol}  \p{General_Category=Math_Symbol} (Short:
      \p{Sm}) (952)
X \p{Mathematical_Alphanumeric_Symbols}
      \p{Block=Mathematical_Alphanumeric_Symbols}
      (Short: \p{InMathAlphanum}) (1024)
X \p{Mathematical_Operators}  \p{Block=Mathematical_Operators}
      (Short: \p{InMathOperators}) (256)
  \p{Mc}  \p{Spacing_Mark} (=
      \p{General_Category=Spacing_Mark}) (353)
  \p{Me}  \p{Enclosing_Mark} (=
      \p{General_Category=Enclosing_Mark}) (12)
  \p{Meetei_Mayek}  \p{Script=Meetei_Mayek} (Short: \p{Mtei};
      NOT \p{Block=Meetei_Mayek}) (79)
X \p{Meetei_Mayek_Ext}  \p{Meetei_Mayek_Extensions} (=
      \p{Block=Meetei_Mayek_Extensions}) (32)
X \p{Meetei_Mayek_Extensions}  \p{Block=Meetei_Mayek_Extensions}
      (Short: \p{InMeeteiMayekExt}) (32)
  \p{Merc}  \p{Meroitic_Cursive} (=
      \p{Script=Meroitic_Cursive}) (NOT
      \p{Block=Meroitic_Cursive}) (26)
  \p{Mero}  \p{Meroitic_Hieroglyphs} (=
      \p{Script=Meroitic_Hieroglyphs}) (32)
  \p{Meroitic_Cursive}  \p{Script=Meroitic_Cursive} (Short:
      \p{Merc}; NOT \p{Block=Meroitic_Cursive}) (26)
  \p{Meroitic_Hieroglyphs}  \p{Script=Meroitic_Hieroglyphs} (Short:
      \p{Mero}) (32)
  \p{Miao}  \p{Script=Miao} (NOT \p{Block=Miao}) (133)
X \p{Misc_Arrows}  \p{Miscellaneous_Symbols_And_Arrows} (=
      \p{Block=Miscellaneous_Symbols_And_Arrows}) (256)
X \p{Misc_Math_Symbols_A}  \p{Miscellaneous_Mathematical_Symbols_A}
      (= \p{Block=Miscellaneous_Mathematical_Symbols_A})
      (48)
X \p{Misc_Math_Symbols_B}  \p{Miscellaneous_Mathematical_Symbols_B}
      (= \p{Block=Miscellaneous_Mathematical_Symbols_B})
      (128)
X \p{Misc_Pictographs}  \p{Miscellaneous_Symbols_And_Pictographs}
      (= \p{Block=Miscellaneous_Symbols_And_Pictographs})
      (768)
X \p{Misc_Symbols}  \p{Miscellaneous_Symbols} (=
      \p{Block=Miscellaneous_Symbols}) (256)
X \p{Misc_Technical}  \p{Miscellaneous_Technical} (=
      \p{Block=Miscellaneous_Technical}) (256)
X \p{Miscellaneous_Mathematical_Symbols_A}
      \p{Block=Miscellaneous_Mathematical_Symbols_A}
      (Short: \p{InMiscMathSymbolsA}) (48)
X \p{Miscellaneous_Mathematical_Symbols_B}
      \p{Block=Miscellaneous_Mathematical_Symbols_B}
      (Short: \p{InMiscMathSymbolsB}) (128)
X \p{Miscellaneous_Symbols}  \p{Block=Miscellaneous_Symbols} (Short:
      \p{InMiscSymbols}) (256)
X \p{Miscellaneous_Symbols_And_Arrows}
      \p{Block=Miscellaneous_Symbols_And_Arrows}
      (Short: \p{InMiscArrows}) (256)
X \p{Miscellaneous_Symbols_And_Pictographs}
      \p{Block=Miscellaneous_Symbols_And_Pictographs}
      (Short: \p{InMiscPictographs}) (768)
X \p{Miscellaneous_Technical}  \p{Block=Miscellaneous_Technical}
      (Short: \p{InMiscTechnical}) (256)
  \p{Mlym}  \p{Malayalam} (= \p{Script=Malayalam})
      (NOT \p{Block=Malayalam}) (98)
  \p{Mn}  \p{Nonspacing_Mark} (=
      \p{General_Category=Nonspacing_Mark})
      (1280)
  \p{Modifier_Letter}  \p{General_Category=Modifier_Letter}
      (Short: \p{Lm}) (237)
X \p{Modifier_Letters}  \p{Spacing_Modifier_Letters} (=
      \p{Block=Spacing_Modifier_Letters}) (80)
  \p{Modifier_Symbol}  \p{General_Category=Modifier_Symbol}
      (Short: \p{Sk}) (115)
X \p{Modifier_Tone_Letters}  \p{Block=Modifier_Tone_Letters} (32)
  \p{Mong}  \p{Mongolian} (= \p{Script=Mongolian})
      (NOT \p{Block=Mongolian}) (153)
  \p{Mongolian}  \p{Script=Mongolian} (Short: \p{Mong}; NOT
      \p{Block=Mongolian}) (153)
  \p{Mtei}  \p{Meetei_Mayek} (=
      \p{Script=Meetei_Mayek}) (NOT
      \p{Block=Meetei_Mayek}) (79)
X \p{Music}  \p{Musical_Symbols} (=
      \p{Block=Musical_Symbols}) (256)
X \p{Musical_Symbols}  \p{Block=Musical_Symbols} (Short:
      \p{InMusic}) (256)
  \p{Myanmar}  \p{Script=Myanmar} (Short: \p{Mymr}; NOT
      \p{Block=Myanmar}) (188)
X \p{Myanmar_Ext_A}  \p{Myanmar_Extended_A} (=
      \p{Block=Myanmar_Extended_A}) (32)
X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
      \p{InMyanmarExtA}) (32)
  \p{Mymr}  \p{Myanmar} (= \p{Script=Myanmar}) (NOT
      \p{Block=Myanmar}) (188)
  \p{N}  \p{Number} (= \p{General_Category=Number})
      (1148)
X \p{NB}  \p{No_Block} (= \p{Block=No_Block})
      (860_672)
  \p{NChar}  \p{Noncharacter_Code_Point} (=
      \p{Noncharacter_Code_Point=Y}) (66)
  \p{NChar: *}  \p{Noncharacter_Code_Point: *}
  \p{Nd}  \p{Digit} (=
      \p{General_Category=Decimal_Number}) (460)
  \p{New_Tai_Lue}  \p{Script=New_Tai_Lue} (Short: \p{Talu};
      NOT \p{Block=New_Tai_Lue}) (83)
  \p{NFC_QC: *}  \p{NFC_Quick_Check: *}
  \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (104)
  \p{NFC_Quick_Check: Maybe}  (Short: \p{NFCQC=M}) (104)
  \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
      \P{NFC_Quick_Check} NOR \P{NFC_QC})
      (1120)
  \p{NFC_Quick_Check: No}  (Short: \p{NFCQC=N}; NOT
      \P{NFC_Quick_Check} NOR \P{NFC_QC})
      (1120)
  \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
      \p{NFC_Quick_Check} NOR \p{NFC_QC})
      (1_112_888)
  \p{NFC_Quick_Check: Yes}  (Short: \p{NFCQC=Y}; NOT
      \p{NFC_Quick_Check} NOR \p{NFC_QC})
      (1_112_888)
  \p{NFD_QC: *}  \p{NFD_Quick_Check: *}
  \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
      \P{NFD_Quick_Check} NOR \P{NFD_QC})
      (13_225)
  \p{NFD_Quick_Check: No}  (Short: \p{NFDQC=N}; NOT
      \P{NFD_Quick_Check} NOR \P{NFD_QC})
      (13_225)
  \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
      \p{NFD_Quick_Check} NOR \p{NFD_QC})
      (1_100_887)
  \p{NFD_Quick_Check: Yes}  (Short: \p{NFDQC=Y}; NOT
      \p{NFD_Quick_Check} NOR \p{NFD_QC})
      (1_100_887)
  \p{NFKC_QC: *}  \p{NFKC_Quick_Check: *}
  \p{NFKC_Quick_Check: M}  \p{NFKC_Quick_Check=Maybe} (104)
  \p{NFKC_Quick_Check: Maybe}  (Short: \p{NFKCQC=M}) (104)
  \p{NFKC_Quick_Check: N}  \p{NFKC_Quick_Check=No} (NOT
      \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
      (4787)
  \p{NFKC_Quick_Check: No}  (Short: \p{NFKCQC=N}; NOT
      \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
      (4787)
  \p{NFKC_Quick_Check: Y}  \p{NFKC_Quick_Check=Yes} (NOT
      \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
      (1_109_221)
  \p{NFKC_Quick_Check: Yes}  (Short: \p{NFKCQC=Y}; NOT
      \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
      (1_109_221)
  \p{NFKD_QC: *}  \p{NFKD_Quick_Check: *}
  \p{NFKD_Quick_Check: N}  \p{NFKD_Quick_Check=No} (NOT
      \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
      (16_880)
  \p{NFKD_Quick_Check: No}  (Short: \p{NFKDQC=N}; NOT
      \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
      (16_880)
  \p{NFKD_Quick_Check: Y}  \p{NFKD_Quick_Check=Yes} (NOT
      \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
      (1_097_232)
  \p{NFKD_Quick_Check: Yes}  (Short: \p{NFKDQC=Y}; NOT
      \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
      (1_097_232)
  \p{Nko}  \p{Script=Nko} (NOT \p{NKo}) (59)
  \p{Nkoo}  \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
      (59)
  \p{Nl}  \p{Letter_Number} (=
      \p{General_Category=Letter_Number}) (224)
  \p{No}  \p{Other_Number} (=
      \p{General_Category=Other_Number}) (464)
X \p{No_Block}  \p{Block=No_Block} (Short: \p{InNB})
      (860_672)
  \p{Noncharacter_Code_Point}  \p{Noncharacter_Code_Point=Y} (Short:
      \p{NChar}) (66)
  \p{Noncharacter_Code_Point: N*}  (Short: \p{NChar=N}, \P{NChar})
      (1_114_046)
  \p{Noncharacter_Code_Point: Y*}  (Short: \p{NChar=Y}, \p{NChar})
      (66)
  \p{Nonspacing_Mark}  \p{General_Category=Nonspacing_Mark}
      (Short: \p{Mn}) (1280)
  \p{Nt: *}  \p{Numeric_Type: *}
  \p{Number}  \p{General_Category=Number} (Short: \p{N})
      (1148)
X \p{Number_Forms}  \p{Block=Number_Forms} (64)
  \p{Numeric_Type: De}  \p{Numeric_Type=Decimal} (460)
  \p{Numeric_Type: Decimal}  (Short: \p{Nt=De}) (460)
  \p{Numeric_Type: Di}  \p{Numeric_Type=Digit} (128)
  \p{Numeric_Type: Digit}  (Short: \p{Nt=Di}) (128)
  \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_883)
  \p{Numeric_Type: Nu}  \p{Numeric_Type=Numeric} (641)
  \p{Numeric_Type: Numeric}  (Short: \p{Nt=Nu}) (641)
T \p{Numeric_Value: -1}  (Short: \p{Nv=-1}) (2)
T \p{Numeric_Value: -1/2}  (Short: \p{Nv=-1/2}) (1)
T \p{Numeric_Value: 0}  (Short: \p{Nv=0}) (60)
T \p{Numeric_Value: 1/16}  (Short: \p{Nv=1/16}) (3)
T \p{Numeric_Value: 1/10}  (Short: \p{Nv=1/10}) (1)
T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1)
T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (5)
T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1)
T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (2)
T \p{Numeric_Value: 3/16}  (Short: \p{Nv=3/16}) (3)
T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (1)
T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (9)
T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (4)
T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1)
T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1)
T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (10)
T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1)
T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1)
T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (5)
T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (6)
T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1)
T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (2)
T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1)
T \p{Numeric_Value: 1}  (Short: \p{Nv=1}) (97)
T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1)
T \p{Numeric_Value: 2}  (Short: \p{Nv=2}) (100)
T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1)
T \p{Numeric_Value: 3}  (Short: \p{Nv=3}) (102)
  2416. T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
  2417. T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (93)
  2418. T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
  2419. T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (90)
  2420. T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
  2421. T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (82)
  2422. T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
  2423. T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (81)
  2424. T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
  2425. T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (77)
  2426. T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
  2427. T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (81)
  2428. T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (40)
  2429. T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
  2430. T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
  2431. T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
  2432. T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
  2433. T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
  2434. T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
  2435. T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
  2436. T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
  2437. T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
  2438. T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (19)
  2439. T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
  2440. T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
  2441. T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
  2442. T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
  2443. T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
  2444. T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
  2445. T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
  2446. T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
  2447. T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
  2448. T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (11)
  2449. T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
  2450. T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
  2451. T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
  2452. T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
  2453. T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
  2454. T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
  2455. T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
  2456. T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
  2457. T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
  2458. T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (10)
  2459. T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
  2460. T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
  2461. T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
  2462. T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
  2463. T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
  2464. T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
  2465. T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
  2466. T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
  2467. T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
  2468. T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (20)
  2469. T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (6)
  2470. T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (6)
  2471. T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (6)
  2472. T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (6)
  2473. T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (20)
  2474. T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (2)
  2475. T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (3)
  2476. T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (2)
  2477. T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (12)
  2478. T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (2)
  2479. T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (2)
  2480. T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (2)
  2481. T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (3)
  2482. T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (17)
  2483. T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
  2484. T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
  2485. T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
  2486. T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
  2487. T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
  2488. T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
  2489. T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
  2490. T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
  2491. T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
  2492. T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
  2493. T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
  2494. T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
  2495. T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
  2496. T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
  2497. T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
  2498. T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
  2499. T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
  2500. T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
  2501. T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
  2502. T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
  2503. T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
  2504. (2)
  2505. T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
  2506. 1000000000000}) (1)
  2507. \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_883)
  2508. \p{Nv: *} \p{Numeric_Value: *}
  2509. X \p{OCR} \p{Optical_Character_Recognition} (=
  2510. \p{Block=Optical_Character_Recognition})
  2511. (32)
  2512. \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
  2513. \p{Block=Ogham}) (29)
  2514. \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
  2515. \p{Block=Ogham}) (29)
  2516. \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
  2517. \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
  2518. \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
  2519. NOT \p{Block=Old_Italic}) (35)
  2520. \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
  2521. NOT \p{Block=Old_Persian}) (50)
  2522. \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
  2523. \p{Sarb}) (32)
  2524. \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
  2525. NOT \p{Block=Old_Turkic}) (73)
  2526. \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
  2527. (Short: \p{Ps}) (72)
  2528. X \p{Optical_Character_Recognition} \p{Block=
  2529. Optical_Character_Recognition} (Short:
  2530. \p{InOCR}) (32)
  2531. \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
  2532. \p{Block=Oriya}) (90)
  2533. \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
  2534. (NOT \p{Block=Old_Turkic}) (73)
  2535. \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
  2536. \p{Block=Oriya}) (90)
  2537. \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
  2538. \p{Block=Osmanya}) (40)
  2539. \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
  2540. \p{Block=Osmanya}) (40)
  2541. \p{Other} \p{General_Category=Other} (Short: \p{C})
  2542. (1_004_134)
  2543. \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
  2544. \p{Lo}) (97_553)
  2545. \p{Other_Number} \p{General_Category=Other_Number} (Short:
  2546. \p{No}) (464)
  2547. \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
  2548. (Short: \p{Po}) (434)
  2549. \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
  2550. \p{So}) (4404)
  2551. \p{P} \p{Punct} (= \p{General_Category=
  2552. Punctuation}) (NOT
  2553. \p{General_Punctuation}) (632)
  2554. \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
  2555. (Short: \p{Zp}) (1)
  2556. \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
  2557. Y}) (2760)
  2558. \p{Pat_Syn: *} \p{Pattern_Syntax: *}
  2559. \p{Pat_WS} \p{Pattern_White_Space} (=
  2560. \p{Pattern_White_Space=Y}) (11)
  2561. \p{Pat_WS: *} \p{Pattern_White_Space: *}
  2562. \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
  2563. (2760)
  2564. \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
  2565. (1_111_352)
  2566. \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
  2567. \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
  2568. \p{PatWS}) (11)
  2569. \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
  2570. (1_114_101)
  2571. \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
  2572. \p{Pc} \p{Connector_Punctuation} (=
  2573. \p{General_Category=
  2574. Connector_Punctuation}) (10)
  2575. \p{Pd} \p{Dash_Punctuation} (=
  2576. \p{General_Category=Dash_Punctuation})
  2577. (23)
  2578. \p{Pe} \p{Close_Punctuation} (=
  2579. \p{General_Category=Close_Punctuation})
  2580. (71)
  2581. \p{PerlSpace} \s, restricted to ASCII = [ \f\n\r\t] plus
  2582. vertical tab (6)
  2583. \p{PerlWord} \w, restricted to ASCII = [A-Za-z0-9_] (63)
  2584. \p{Pf} \p{Final_Punctuation} (=
  2585. \p{General_Category=Final_Punctuation})
  2586. (10)
  2587. \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
  2588. \p{Block=Phags_Pa}) (56)
  2589. \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
  2590. \p{Block=Phags_Pa}) (56)
  2591. X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
  2592. Phaistos_Disc}) (48)
  2593. X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
  2594. \p{InPhaistos}) (48)
  2595. \p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script=
  2596. Inscriptional_Pahlavi}) (NOT \p{Block=
  2597. Inscriptional_Pahlavi}) (27)
  2598. \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
  2599. (NOT \p{Block=Phoenician}) (29)
  2600. \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
  2601. NOT \p{Block=Phoenician}) (29)
  2602. X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
  2603. Phonetic_Extensions}) (128)
  2604. X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
  2605. \p{Block=
  2606. Phonetic_Extensions_Supplement}) (64)
  2607. X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
  2608. \p{InPhoneticExt}) (128)
  2609. X \p{Phonetic_Extensions_Supplement} \p{Block=
  2610. Phonetic_Extensions_Supplement} (Short:
  2611. \p{InPhoneticExtSup}) (64)
  2612. \p{Pi} \p{Initial_Punctuation} (=
  2613. \p{General_Category=
  2614. Initial_Punctuation}) (12)
  2615. X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
  2616. \p{Plrd} \p{Miao} (= \p{Script=Miao}) (NOT
  2617. \p{Block=Miao}) (133)
  2618. \p{Po} \p{Other_Punctuation} (=
  2619. \p{General_Category=Other_Punctuation})
  2620. (434)
  2621. \p{PosixAlnum} [A-Za-z0-9] (62)
  2622. \p{PosixAlpha} [A-Za-z] (52)
  2623. \p{PosixBlank} \t and ' ' (2)
  2624. \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
  2625. ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
  2626. FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
  2627. NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
  2628. GS, RS, US, and DEL (33)
  2629. \p{PosixDigit} [0-9] (10)
  2630. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-
  2631. z] (94)
  2632. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  2633. \p{PosixPrint} [- 0-9A-Za-
  2634. z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (95)
  2635. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  2636. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  2637. vertical tab) (6)
  2638. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  2639. \p{PosixWord} \p{PerlWord} (63)
  2640. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
  2641. \p{AHex}) (22)
  2642. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  2643. extension) (33_979)
  2644. T \p{Present_In: 2.0} Code point's usage introduced in version
  2645. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  2646. extension) (178_500)
  2647. T \p{Present_In: 2.1} Code point's usage introduced in version
  2648. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  2649. extension) (178_502)
  2650. T \p{Present_In: 3.0} Code point's usage introduced in version
  2651. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  2652. extension) (188_809)
  2653. T \p{Present_In: 3.1} Code point's usage introduced in version
  2654. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  2655. extension) (233_787)
  2656. T \p{Present_In: 3.2} Code point's usage introduced in version
  2657. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  2658. extension) (234_803)
  2659. T \p{Present_In: 4.0} Code point's usage introduced in version
  2660. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  2661. extension) (236_029)
  2662. T \p{Present_In: 4.1} Code point's usage introduced in version
  2663. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  2664. extension) (237_302)
  2665. T \p{Present_In: 5.0} Code point's usage introduced in version
  2666. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  2667. extension) (238_671)
  2668. T \p{Present_In: 5.1} Code point's usage introduced in version
  2669. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  2670. extension) (240_295)
  2671. T \p{Present_In: 5.2} Code point's usage introduced in version
  2672. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  2673. extension) (246_943)
  2674. T \p{Present_In: 6.0} Code point's usage introduced in version
  2675. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  2676. extension) (249_031)
  2677. T \p{Present_In: 6.1} Code point's usage introduced in version
  2678. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  2679. extension) (249_763)
  2680. T \p{Present_In: 6.2} Code point's usage introduced in version
  2681. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  2682. extension) (249_764)
  2683. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  2684. Unassigned}) (Perl extension) (864_348)
  2685. \p{Print} Characters that are graphical plus space
  2686. characters (but no controls) (247_583)
  2687. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  2688. \p{Co}; NOT \p{Private_Use_Area})
  2689. (137_468)
  2690. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  2691. \p{InPUA}) (6400)
  2692. \p{Prti} \p{Inscriptional_Parthian} (= \p{Script=
  2693. Inscriptional_Parthian}) (NOT \p{Block=
  2694. Inscriptional_Parthian}) (30)
  2695. \p{Ps} \p{Open_Punctuation} (=
  2696. \p{General_Category=Open_Punctuation})
  2697. (72)
  2698. X \p{PUA} \p{Private_Use_Area} (= \p{Block=
  2699. Private_Use_Area}) (6400)
  2700. \p{Punct} \p{General_Category=Punctuation} (Short:
  2701. \p{P}; NOT \p{General_Punctuation}) (632)
  2702. \p{Punctuation} \p{Punct} (= \p{General_Category=
  2703. Punctuation}) (NOT
  2704. \p{General_Punctuation}) (632)
  2705. \p{Qaac} \p{Coptic} (= \p{Script=Coptic}) (NOT
  2706. \p{Block=Coptic}) (137)
  2707. \p{Qaai} \p{Inherited} (= \p{Script=Inherited})
  2708. (523)
  2709. \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
  2710. Y}) (29)
  2711. \p{QMark: *} \p{Quotation_Mark: *}
  2712. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  2713. (29)
  2714. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_083)
  2715. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (29)
  2716. \p{Radical} \p{Radical=Y} (329)
  2717. \p{Radical: N*} (Single: \P{Radical}) (1_113_783)
  2718. \p{Radical: Y*} (Single: \p{Radical}) (329)
  2719. \p{Rejang} \p{Script=Rejang} (Short: \p{Rjng}; NOT
  2720. \p{Block=Rejang}) (37)
  2721. \p{Rjng} \p{Rejang} (= \p{Script=Rejang}) (NOT
  2722. \p{Block=Rejang}) (37)
  2723. X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
  2724. Rumi_Numeral_Symbols}) (32)
  2725. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
  2726. \p{InRumi}) (32)
  2727. \p{Runic} \p{Script=Runic} (Short: \p{Runr}; NOT
  2728. \p{Block=Runic}) (78)
  2729. \p{Runr} \p{Runic} (= \p{Script=Runic}) (NOT
  2730. \p{Block=Runic}) (78)
  2731. \p{S} \p{Symbol} (= \p{General_Category=Symbol})
  2732. (5520)
  2733. \p{Samaritan} \p{Script=Samaritan} (Short: \p{Samr}; NOT
  2734. \p{Block=Samaritan}) (61)
  2735. \p{Samr} \p{Samaritan} (= \p{Script=Samaritan})
  2736. (NOT \p{Block=Samaritan}) (61)
  2737. \p{Sarb} \p{Old_South_Arabian} (= \p{Script=
  2738. Old_South_Arabian}) (32)
  2739. \p{Saur} \p{Saurashtra} (= \p{Script=Saurashtra})
  2740. (NOT \p{Block=Saurashtra}) (81)
  2741. \p{Saurashtra} \p{Script=Saurashtra} (Short: \p{Saur};
  2742. NOT \p{Block=Saurashtra}) (81)
  2743. \p{SB: *} \p{Sentence_Break: *}
  2744. \p{Sc} \p{Currency_Symbol} (=
  2745. \p{General_Category=Currency_Symbol})
  2746. (49)
  2747. \p{Sc: *} \p{Script: *}
  2748. \p{Script: Arab} \p{Script=Arabic} (1235)
  2749. \p{Script: Arabic} (Short: \p{Sc=Arab}, \p{Arab}) (1235)
  2750. \p{Script: Armenian} (Short: \p{Sc=Armn}, \p{Armn}) (91)
  2751. \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
  2752. \p{Script: Armn} \p{Script=Armenian} (91)
  2753. \p{Script: Avestan} (Short: \p{Sc=Avst}, \p{Avst}) (61)
  2754. \p{Script: Avst} \p{Script=Avestan} (61)
  2755. \p{Script: Bali} \p{Script=Balinese} (121)
  2756. \p{Script: Balinese} (Short: \p{Sc=Bali}, \p{Bali}) (121)
  2757. \p{Script: Bamu} \p{Script=Bamum} (657)
  2758. \p{Script: Bamum} (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
  2759. \p{Script: Batak} (Short: \p{Sc=Batk}, \p{Batk}) (56)
  2760. \p{Script: Batk} \p{Script=Batak} (56)
  2761. \p{Script: Beng} \p{Script=Bengali} (92)
  2762. \p{Script: Bengali} (Short: \p{Sc=Beng}, \p{Beng}) (92)
  2763. \p{Script: Bopo} \p{Script=Bopomofo} (70)
  2764. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
  2765. \p{Script: Brah} \p{Script=Brahmi} (108)
  2766. \p{Script: Brahmi} (Short: \p{Sc=Brah}, \p{Brah}) (108)
  2767. \p{Script: Brai} \p{Script=Braille} (256)
  2768. \p{Script: Braille} (Short: \p{Sc=Brai}, \p{Brai}) (256)
  2769. \p{Script: Bugi} \p{Script=Buginese} (30)
  2770. \p{Script: Buginese} (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
  2771. \p{Script: Buhd} \p{Script=Buhid} (20)
  2772. \p{Script: Buhid} (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
  2773. \p{Script: Cakm} \p{Script=Chakma} (67)
  2774. \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
  2775. (710)
  2776. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
  2777. \p{Script: Cari} \p{Script=Carian} (49)
  2778. \p{Script: Carian} (Short: \p{Sc=Cari}, \p{Cari}) (49)
  2779. \p{Script: Chakma} (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
  2780. \p{Script: Cham} (Short: \p{Sc=Cham}, \p{Cham}) (83)
  2781. \p{Script: Cher} \p{Script=Cherokee} (85)
  2782. \p{Script: Cherokee} (Short: \p{Sc=Cher}, \p{Cher}) (85)
  2783. \p{Script: Common} (Short: \p{Sc=Zyyy}, \p{Zyyy}) (6413)
  2784. \p{Script: Copt} \p{Script=Coptic} (137)
  2785. \p{Script: Coptic} (Short: \p{Sc=Copt}, \p{Copt}) (137)
  2786. \p{Script: Cprt} \p{Script=Cypriot} (55)
  2787. \p{Script: Cuneiform} (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
  2788. \p{Script: Cypriot} (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
  2789. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}, \p{Cyrl}) (417)
  2790. \p{Script: Cyrl} \p{Script=Cyrillic} (417)
  2791. \p{Script: Deseret} (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
  2792. \p{Script: Deva} \p{Script=Devanagari} (151)
  2793. \p{Script: Devanagari} (Short: \p{Sc=Deva}, \p{Deva}) (151)
  2794. \p{Script: Dsrt} \p{Script=Deseret} (80)
  2795. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
  2796. \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
  2797. (1071)
  2798. \p{Script: Ethi} \p{Script=Ethiopic} (495)
  2799. \p{Script: Ethiopic} (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
  2800. \p{Script: Geor} \p{Script=Georgian} (127)
  2801. \p{Script: Georgian} (Short: \p{Sc=Geor}, \p{Geor}) (127)
  2802. \p{Script: Glag} \p{Script=Glagolitic} (94)
  2803. \p{Script: Glagolitic} (Short: \p{Sc=Glag}, \p{Glag}) (94)
  2804. \p{Script: Goth} \p{Script=Gothic} (27)
  2805. \p{Script: Gothic} (Short: \p{Sc=Goth}, \p{Goth}) (27)
  2806. \p{Script: Greek} (Short: \p{Sc=Grek}, \p{Grek}) (511)
  2807. \p{Script: Grek} \p{Script=Greek} (511)
  2808. \p{Script: Gujarati} (Short: \p{Sc=Gujr}, \p{Gujr}) (84)
  2809. \p{Script: Gujr} \p{Script=Gujarati} (84)
  2810. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}, \p{Guru}) (79)
  2811. \p{Script: Guru} \p{Script=Gurmukhi} (79)
  2812. \p{Script: Han} (Short: \p{Sc=Han}, \p{Han}) (75_963)
  2813. \p{Script: Hang} \p{Script=Hangul} (11_739)
  2814. \p{Script: Hangul} (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
  2815. \p{Script: Hani} \p{Script=Han} (75_963)
  2816. \p{Script: Hano} \p{Script=Hanunoo} (21)
  2817. \p{Script: Hanunoo} (Short: \p{Sc=Hano}, \p{Hano}) (21)
  2818. \p{Script: Hebr} \p{Script=Hebrew} (133)
  2819. \p{Script: Hebrew} (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
  2820. \p{Script: Hira} \p{Script=Hiragana} (91)
  2821. \p{Script: Hiragana} (Short: \p{Sc=Hira}, \p{Hira}) (91)
  2822. \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
  2823. \p{Script: Inherited} (Short: \p{Sc=Zinh}, \p{Zinh}) (523)
  2824. \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
  2825. (27)
  2826. \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
  2827. (30)
  2828. \p{Script: Ital} \p{Script=Old_Italic} (35)
  2829. \p{Script: Java} \p{Script=Javanese} (91)
  2830. \p{Script: Javanese} (Short: \p{Sc=Java}, \p{Java}) (91)
  2831. \p{Script: Kaithi} (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
  2832. \p{Script: Kali} \p{Script=Kayah_Li} (48)
  2833. \p{Script: Kana} \p{Script=Katakana} (300)
  2834. \p{Script: Kannada} (Short: \p{Sc=Knda}, \p{Knda}) (86)
  2835. \p{Script: Katakana} (Short: \p{Sc=Kana}, \p{Kana}) (300)
  2836. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}, \p{Kali}) (48)
  2837. \p{Script: Khar} \p{Script=Kharoshthi} (65)
  2838. \p{Script: Kharoshthi} (Short: \p{Sc=Khar}, \p{Khar}) (65)
  2839. \p{Script: Khmer} (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
  2840. \p{Script: Khmr} \p{Script=Khmer} (146)
  2841. \p{Script: Knda} \p{Script=Kannada} (86)
  2842. \p{Script: Kthi} \p{Script=Kaithi} (66)
  2843. \p{Script: Lana} \p{Script=Tai_Tham} (127)
  2844. \p{Script: Lao} (Short: \p{Sc=Lao}, \p{Lao}) (67)
  2845. \p{Script: Laoo} \p{Script=Lao} (67)
  2846. \p{Script: Latin} (Short: \p{Sc=Latn}, \p{Latn}) (1272)
  2847. \p{Script: Latn} \p{Script=Latin} (1272)
  2848. \p{Script: Lepc} \p{Script=Lepcha} (74)
  2849. \p{Script: Lepcha} (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
  2850. \p{Script: Limb} \p{Script=Limbu} (66)
  2851. \p{Script: Limbu} (Short: \p{Sc=Limb}, \p{Limb}) (66)
  2852. \p{Script: Linb} \p{Script=Linear_B} (211)
  2853. \p{Script: Linear_B} (Short: \p{Sc=Linb}, \p{Linb}) (211)
  2854. \p{Script: Lisu} (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
  2855. \p{Script: Lyci} \p{Script=Lycian} (29)
  2856. \p{Script: Lycian} (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
  2857. \p{Script: Lydi} \p{Script=Lydian} (27)
  2858. \p{Script: Lydian} (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
  2859. \p{Script: Malayalam} (Short: \p{Sc=Mlym}, \p{Mlym}) (98)
  2860. \p{Script: Mand} \p{Script=Mandaic} (29)
  2861. \p{Script: Mandaic} (Short: \p{Sc=Mand}, \p{Mand}) (29)
  2862. \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
  2863. \p{Script: Merc} \p{Script=Meroitic_Cursive} (26)
  2864. \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
  2865. \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}, \p{Merc}) (26)
  2866. \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}, \p{Mero})
  2867. (32)
  2868. \p{Script: Miao} (Short: \p{Sc=Miao}, \p{Miao}) (133)
  2869. \p{Script: Mlym} \p{Script=Malayalam} (98)
  2870. \p{Script: Mong} \p{Script=Mongolian} (153)
  2871. \p{Script: Mongolian} (Short: \p{Sc=Mong}, \p{Mong}) (153)
  2872. \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
  2873. \p{Script: Myanmar} (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
  2874. \p{Script: Mymr} \p{Script=Myanmar} (188)
  2875. \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
  2876. \p{Script: Nko} (Short: \p{Sc=Nko}, \p{Nko}) (59)
  2877. \p{Script: Nkoo} \p{Script=Nko} (59)
  2878. \p{Script: Ogam} \p{Script=Ogham} (29)
  2879. \p{Script: Ogham} (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
  2880. \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}, \p{Olck}) (48)
  2881. \p{Script: Olck} \p{Script=Ol_Chiki} (48)
  2882. \p{Script: Old_Italic} (Short: \p{Sc=Ital}, \p{Ital}) (35)
  2883. \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
  2884. \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
  2885. \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
  2886. \p{Script: Oriya} (Short: \p{Sc=Orya}, \p{Orya}) (90)
  2887. \p{Script: Orkh} \p{Script=Old_Turkic} (73)
  2888. \p{Script: Orya} \p{Script=Oriya} (90)
  2889. \p{Script: Osma} \p{Script=Osmanya} (40)
  2890. \p{Script: Osmanya} (Short: \p{Sc=Osma}, \p{Osma}) (40)
  2891. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  2892. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}, \p{Phag}) (56)
  2893. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
  2894. \p{Script: Phnx} \p{Script=Phoenician} (29)
  2895. \p{Script: Phoenician} (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
  2896. \p{Script: Plrd} \p{Script=Miao} (133)
  2897. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
  2898. \p{Script: Qaac} \p{Script=Coptic} (137)
  2899. \p{Script: Qaai} \p{Script=Inherited} (523)
  2900. \p{Script: Rejang} (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
  2901. \p{Script: Rjng} \p{Script=Rejang} (37)
  2902. \p{Script: Runic} (Short: \p{Sc=Runr}, \p{Runr}) (78)
  2903. \p{Script: Runr} \p{Script=Runic} (78)
  2904. \p{Script: Samaritan} (Short: \p{Sc=Samr}, \p{Samr}) (61)
  2905. \p{Script: Samr} \p{Script=Samaritan} (61)
  2906. \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
  2907. \p{Script: Saur} \p{Script=Saurashtra} (81)
  2908. \p{Script: Saurashtra} (Short: \p{Sc=Saur}, \p{Saur}) (81)
  2909. \p{Script: Sharada} (Short: \p{Sc=Shrd}, \p{Shrd}) (83)
  2910. \p{Script: Shavian} (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
  2911. \p{Script: Shaw} \p{Script=Shavian} (48)
  2912. \p{Script: Shrd} \p{Script=Sharada} (83)
  2913. \p{Script: Sinh} \p{Script=Sinhala} (80)
  2914. \p{Script: Sinhala} (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
  2915. \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
  2916. \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}, \p{Sora}) (35)
  2917. \p{Script: Sund} \p{Script=Sundanese} (72)
  2918. \p{Script: Sundanese} (Short: \p{Sc=Sund}, \p{Sund}) (72)
  2919. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  2920. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
  2921. \p{Script: Syrc} \p{Script=Syriac} (77)
  2922. \p{Script: Syriac} (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
  2923. \p{Script: Tagalog} (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
  2924. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  2925. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
  2926. \p{Script: Tai_Le} (Short: \p{Sc=Tale}, \p{Tale}) (35)
  2927. \p{Script: Tai_Tham} (Short: \p{Sc=Lana}, \p{Lana}) (127)
  2928. \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
  2929. \p{Script: Takr} \p{Script=Takri} (66)
  2930. \p{Script: Takri} (Short: \p{Sc=Takr}, \p{Takr}) (66)
  2931. \p{Script: Tale} \p{Script=Tai_Le} (35)
  2932. \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
  2933. \p{Script: Tamil} (Short: \p{Sc=Taml}, \p{Taml}) (72)
  2934. \p{Script: Taml} \p{Script=Tamil} (72)
  2935. \p{Script: Tavt} \p{Script=Tai_Viet} (72)
  2936. \p{Script: Telu} \p{Script=Telugu} (93)
  2937. \p{Script: Telugu} (Short: \p{Sc=Telu}, \p{Telu}) (93)
  2938. \p{Script: Tfng} \p{Script=Tifinagh} (59)
  2939. \p{Script: Tglg} \p{Script=Tagalog} (20)
  2940. \p{Script: Thaa} \p{Script=Thaana} (50)
  2941. \p{Script: Thaana} (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
  2942. \p{Script: Thai} (Short: \p{Sc=Thai}, \p{Thai}) (86)
  2943. \p{Script: Tibetan} (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
  2944. \p{Script: Tibt} \p{Script=Tibetan} (207)
  2945. \p{Script: Tifinagh} (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
  2946. \p{Script: Ugar} \p{Script=Ugaritic} (31)
  2947. \p{Script: Ugaritic} (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
  2948. \p{Script: Unknown} (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_003_930)
  2949. \p{Script: Vai} (Short: \p{Sc=Vai}, \p{Vai}) (300)
  2950. \p{Script: Vaii} \p{Script=Vai} (300)
  2951. \p{Script: Xpeo} \p{Script=Old_Persian} (50)
  2952. \p{Script: Xsux} \p{Script=Cuneiform} (982)
  2953. \p{Script: Yi} (Short: \p{Sc=Yi}, \p{Yi}) (1220)
  2954. \p{Script: Yiii} \p{Script=Yi} (1220)
  2955. \p{Script: Zinh} \p{Script=Inherited} (523)
  2956. \p{Script: Zyyy} \p{Script=Common} (6413)
  2957. \p{Script: Zzzz} \p{Script=Unknown} (1_003_930)
  2958. \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1262)
  2959. \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}) (1262)
  2960. \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}) (92)
  2961. \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
  2962. (31)
  2963. \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (92)
  2964. \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}) (61)
  2965. \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  2966. \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  2967. \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}) (121)
  2968. \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  2969. \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}) (657)
  2970. \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}) (56)
  2971. \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  2972. \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (94)
  2973. \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}) (94)
  2974. \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (306)
  2975. \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}) (306)
  2976. \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (108)
  2977. \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}) (108)
  2978. \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  2979. \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}) (256)
  2980. \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (30)
  2981. \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}) (30)
  2982. \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  2983. \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}) (22)
  2984. \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (67)
  2985. \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans})
  2986. (710)
  2987. \p{Script_Extensions: Cans} \p{Script_Extensions=
  2988. Canadian_Aboriginal} (710)
  2989. \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  2990. \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}) (49)
  2991. \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}) (67)
  2992. \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}) (83)
  2993. \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (85)
  2994. \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}) (85)
  2995. \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}) (6057)
  2996. \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (137)
  2997. \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}) (137)
  2998. \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  2999. \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}) (982)
  3000. \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}) (112)
  3001. \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}) (419)
  3002. \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (419)
  3003. \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}) (80)
  3004. \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (193)
  3005. \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}) (193)
  3006. \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  3007. \p{Script_Extensions: Egyp} \p{Script_Extensions=
  3008. Egyptian_Hieroglyphs} (1071)
  3009. \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp})
  3010. (1071)
  3011. \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  3012. \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}) (495)
  3013. \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (128)
  3014. \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}) (128)
  3015. \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (94)
  3016. \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}) (94)
  3017. \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  3018. \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}) (27)
  3019. \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}) (515)
  3020. \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (515)
  3021. \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}) (94)
  3022. \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (94)
  3023. \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}) (91)
  3024. \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (91)
  3025. \p{Script_Extensions: Han} (Short: \p{Scx=Han}) (76_218)
  3026. \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_971)
  3027. \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}) (11_971)
  3028. \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (76_218)
  3029. \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  3030. \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}) (23)
  3031. \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
  3032. \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}) (133)
  3033. \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (356)
  3034. \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}) (356)
  3035. \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}) (31)
  3036. \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}) (459)
  3037. \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli})
  3038. (27)
  3039. \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
  3040. Prti}) (30)
  3041. \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (35)
  3042. \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  3043. \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}) (91)
  3044. \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}) (76)
  3045. \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  3046. \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (565)
  3047. \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}) (86)
  3048. \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}) (565)
  3049. \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}) (48)
  3050. \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
  3051. \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}) (65)
  3052. \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}) (146)
  3053. \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  3054. \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (86)
  3055. \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (76)
  3056. \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  3057. \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}) (67)
  3058. \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
  3059. \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}) (1289)
  3060. \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1289)
  3061. \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  3062. \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}) (74)
  3063. \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (66)
  3064. \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}) (66)
  3065. \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  3066. \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}) (268)
  3067. \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}) (48)
  3068. \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  3069. \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}) (29)
  3070. \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  3071. \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}) (27)
  3072. \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}) (98)
  3073. \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  3074. \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}) (30)
  3075. \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}) (79)
  3076. \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
  3077. (26)
  3078. \p{Script_Extensions: Mero} \p{Script_Extensions=
  3079. Meroitic_Hieroglyphs} (32)
  3080. \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}) (26)
  3081. \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero})
  3082. (32)
  3083. \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}) (133)
  3084. \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (98)
  3085. \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (156)
  3086. \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}) (156)
  3087. \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  3088. \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}) (188)
  3089. \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (188)
  3090. \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}) (83)
  3091. \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}) (59)
  3092. \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
  3093. \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  3094. \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}) (29)
  3095. \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}) (48)
  3096. \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  3097. \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}) (35)
  3098. \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}) (50)
  3099. \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}) (32)
  3100. \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}) (73)
  3101. \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}) (92)
  3102. \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  3103. \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (92)
  3104. \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  3105. \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}) (40)
  3106. \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  3107. \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}) (59)
  3108. \p{Script_Extensions: Phli} \p{Script_Extensions=
  3109. Inscriptional_Pahlavi} (27)
  3110. \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  3111. \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}) (29)
  3112. \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
  3113. \p{Script_Extensions: Prti} \p{Script_Extensions=
  3114. Inscriptional_Parthian} (30)
  3115. \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (137)
  3116. \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (459)
  3117. \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}) (37)
  3118. \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  3119. \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}) (78)
  3120. \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (78)
  3121. \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}) (61)
  3122. \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  3123. \p{Script_Extensions: Sarb} \p{Script_Extensions=
  3124. Old_South_Arabian} (32)
  3125. \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (81)
  3126. \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}) (81)
  3127. \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}) (83)
  3128. \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}) (48)
  3129. \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  3130. \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (83)
  3131. \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (80)
  3132. \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}) (80)
  3133. \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  3134. \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}) (35)
  3135. \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  3136. \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}) (72)
  3137. \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (44)
  3138. \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}) (44)
  3139. \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
  3140. \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}) (93)
  3141. \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}) (22)
  3142. \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  3143. \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}) (20)
  3144. \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}) (35)
  3145. \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}) (127)
  3146. \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}) (72)
  3147. \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
  3148. \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}) (78)
  3149. \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (35)
  3150. \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  3151. \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}) (72)
  3152. \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (72)
  3153. \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  3154. \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (93)
  3155. \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}) (93)
  3156. \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  3157. \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  3158. \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
  3159. \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}) (65)
  3160. \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}) (86)
  3161. \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}) (207)
  3162. \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  3163. \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}) (59)
  3164. \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  3165. \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}) (31)
  3166. \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}) (1_003_930)
  3167. \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}) (300)
  3168. \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  3169. \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  3170. \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (982)
  3171. \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}) (1246)
  3172. \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  3173. \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (459)
  3174. \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6057)
  3175. \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown}
  3176. (1_003_930)
  3177. \p{Scx: *} \p{Script_Extensions: *}
  3178. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  3179. \p{SD: *} \p{Soft_Dotted: *}
  3180. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  3181. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  3182. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (177)
  3183. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (177)
  3184. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  3185. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1649)
  3186. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1649)
  3187. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (137)
  3188. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (137)
  3189. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (97_841)
  3190. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  3191. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (1933)
  3192. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1933)
  3193. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (452)
  3194. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (452)
  3195. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (97_841)
  3196. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_010_273)
  3197. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  3198. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  3199. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  3200. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  3201. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (21)
  3202. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (80)
  3203. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (80)
  3204. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1514)
  3205. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1514)
  3206. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_010_273)
  3207. \p{Separator} \p{General_Category=Separator} (Short:
  3208. \p{Z}) (20)
  3209. \p{Sharada} \p{Script=Sharada} (Short: \p{Shrd}; NOT
  3210. \p{Block=Sharada}) (83)
  3211. \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
  3212. \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
  3213. \p{Shrd} \p{Sharada} (= \p{Script=Sharada}) (NOT
  3214. \p{Block=Sharada}) (83)
  3215. \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
  3216. \p{Block=Sinhala}) (80)
  3217. \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
  3218. \p{Block=Sinhala}) (80)
  3219. \p{Sk} \p{Modifier_Symbol} (=
  3220. \p{General_Category=Modifier_Symbol})
  3221. (115)
  3222. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  3223. Math_Symbol}) (952)
  3224. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  3225. \p{InSmallForms}) (32)
  3226. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  3227. Small_Form_Variants}) (32)
  3228. \p{So} \p{Other_Symbol} (= \p{General_Category=
  3229. Other_Symbol}) (4404)
  3230. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  3231. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066)
  3232. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  3233. \p{Sora} \p{Sora_Sompeng} (= \p{Script=
  3234. Sora_Sompeng}) (NOT \p{Block=
  3235. Sora_Sompeng}) (35)
  3236. \p{Sora_Sompeng} \p{Script=Sora_Sompeng} (Short: \p{Sora};
  3237. NOT \p{Block=Sora_Sompeng}) (35)
  3238. \p{Space} \p{White_Space=Y} \s including beyond
  3239. ASCII and vertical tab (26)
  3240. \p{Space: *} \p{White_Space: *}
  3241. \p{Space_Separator} \p{General_Category=Space_Separator}
  3242. (Short: \p{Zs}) (18)
  3243. \p{SpacePerl} \p{XPerlSpace} (26)
  3244. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  3245. \p{Mc}) (353)
  3246. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  3247. (Short: \p{InModifierLetters}) (80)
  3248. X \p{Specials} \p{Block=Specials} (16)
  3249. \p{STerm} \p{STerm=Y} (83)
  3250. \p{STerm: N*} (Single: \P{STerm}) (1_114_029)
  3251. \p{STerm: Y*} (Single: \p{STerm}) (83)
  3252. \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
  3253. (NOT \p{Block=Sundanese}) (72)
  3254. \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
  3255. \p{Block=Sundanese}) (72)
  3256. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  3257. Sundanese_Supplement}) (16)
  3258. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  3259. \p{InSundaneseSup}) (16)
  3260. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  3261. Supplemental_Arrows_A}) (16)
  3262. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  3263. Supplemental_Arrows_B}) (128)
  3264. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  3265. \p{Block=
  3266. Supplemental_Mathematical_Operators})
  3267. (256)
  3268. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  3269. \p{Block=
  3270. Supplementary_Private_Use_Area_A})
  3271. (65_536)
  3272. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  3273. \p{Block=
  3274. Supplementary_Private_Use_Area_B})
  3275. (65_536)
  3276. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  3277. Supplemental_Punctuation}) (128)
  3278. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  3279. \p{Block=Superscripts_And_Subscripts})
  3280. (48)
  3281. X \p{Superscripts_And_Subscripts} \p{Block=
  3282. Superscripts_And_Subscripts} (Short:
  3283. \p{InSuperAndSub}) (48)
  3284. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  3285. \p{InSupArrowsA}) (16)
  3286. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  3287. \p{InSupArrowsB}) (128)
  3288. X \p{Supplemental_Mathematical_Operators} \p{Block=
  3289. Supplemental_Mathematical_Operators}
  3290. (Short: \p{InSupMathOperators}) (256)
  3291. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  3292. (Short: \p{InSupPunctuation}) (128)
  3293. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  3294. Supplementary_Private_Use_Area_A}
  3295. (Short: \p{InSupPUAA}) (65_536)
  3296. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  3297. Supplementary_Private_Use_Area_B}
  3298. (Short: \p{InSupPUAB}) (65_536)
  3299. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  3300. \p{Cs}) (2048)
  3301. \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
  3302. Syloti_Nagri}) (NOT \p{Block=
  3303. Syloti_Nagri}) (44)
  3304. \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
  3305. NOT \p{Block=Syloti_Nagri}) (44)
  3306. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  3307. (5520)
  3308. \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
  3309. \p{Block=Syriac}) (77)
  3310. \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
  3311. \p{Block=Syriac}) (77)
  3312. \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
  3313. \p{Block=Tagalog}) (20)
  3314. \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
  3315. \p{Block=Tagbanwa}) (18)
  3316. \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
  3317. \p{Block=Tagbanwa}) (18)
  3318. X \p{Tags} \p{Block=Tags} (128)
  3319. \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
  3320. \p{Block=Tai_Le}) (35)
  3321. \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
  3322. \p{Block=Tai_Tham}) (127)
  3323. \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
  3324. \p{Block=Tai_Viet}) (72)
  3325. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  3326. Tai_Xuan_Jing_Symbols}) (96)
  3327. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  3328. \p{InTaiXuanJing}) (96)
  3329. \p{Takr} \p{Takri} (= \p{Script=Takri}) (NOT
  3330. \p{Block=Takri}) (66)
  3331. \p{Takri} \p{Script=Takri} (Short: \p{Takr}; NOT
  3332. \p{Block=Takri}) (66)
  3333. \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
  3334. \p{Block=Tai_Le}) (35)
  3335. \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
  3336. (NOT \p{Block=New_Tai_Lue}) (83)
  3337. \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
  3338. \p{Block=Tamil}) (72)
  3339. \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
  3340. \p{Block=Tamil}) (72)
  3341. \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
  3342. \p{Block=Tai_Viet}) (72)
  3343. \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
  3344. \p{Block=Telugu}) (93)
  3345. \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
  3346. \p{Block=Telugu}) (93)
  3347. \p{Term} \p{Terminal_Punctuation} (=
  3348. \p{Terminal_Punctuation=Y}) (176)
  3349. \p{Term: *} \p{Terminal_Punctuation: *}
  3350. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  3351. \p{Term}) (176)
  3352. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  3353. (1_113_936)
  3354. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (176)
  3355. \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
  3356. \p{Block=Tifinagh}) (59)
  3357. \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
  3358. \p{Block=Tagalog}) (20)
  3359. \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
  3360. \p{Block=Thaana}) (50)
  3361. \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
  3362. \p{Block=Thaana}) (50)
  3363. \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
  3364. \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
  3365. \p{Block=Tibetan}) (207)
  3366. \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
  3367. \p{Block=Tibetan}) (207)
  3368. \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
  3369. \p{Block=Tifinagh}) (59)
  3370. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  3371. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  3372. Cased=Yes) (31)
  3373. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  3374. (Short: \p{Lt}; /i= General_Category=
  3375. Cased_Letter) (31)
  3376. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  3377. Transport_And_Map_Symbols}) (128)
  3378. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  3379. (Short: \p{InTransportAndMap}) (128)
  3380. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  3381. (= \p{Block=
  3382. Unified_Canadian_Aboriginal_Syllabics})
  3383. (640)
  3384. X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
  3385. Extended} (= \p{Block=
  3386. Unified_Canadian_Aboriginal_Syllabics_-
  3387. Extended}) (80)
  3388. \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
  3389. \p{Block=Ugaritic}) (31)
  3390. \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
  3391. \p{Block=Ugaritic}) (31)
  3392. \p{UIdeo} \p{Unified_Ideograph} (=
  3393. \p{Unified_Ideograph=Y}) (74_617)
  3394. \p{UIdeo: *} \p{Unified_Ideograph: *}
  3395. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  3396. \p{Cn}) (864_414)
  3397. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  3398. Unified_Canadian_Aboriginal_Syllabics}
  3399. (Short: \p{InUCAS}) (640)
  3400. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
  3401. Unified_Canadian_Aboriginal_Syllabics_-
  3402. Extended} (Short: \p{InUCASExt}) (80)
  3403. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  3404. (74_617)
  3405. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  3406. (1_039_495)
  3407. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_617)
  3408. \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
  3409. (1_003_930)
  3410. \p{Upper} \p{Uppercase=Y} (/i= Cased=Yes) (1483)
  3411. \p{Upper: *} \p{Uppercase: *}
  3412. \p{Uppercase} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  3413. Yes) (1483)
  3414. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  3415. No) (1_112_629)
  3416. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  3417. Yes) (1483)
  3418. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  3419. (Short: \p{Lu}; /i= General_Category=
  3420. Cased_Letter) (1441)
  3421. \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
  3422. \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
  3423. Vai}) (300)
  3424. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  3425. NOT \p{Variation_Selectors}) (259)
  3426. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853)
  3427. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  3428. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  3429. \p{InVS}) (16)
  3430. X \p{Variation_Selectors_Supplement} \p{Block=
  3431. Variation_Selectors_Supplement} (Short:
  3432. \p{InVSSup}) (240)
  3433. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  3434. Vedic_Extensions}) (48)
  3435. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  3436. \p{InVedicExt}) (48)
  3437. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  3438. \p{VertSpace} \v (7)
  3439. \p{VS} \p{Variation_Selector} (=
  3440. \p{Variation_Selector=Y}) (NOT
  3441. \p{Variation_Selectors}) (259)
  3442. \p{VS: *} \p{Variation_Selector: *}
  3443. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  3444. \p{Block=
  3445. Variation_Selectors_Supplement}) (240)
  3446. \p{WB: *} \p{Word_Break: *}
  3447. \p{White_Space} \p{White_Space=Y} (Short: \p{WSpace}) (26)
  3448. \p{White_Space: N*} (Short: \p{Space=N}, \P{WSpace})
  3449. (1_114_086)
  3450. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{WSpace}) (26)
  3451. \p{Word} \w, including beyond ASCII; = \p{Alnum} +
  3452. \pM + \p{Pc} (103_406)
  3453. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (24_941)
  3454. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  3455. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
  3456. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1649)
  3457. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
  3458. \p{Word_Break: FO} \p{Word_Break=Format} (136)
  3459. \p{Word_Break: Format} (Short: \p{WB=FO}) (136)
  3460. \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  3461. \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  3462. \p{Word_Break: LE} \p{Word_Break=ALetter} (24_941)
  3463. \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  3464. \p{Word_Break: MB} \p{Word_Break=MidNumLet} (8)
  3465. \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8)
  3466. \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  3467. \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (8)
  3468. \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
  3469. \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  3470. \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  3471. \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  3472. \p{Word_Break: NU} \p{Word_Break=Numeric} (451)
  3473. \p{Word_Break: Numeric} (Short: \p{WB=NU}) (451)
  3474. \p{Word_Break: Other} (Short: \p{WB=XX}) (1_086_551)
  3475. \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
  3476. \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  3477. \p{Word_Break: XX} \p{Word_Break=Other} (1_086_551)
  3478. \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (26)
  3479. \p{WSpace: *} \p{White_Space: *}
  3480. \p{XDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  3481. \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
  3482. (103_336)
  3483. \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_010_776)
  3484. \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (103_336)
  3485. \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (101_217)
  3486. \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_012_895)
  3487. \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (101_217)
  3488. \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
  3489. (103_336)
  3490. \p{XIDC: *} \p{XID_Continue: *}
  3491. \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (101_217)
  3492. \p{XIDS: *} \p{XID_Start: *}
  3493. \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
  3494. (NOT \p{Block=Old_Persian}) (50)
  3495. \p{XPerlSpace} \s, including beyond ASCII (Short:
  3496. \p{SpacePerl}) (26)
  3497. \p{XPosixAlnum} \p{Alnum} (102_619)
  3498. \p{XPosixAlpha} \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
  3499. \p{XPosixBlank} \p{Blank} (19)
  3500. \p{XPosixCntrl} \p{Cntrl} (= \p{General_Category=Control})
  3501. (65)
  3502. \p{XPosixDigit} \p{Digit} (= \p{General_Category=
  3503. Decimal_Number}) (460)
  3504. \p{XPosixGraph} \p{Graph} (247_565)
  3505. \p{XPosixLower} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
  3506. Yes) (1934)
  3507. \p{XPosixPrint} \p{Print} (247_583)
  3508. \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (641)
  3509. \p{XPosixSpace} \p{Space} (= \p{White_Space=Y}) (26)
  3510. \p{XPosixUpper} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
  3511. Yes) (1483)
  3512. \p{XPosixWord} \p{Word} (103_406)
  3513. \p{XPosixXDigit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
  3514. \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
  3515. (NOT \p{Block=Cuneiform}) (982)
  3516. \p{Yi} \p{Script=Yi} (1220)
  3517. X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  3518. X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  3519. \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
  3520. X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
  3521. Yijing_Hexagram_Symbols}) (64)
  3522. X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
  3523. (Short: \p{InYijing}) (64)
  3524. \p{Z} \p{Separator} (= \p{General_Category=
  3525. Separator}) (20)
  3526. \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
  3527. (523)
  3528. \p{Zl} \p{Line_Separator} (= \p{General_Category=
  3529. Line_Separator}) (1)
  3530. \p{Zp} \p{Paragraph_Separator} (=
  3531. \p{General_Category=
  3532. Paragraph_Separator}) (1)
  3533. \p{Zs} \p{Space_Separator} (=
  3534. \p{General_Category=Space_Separator})
  3535. (18)
  3536. \p{Zyyy} \p{Common} (= \p{Script=Common}) (6413)
  3537. \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
  3538. (1_003_930)
  3539. TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
  3540. stable) (= \p{Soft_Dotted=Y}) (46)
  3541. TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
  3542. stable) (= \p{Case_Ignorable=Y}) (1799)
  3543. TX\p{_CombAbove} (For internal use by Perl, not necessarily
  3544. stable) (= \p{Canonical_Combining_Class=
  3545. Above}) (349)
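
The table entries can be read as interchangeable spellings of the same match. The following is a minimal sketch, not taken from the original document, that exercises a few rows: a long form and its short form, the \P{} complement, and the "/i=" annotation. The variable names are illustrative only, and the results depend on the Unicode version your Perl was built with.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# The long form and the short form listed in the table name the same property.
my $long  = $alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0;
my $short = $alpha =~ /\p{Scx=Grek}/                 ? 1 : 0;

# \P{...} (capital P) matches the complement of \p{...}.
my $not_soft_dotted = "A" =~ /\P{SD}/ ? 1 : 0;

# Entries marked "/i= Cased=Yes" widen under case-insensitive matching:
# \p{Upper} then behaves like \p{Cased}, so a lowercase letter matches too.
my $upper_plain = "a" =~ /\p{Upper}/  ? 1 : 0;
my $upper_fold  = "a" =~ /\p{Upper}/i ? 1 : 0;

printf "long=%d short=%d not_sd=%d upper=%d upper/i=%d\n",
    $long, $short, $not_soft_dotted, $upper_plain, $upper_fold;
```

Entries flagged "NOT \p{Block=...}" in the table are a reminder that the single form names the script, not the block of the same name.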

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently don't match anything. This generally happens either because they are obsolete, or because they exist for symmetry with other forms but no language that uses them has yet been encoded. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
  • \p{Grapheme_Cluster_Break=Prepend}
  • \p{Joining_Type=Left_Joining}

Properties accessible through Unicode::UCD

All the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.
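
As a rough illustration of prop_invlist(), the sketch below counts the code points a property matches, which is the figure shown in parentheses in the table above. The helper count_code_points() is a hypothetical name introduced here, not part of Unicode::UCD. It assumes the documented inversion-list layout: even-indexed elements start a matching range, odd-indexed elements start a non-matching one, and a trailing unmatched start runs to the end of the Unicode range.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: number of code points matched by a property given
# in any \p{} form. Returns undef if this Perl does not know the property.
sub count_code_points {
    my ($property) = @_;

    # In scalar context prop_invlist() returns the element count, or undef
    # for an unknown property; 0 means "known, but matches nothing".
    my $elements = prop_invlist($property);
    return undef unless defined $elements;

    my @invlist = prop_invlist($property);
    my $count   = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # The range is [start, end); an odd-length list leaves the last
        # range open, so cap it at the end of the Unicode code space.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $invlist[$i];
    }
    return $count;
}

for my $property ("Soft_Dotted", "Script=Shavian", "Joining_Type=Left_Joining") {
    my $count = count_code_points($property);
    printf "%-28s %s\n", $property, defined $count ? $count : "(unknown)";
}
```

The counts printed depend on the Unicode version compiled into the running Perl, so they may differ from the figures in this document. A count of 0 corresponds to the constructs listed as matching no characters.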

Due to their nature, not all Unicode character properties are suitable for regular expression matches or for prop_invlist(). The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. However, there are some ambiguities between some Perl extensions and the Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single-form equivalents of Unicode properties, such as \p{sc} being a single-form equivalent of \p{gc=sc}, which is treated by prop_invmap() as the Script property, whose short name is sc. The table indicates the current ambiguities in the INFO column, beginning with the word "NOT".
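
The ambiguity can be seen directly. This is a sketch under stated assumptions rather than a definitive recipe: script_of() is a hypothetical helper written for this example, and it uses a plain linear scan over the inversion map instead of any lookup routine from Unicode::UCD.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap);

# In a regex, \p{sc} means General_Category=Currency_Symbol, but
# prop_invmap() resolves "sc" as the Script property.
my ($starts, $values, $format, $default) = prop_invmap("sc");

# Hypothetical helper: map a code point to its value by finding the last
# range start that is <= the code point. $starts->[0] is always 0.
sub script_of {
    my ($code_point) = @_;
    my $i = 0;
    $i++ while $i + 1 < @$starts && $starts->[$i + 1] <= $code_point;
    return $values->[$i];
}

printf "format=%s default=%s\n", $format, $default;
printf "U+0041 => %s\n", script_of(0x41);
printf "U+03B1 => %s\n", script_of(0x3B1);
```

The values returned are script names rather than general categories, which shows that the Script reading won. To get the currency symbols, spell the property out, for example as "General_Category=Currency_Symbol".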

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are described in Other Properties in perlunicode.

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. If a property is obsolete, etc, the entry will be flagged with the same characters used in the table in the section above, like D or S.

    NAME                          INFO
    Age
    AHex                          ASCII_Hex_Digit
    All                           Any. (Perl extension)
    Alnum                         (Perl extension). Alphabetic and
                                  (decimal) Numeric
    Alpha                         Alphabetic
    Alphabetic                    (Short: Alpha)
    Any                           (Perl extension). [\x{0000}-\x{10FFFF}]
    ASCII                         Block=ASCII. (Perl extension).
                                  [[:ASCII:]]
    ASCII_Hex_Digit               (Short: AHex)
    Assigned                      (Perl extension). All assigned code points
    Bc                            Bidi_Class
    Bidi_C                        Bidi_Control
    Bidi_Class                    (Short: bc)
    Bidi_Control                  (Short: Bidi_C)
    Bidi_M                        Bidi_Mirrored
    Bidi_Mirrored                 (Short: Bidi_M)
    Bidi_Mirroring_Glyph          (Short: bmg)
    Blank                         (Perl extension). \h, Horizontal white
                                  space
    Blk                           Block
    Block                         (Short: blk)
    Bmg                           Bidi_Mirroring_Glyph
    Canonical_Combining_Class     (Short: ccc)
    Case_Folding                  (Short: cf)
    Case_Ignorable                (Short: CI)
    Cased
    Category                      General_Category
    Ccc                           Canonical_Combining_Class
    CE                            Composition_Exclusion
    Cf                            Case_Folding; NOT 'cf' meaning
                                  'General_Category=Format'
    Changes_When_Casefolded       (Short: CWCF)
    Changes_When_Casemapped       (Short: CWCM)
    Changes_When_Lowercased       (Short: CWL)
    Changes_When_NFKC_Casefolded  (Short: CWKCF)
    Changes_When_Titlecased       (Short: CWT)
    Changes_When_Uppercased       (Short: CWU)
    CI                            Case_Ignorable
    Cntrl                         General_Category=Cntrl. (Perl extension).
                                  Control characters
    Comp_Ex                       Full_Composition_Exclusion
    Composition_Exclusion         (Short: CE)
    CWCF                          Changes_When_Casefolded
    CWCM                          Changes_When_Casemapped
    CWKCF                         Changes_When_NFKC_Casefolded
    CWL                           Changes_When_Lowercased
    CWT                           Changes_When_Titlecased
    CWU                           Changes_When_Uppercased
    Dash
    Decomposition_Mapping         (Short: dm)
    Decomposition_Type            (Short: dt)
    Default_Ignorable_Code_Point  (Short: DI)
    Dep                           Deprecated
    Deprecated                    (Short: Dep)
    DI                            Default_Ignorable_Code_Point
    Dia                           Diacritic
    Diacritic                     (Short: Dia)
    Digit                         General_Category=Digit. (Perl extension).
                                  [0-9] + all other decimal digits
    Dm                            Decomposition_Mapping
    Dt                            Decomposition_Type
    Ea                            East_Asian_Width
    East_Asian_Width              (Short: ea)
    Ext                           Extender
    Extender                      (Short: Ext)
    Full_Composition_Exclusion    (Short: Comp_Ex)
    Gc                            General_Category
    GCB                           Grapheme_Cluster_Break
    General_Category              (Short: gc)
    Gr_Base                       Grapheme_Base
    Gr_Ext                        Grapheme_Extend
    Graph                         (Perl extension). Characters that are
                                  graphical
    Grapheme_Base                 (Short: Gr_Base)
    Grapheme_Cluster_Break        (Short: GCB)
    Grapheme_Extend               (Short: Gr_Ext)
    Hangul_Syllable_Type          (Short: hst)
    Hex                           Hex_Digit
    Hex_Digit                     (Short: Hex)
    HorizSpace                    Blank. (Perl extension)
    Hst                           Hangul_Syllable_Type
  D Hyphen                        Supplanted by Line_Break property values;
                                  see www.unicode.org/reports/tr14
    ID_Continue                   (Short: IDC)
    ID_Start                      (Short: IDS)
    IDC                           ID_Continue
    Ideo                          Ideographic
    Ideographic                   (Short: Ideo)
    IDS                           ID_Start
    IDS_Binary_Operator           (Short: IDSB)
    IDS_Trinary_Operator          (Short: IDST)
    IDSB                          IDS_Binary_Operator
    IDST                          IDS_Trinary_Operator
    In                            Present_In. (Perl extension)
    Isc                           ISO_Comment; NOT 'isc' meaning
                                  'General_Category=Other'
    ISO_Comment                   (Short: isc)
    Jg                            Joining_Group
    Join_C                        Join_Control
    Join_Control                  (Short: Join_C)
    Joining_Group                 (Short: jg)
    Joining_Type                  (Short: jt)
    Jt                            Joining_Type
    Lb                            Line_Break
    Lc                            Lowercase_Mapping; NOT 'lc' meaning
                                  'General_Category=Cased_Letter'
    Line_Break                    (Short: lb)
    LOE                           Logical_Order_Exception
    Logical_Order_Exception       (Short: LOE)
    Lower                         Lowercase
    Lowercase                     (Short: Lower)
    Lowercase_Mapping             (Short: lc)
    Math
    Na                            Name
    Na1                           Unicode_1_Name
    Name                          (Short: na)
    Name_Alias
    NChar                         Noncharacter_Code_Point
    NFC_QC                        NFC_Quick_Check
    NFC_Quick_Check               (Short: NFC_QC)
    NFD_QC                        NFD_Quick_Check
    NFD_Quick_Check               (Short: NFD_QC)
    NFKC_Casefold                 (Short: NFKC_CF)
    NFKC_CF                       NFKC_Casefold
    NFKC_QC                       NFKC_Quick_Check
    NFKC_Quick_Check              (Short: NFKC_QC)
    NFKD_QC                       NFKD_Quick_Check
    NFKD_Quick_Check              (Short: NFKD_QC)
    Noncharacter_Code_Point       (Short: NChar)
    Nt                            Numeric_Type
    Numeric_Type                  (Short: nt)
    Numeric_Value                 (Short: nv)
    Nv                            Numeric_Value
    Pat_Syn                       Pattern_Syntax
    Pat_WS                        Pattern_White_Space
    Pattern_Syntax                (Short: Pat_Syn)
    Pattern_White_Space           (Short: Pat_WS)
    Perl_Decimal_Digit            (Perl extension)
    PerlSpace                     (Perl extension). \s, restricted to ASCII
                                  = [ \f\n\r\t] plus vertical tab
    PerlWord                      (Perl extension). \w, restricted to ASCII
                                  = [A-Za-z0-9_]
    PosixAlnum                    (Perl extension). [A-Za-z0-9]
    PosixAlpha                    (Perl extension). [A-Za-z]
    PosixBlank                    (Perl extension). \t and ' '
    PosixCntrl                    (Perl extension). ASCII control
                                  characters: NUL, SOH, STX, ETX, EOT, ENQ,
                                  ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
                                  DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
                                  CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
    PosixDigit                    (Perl extension). [0-9]
    PosixGraph                    (Perl extension).
                                  [-!"#$%&'()*+,./:;<>?@[\\]^_`{|}~0-9A-Za-z]
    PosixLower                    (Perl extension). [a-z]
    PosixPrint                    (Perl extension).
                                  [- 0-9A-Za-z!"#$%&'()*+,./:;<>?@[\\]^_`{|}~]
    PosixPunct                    (Perl extension).
                                  [-!"#$%&'()*+,./:;<>?@[\\]^_`{|}~]
    PosixSpace                    (Perl extension). \t, \n, \cK, \f, \r,
                                  and ' '. (\cK is vertical tab)
    PosixUpper                    (Perl extension). [A-Z]
    PosixWord                     PerlWord. (Perl extension)
    PosixXDigit                   (Perl extension). [0-9A-Fa-f]
    Present_In                    (Short: In). (Perl extension)
    Print                         (Perl extension). Characters that are
                                  graphical plus space characters (but no
                                  controls)
    Punct                         General_Category=Punct. (Perl extension)
    QMark                         Quotation_Mark
    Quotation_Mark                (Short: QMark)
    Radical
    SB                            Sentence_Break
    Sc                            Script; NOT 'sc' meaning
                                  'General_Category=Currency_Symbol'
    Scf                           Simple_Case_Folding
    Script                        (Short: sc)
    Script_Extensions             (Short: scx)
    Scx                           Script_Extensions
    SD                            Soft_Dotted
    Sentence_Break                (Short: SB)
    Sfc                           Simple_Case_Folding
    Simple_Case_Folding           (Short: scf)
    Simple_Lowercase_Mapping      (Short: slc)
    Simple_Titlecase_Mapping      (Short: stc)
    Simple_Uppercase_Mapping      (Short: suc)
    Slc                           Simple_Lowercase_Mapping
    Soft_Dotted                   (Short: SD)
    Space                         White_Space
    SpacePerl                     XPerlSpace. (Perl extension)
    Stc                           Simple_Titlecase_Mapping
    STerm
    Suc                           Simple_Uppercase_Mapping
    Tc                            Titlecase_Mapping
    Term                          Terminal_Punctuation
    Terminal_Punctuation          (Short: Term)
    Title                         Titlecase. (Perl extension)
    Titlecase                     (Short: Title). (Perl extension). (=
                                  \p{Gc=Lt})
    Titlecase_Mapping             (Short: tc)
    Uc                            Uppercase_Mapping
    UIdeo                         Unified_Ideograph
    Unicode_1_Name                (Short: na1)
    Unified_Ideograph             (Short: UIdeo)
    Upper                         Uppercase
    Uppercase                     (Short: Upper)
    Uppercase_Mapping             (Short: uc)
    Variation_Selector            (Short: VS)
    VertSpace                     (Perl extension). \v
    VS                            Variation_Selector
    WB                            Word_Break
    White_Space                   (Short: WSpace)
    Word                          (Perl extension). \w, including beyond
                                  ASCII; = \p{Alnum} + \pM + \p{Pc}
    Word_Break                    (Short: WB)
    WSpace                        White_Space
    XDigit                        (Perl extension)
    XID_Continue                  (Short: XIDC)
    XID_Start                     (Short: XIDS)
    XIDC                          XID_Continue
    XIDS                          XID_Start
    XPerlSpace                    (Perl extension). \s, including beyond
                                  ASCII
    XPosixAlnum                   Alnum. (Perl extension)
    XPosixAlpha                   Alpha. (Perl extension)
    XPosixBlank                   Blank. (Perl extension)
    XPosixCntrl                   General_Category=Cntrl. (Perl extension)
    XPosixDigit                   General_Category=Digit. (Perl extension)
    XPosixGraph                   Graph. (Perl extension)
    XPosixLower                   Lower. (Perl extension)
    XPosixPrint                   Print. (Perl extension)
    XPosixPunct                   (Perl extension). \p{Punct} + ASCII-range
                                  \p{Symbol}
    XPosixSpace                   Space. (Perl extension)
    XPosixUpper                   Upper. (Perl extension)
    XPosixWord                    Word. (Perl extension)
    XPosixXDigit                  XDigit. (Perl extension)

Properties accessible through other means

Certain properties are also accessible via core function calls. These are:

  Lowercase_Mapping    lc() and lcfirst()
  Titlecase_Mapping    ucfirst()
  Uppercase_Mapping    uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F escape in double-quoted strings, and the fc operator.

And, the Name and Name_Alias properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions, and through the functions charnames::viacode(), charnames::vianame(), and charnames::string_vianame() (which require a use charnames (); to be specified).

Finally, most properties related to decomposition are accessible via Unicode::Normalize.
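As an illustration only, the sketch below exercises several of these access paths in one place. The sample characters are arbitrary choices, fc() assumes Perl 5.16 or later, and NFD() is just one of the functions that Unicode::Normalize exports.

```perl
use strict;
use warnings;
use feature 'fc';                  # fc() needs Perl 5.16 or later
use charnames ':full';
use Unicode::Normalize qw(NFD);

my $dz = "\x{01C6}";               # LATIN SMALL LETTER DZ WITH CARON

my $upper = uc($dz);               # Uppercase_Mapping
my $title = ucfirst($dz);          # Titlecase_Mapping (differs from uc here)
my $lower = lc($upper);            # Lowercase_Mapping

# Case_Folding: final and non-final sigma fold to the same string.
my $same_fold = fc("\N{GREEK SMALL LETTER FINAL SIGMA}")
             eq fc("\N{GREEK SMALL LETTER SIGMA}");

# Name: from code point to name and back again.
my $name = charnames::viacode(0x03A3);
my $code = charnames::vianame($name);

# Decomposition, via Unicode::Normalize.
my @decomposed = map { sprintf('U+%04X', ord($_)) } split //, NFD("\x{00E9}");

printf "uc: U+%04X  ucfirst: U+%04X  lc: U+%04X\n",
    ord($upper), ord($title), ord($lower);
printf "sigma forms fold alike: %s\n", $same_fold ? 'yes' : 'no';
printf "U+%04X is named %s\n", $code, $name;
print  "NFD of U+00E9: @decomposed\n";
```

The DZ digraph is used because it is one of the few characters whose titlecase and uppercase mappings differ, which makes the distinction between ucfirst() and uc() visible.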

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses). As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Indic_Matra_Category (InMC)
  • Indic_Syllabic_Category (InSC)

    Provisional

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
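The work-arounds mentioned in the list above might be written as in this small sketch. It assumes a Perl whose Unicode tables include Script_Extensions, and the sample code points are arbitrary choices for illustration.

```perl
use strict;
use warnings;

# Grapheme_Link is rejected; Canonical_Combining_Class=Virama is the
# property it duplicates.
my $virama    = "\x{094D}";    # DEVANAGARI SIGN VIRAMA
my $is_virama = ($virama =~ /\p{Canonical_Combining_Class=Virama}/) ? 1 : 0;

# Katakana_Or_Hiragana is rejected; match either script explicitly instead.
my $kana_re = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;
my $is_kana = ("\x{30A2}" =~ $kana_re) ? 1 : 0;    # KATAKANA LETTER A

print "U+094D has ccc=Virama: $is_virama\n";
print "U+30A2 is kana:        $is_kana\n";
```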

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib}/unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib}/unicore/mktables, and then re-compiling and installing. (%Config is available from the Config module.)
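To see where these paths point on a given installation, something like the following sketch may help. Note that mktables is often present only in the Perl source tree and not in an installed Perl, so a "missing" result is not necessarily an error.

```perl
use strict;
use warnings;
use Config;
use File::Spec;

# Build the paths named above from %Config.
my $unicore  = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables = File::Spec->catfile($unicore, 'mktables');

printf "unicore directory: %s (%s)\n",
    $unicore,  (-d $unicore)  ? 'present' : 'missing';
printf "mktables program:  %s (%s)\n",
    $mktables, (-e $mktables) ? 'present' : 'missing';
```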

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.)
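A short sketch of two of these access paths follows. The sequence name used here is assumed to be present in the NamedSequences.txt that your Perl was built from.

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD qw(namedseq);

# Assumed to be a named sequence known to this Perl's Unicode tables.
my $name = 'KATAKANA LETTER AINU P';

my $via_charnames = charnames::string_vianame($name);
my $via_ucd       = namedseq($name);    # scalar context: the string
my @code_points   = namedseq($name);    # list context: the code points

printf "%s is %d code point(s): %s\n",
    $name, scalar(@code_points),
    join(' ', map { sprintf('U+%04X', $_) } @code_points);
```

Both functions return a multi-character string for a named sequence, which is why charnames::vianame(), which returns a single code point, is not suitable here.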

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation tests

  • auxiliary/LBTest.txt
  • auxiliary/SBTest.txt
  • auxiliary/WBTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • Index.txt

    Alphabetical index of Unicode characters

  • IndicMatraCategory.txt
  • IndicSyllabicCategory.txt

    Provisional; for the analysis and processing of Indic scripts

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • Props.txt

    Only in very early releases; is a subset of PropList.txt (which is used instead)

  • ReadMe.txt

    Documentation

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base at http://www.unicode.org/ivd

  • USourceData.pdf
  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode