Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.
help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Controlβ | No (N), Yes (Y) | |||
Bidi_Mirrored | No (N), Yes (Y) | |||
Bidi_Mirroredβ | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Classβ | Show Values | |||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
Bidi_Paired_Bracket_Typeβ | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Mirroring_Glyphβ | Show Values | |||
Bidi_Paired_Bracket | Show Values | |||
Bidi_Paired_Bracketβ | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case_Ignorableβ | No (N), Yes (Y) | |||
Cased | No (N), Yes (Y) | |||
Casedβ | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casefoldedβ | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Casemappedβ | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Lowercasedβ | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Titlecasedβ | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Changes_When_Uppercasedβ | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Lowercaseβ | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Soft_Dottedβ | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Uppercaseβ | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Case_Foldingβ | Show Values | |||
Lowercase_Mapping | Show Values | |||
Lowercase_Mappingβ | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Case_Foldingβ | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Lowercase_Mappingβ | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Titlecase_Mappingβ | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Simple_Uppercase_Mappingβ | Show Values | |||
Titlecase_Mapping | Show Values | |||
Titlecase_Mappingβ | Show Values | |||
Uppercase_Mapping | Show Values | |||
Uppercase_Mappingβ | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Binary_Operatorβ | No (N), Yes (Y) | |||
IDS_Trinary_Operator | No (N), Yes (Y) | |||
IDS_Trinary_Operatorβ | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Ideographicβ | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Radicalβ | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Unified_Ideographβ | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radicalβ | Show Values | |
Equivalent_Unified_Ideographβ | Show Values | |||
kSimplifiedVariantβ | Show Values | |||
kTraditionalVariantβ | Show Values | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Extended_Pictographicβ | No (N), Yes (Y) | |||
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Basic_Emojiβ | No (No), Yes (Yes) | |||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Componentβ | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Baseβ | No (N), Yes (Y) | |||
Emoji_Modifierβ | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Presentationβ | No (N), Yes (Y) | |||
Emojiβ | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Flag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Keycap_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequenceβ | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
Regional_Indicatorβ | No (N), Yes (Y) | |||
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Alphabeticβ | No (N), Yes (Y) | |||
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Default_Ignorable_Code_Pointβ | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Deprecatedβ | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Logical_Order_Exceptionβ | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Noncharacter_Code_Pointβ | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
Variation_Selectorβ | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
White_Spaceβ | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Ageβ | Show Values | |||
Block | Show Values | |||
Blockβ | Show Values | |||
Script | Show Values | |||
Scriptβ | Show Values | |||
Enumerated | General_Category | Show Values | ||
General_Categoryβ | Show Values | |||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Hangul_Syllable_Typeβ | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Aliasβ | Show Values | |||
Named_Sequences_Provβ | ||||
Named_Sequencesβ | Show Values | |||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Nameβ | Show Values | |||
Script_Extensions | Show Values | |||
Script_Extensionsβ | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Continueβ | No (N), Yes (Y) | |||
ID_Start | No (N), Yes (Y) | |||
ID_Startβ | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_Syntaxβ | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
Pattern_White_Spaceβ | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Continueβ | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
XID_Startβ | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008β | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Statusβ | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mappingβ | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Dashβ | No (N), Yes (Y) | |||
Diacritic | No (N), Yes (Y) | |||
Diacriticβ | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Extenderβ | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Mathβ | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Quotation_Markβ | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
STermβ | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Terminal_Punctuationβ | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Positional_Categoryβ | Show Values | |||
Indic_Syllabic_Category | Show Values | |||
Indic_Syllabic_Categoryβ | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Changes_When_NFKC_Casefoldedβ | No (N), Yes (Y) | |||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Canonical_Combining_Classβ | Show Values | |||
Decomposition_Type | Show Values | |||
Decomposition_Typeβ | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFD_Quick_Checkβ | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
NFKD_Quick_Checkβ | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
NFKC_Casefoldβ | Show Values | |||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
ASCII_Hex_Digitβ | No (N), Yes (Y) | |||
Hex_Digit | No (N), Yes (Y) | |||
Hex_Digitβ | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric_Typeβ | Decimal (De), Digit (Di), None (None), Numeric (Nu) | |||
kAccountingNumericβ | Show Values | |||
kOtherNumericβ | Show Values | |||
kPrimaryNumericβ | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Numeric_Valueβ | Show Values | |||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MAβ | Show Values |
Identifier_Statusβ | Allowed (a), Restricted (r) | |||
Identifier_Typeβ | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Join_Controlβ | No (N), Yes (Y) | |||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
East_Asian_Widthβ | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |||
Grapheme_Cluster_Break | Show Values | |||
Grapheme_Cluster_Breakβ | Show Values | |||
Joining_Group | African_Feh (African_Feh), African_Noon (African_Noon), African_Qaf (African_Qaf), Ain (Ain), Alaph (Alaph), Alef (Alef), Beh (Beh), Beth (Beth), Burushaski_Yeh_Barree (Burushaski_Yeh_Barree), Dal (Dal), Dalath_Rish (Dalath_Rish), E (E), Farsi_Yeh (Farsi_Yeh), Fe (Fe), Feh (Feh), Final_Semkath (Final_Semkath), Gaf (Gaf), Gamal (Gamal), Hah (Hah), Hamza_On_Heh_Goal (Teh_Marbuta_Goal), Hanifi_Rohingya_Kinna_Ya (Hanifi_Rohingya_Kinna_Ya), Hanifi_Rohingya_Pa (Hanifi_Rohingya_Pa), He (He), Heh (Heh), Heh_Goal (Heh_Goal), Heth (Heth), Kaf (Kaf), Kaph (Kaph), Khaph (Khaph), Knotted_Heh (Knotted_Heh), Lam (Lam), Lamadh (Lamadh), Malayalam_Bha (Malayalam_Bha), Malayalam_Ja (Malayalam_Ja), Malayalam_Lla (Malayalam_Lla), Malayalam_Llla (Malayalam_Llla), Malayalam_Nga (Malayalam_Nga), Malayalam_Nna (Malayalam_Nna), Malayalam_Nnna (Malayalam_Nnna), Malayalam_Nya (Malayalam_Nya), Malayalam_Ra (Malayalam_Ra), Malayalam_Ssa (Malayalam_Ssa), Malayalam_Tta (Malayalam_Tta), Manichaean_Aleph (Manichaean_Aleph), Manichaean_Ayin (Manichaean_Ayin), Manichaean_Beth (Manichaean_Beth), Manichaean_Daleth (Manichaean_Daleth), Manichaean_Dhamedh (Manichaean_Dhamedh), Manichaean_Five (Manichaean_Five), Manichaean_Gimel (Manichaean_Gimel), Manichaean_Heth (Manichaean_Heth), Manichaean_Hundred (Manichaean_Hundred), Manichaean_Kaph (Manichaean_Kaph), Manichaean_Lamedh (Manichaean_Lamedh), Manichaean_Mem (Manichaean_Mem), Manichaean_Nun (Manichaean_Nun), Manichaean_One (Manichaean_One), Manichaean_Pe (Manichaean_Pe), Manichaean_Qoph (Manichaean_Qoph), Manichaean_Resh (Manichaean_Resh), Manichaean_Sadhe (Manichaean_Sadhe), Manichaean_Samekh (Manichaean_Samekh), Manichaean_Taw (Manichaean_Taw), Manichaean_Ten (Manichaean_Ten), Manichaean_Teth (Manichaean_Teth), Manichaean_Thamedh (Manichaean_Thamedh), Manichaean_Twenty (Manichaean_Twenty), Manichaean_Waw (Manichaean_Waw), Manichaean_Yodh (Manichaean_Yodh), Manichaean_Zayin (Manichaean_Zayin), Meem (Meem), Mim (Mim), No_Joining_Group (No_Joining_Group), Noon (Noon), Nun (Nun), Nya (Nya), Pe (Pe), Qaf (Qaf), Qaph (Qaph), Reh (Reh), Reversed_Pe (Reversed_Pe), Rohingya_Yeh (Rohingya_Yeh), Sad (Sad), Sadhe (Sadhe), Seen (Seen), Semkath (Semkath), Shin (Shin), Straight_Waw (Straight_Waw), Swash_Kaf (Swash_Kaf), Syriac_Waw (Syriac_Waw), Tah (Tah), Taw (Taw), Teh_Marbuta (Teh_Marbuta), Teth (Teth), Thin_Yeh (Thin_Yeh), Vertical_Tail (Vertical_Tail), Waw (Waw), Yeh (Yeh), Yeh_Barree (Yeh_Barree), Yeh_With_Tail (Yeh_With_Tail), Yudh (Yudh), Yudh_He (Yudh_He), Zain (Zain), Zhain (Zhain) | |||
Joining_Groupβ | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Joining_Typeβ | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Line_Breakβ | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Prepended_Concatenation_Markβ | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Sentence_Breakβ | Show Values | |||
Standardized_Variantβ | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Vertical_Orientationβ | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
Word_Breakβ | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
ID_Compat_Math_Continueβ | Other | |||
ID_Compat_Math_Startβ | Other | |||
IDS_Unary_Operatorβ | Other | |||
NFKC_Simple_Casefoldβ | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;