Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.
help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Controlβ | No (N), Yes (Y) | |||
Bidi_Mirrored | No (N), Yes (Y) | |||
Bidi_Mirroredβ | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Classβ | Show Values | |||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
Bidi_Paired_Bracket_Typeβ | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Mirroring_Glyphβ | Show Values | |||
Bidi_Paired_Bracket | Show Values | |||
Bidi_Paired_Bracketβ | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case_Ignorableβ | No (N), Yes (Y) | |||
Cased | No (N), Yes (Y) | |||
Casedβ | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casefoldedβ | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Casemappedβ | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Lowercasedβ | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Titlecasedβ | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Changes_When_Uppercasedβ | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Lowercaseβ | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Soft_Dottedβ | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Uppercaseβ | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Case_Foldingβ | Show Values | |||
Lowercase_Mapping | Show Values | |||
Lowercase_Mappingβ | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Case_Foldingβ | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Lowercase_Mappingβ | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Titlecase_Mappingβ | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Simple_Uppercase_Mappingβ | <code point>, Ⓜ (Ⓜ), A (A), A (A), Ⓐ (Ⓐ), Á (Á), À (À), Ă (Ă), Ắ (Ắ), Ằ (Ằ), Ẵ (Ẵ), Ẳ (Ẳ), Â (Â), Ấ (Ấ), Ầ (Ầ), Ẫ (Ẫ), Ẩ (Ẩ), Ǎ (Ǎ), Å (Å), Ǻ (Ǻ), Ä (Ä), Ꞛ (Ꞛ), Ǟ (Ǟ), Ã (Ã), Ȧ (Ȧ), Ǡ (Ǡ), Ą (Ą), Ꟁ (Ꟁ), Ā (Ā), Ả (Ả), Ȁ (Ȁ), Ȃ (Ȃ), Ạ (Ạ), Ặ (Ặ), Ậ (Ậ), Ḁ (Ḁ), Ꜳ (Ꜳ), Æ (Æ), Ǽ (Ǽ), Ǣ (Ǣ), Ꜵ (Ꜵ), Ꜷ (Ꜷ), Ꜹ (Ꜹ), Ꜻ (Ꜻ), Ꜽ (Ꜽ), Ⱥ (Ⱥ), Ꞻ (Ꞻ), Ɐ (Ɐ), Ɑ (Ɑ), Ɒ (Ɒ), B (B), B (B), Ⓑ (Ⓑ), Ḃ (Ḃ), Ḅ (Ḅ), Ḇ (Ḇ), Ƀ (Ƀ), Ꞗ (Ꞗ), Ɓ (Ɓ), Ƃ (Ƃ), Ꞵ (Ꞵ), C (C), C (C), Ⅽ (Ⅽ), Ⓒ (Ⓒ), Ć (Ć), Ĉ (Ĉ), Č (Č), Ċ (Ċ), Ç (Ç), Ḉ (Ḉ), Ȼ (Ȼ), Ꞓ (Ꞓ), Ꞔ (Ꞔ), Ƈ (Ƈ), Ↄ (Ↄ), Ꜿ (Ꜿ), D (D), D (D), Ⅾ (Ⅾ), Ⓓ (Ⓓ), Ď (Ď), Ḋ (Ḋ), Ḑ (Ḑ), Đ (Đ), Ḍ (Ḍ), Ḓ (Ḓ), Ḏ (Ḏ), Ð (Ð), Ꝺ (Ꝺ), DZ (DZ), DŽ (DŽ), Ꟈ (Ꟈ), Ɖ (Ɖ), Ɗ (Ɗ), Ƌ (Ƌ), E (E), E (E), Ⓔ (Ⓔ), É (É), È (È), Ĕ (Ĕ), Ê (Ê), Ế (Ế), Ề (Ề), Ễ (Ễ), Ể (Ể), Ě (Ě), Ë (Ë), Ẽ (Ẽ), Ė (Ė), Ȩ (Ȩ), Ḝ (Ḝ), Ę (Ę), Ē (Ē), Ḗ (Ḗ), Ḕ (Ḕ), Ẻ (Ẻ), Ȅ (Ȅ), Ȇ (Ȇ), Ẹ (Ẹ), Ệ (Ệ), Ḙ (Ḙ), Ḛ (Ḛ), Ɇ (Ɇ), Ǝ (Ǝ), Ə (Ə), Ɛ (Ɛ), Ɜ (Ɜ), F (F), F (F), Ⓕ (Ⓕ), Ḟ (Ḟ), Ꝼ (Ꝼ), Ꞙ (Ꞙ), Ƒ (Ƒ), Ⅎ (Ⅎ), G (G), G (G), Ⓖ (Ⓖ), Ǵ (Ǵ), Ğ (Ğ), Ĝ (Ĝ), Ǧ (Ǧ), Ġ (Ġ), Ģ (Ģ), Ḡ (Ḡ), Ꞡ (Ꞡ), Ᵹ (Ᵹ), Ꟑ (Ꟑ), Ɡ (Ɡ), Ǥ (Ǥ), Ɠ (Ɠ), Ꝿ (Ꝿ), Ɣ (Ɣ), Ƣ (Ƣ), H (H), H (H), Ⓗ (Ⓗ), Ĥ (Ĥ), Ȟ (Ȟ), Ḧ (Ḧ), Ḣ (Ḣ), Ḩ (Ḩ), Ħ (Ħ), Ḥ (Ḥ), Ḫ (Ḫ), Ƕ (Ƕ), Ɦ (Ɦ), Ⱨ (Ⱨ), Ⱶ (Ⱶ), Ꟶ (Ꟶ), Ꜧ (Ꜧ), I (I), I (I), Ⅰ (Ⅰ), Ⓘ (Ⓘ), Í (Í), Ì (Ì), Ĭ (Ĭ), Î (Î), Ǐ (Ǐ), Ï (Ï), Ḯ (Ḯ), Ĩ (Ĩ), Į (Į), Ī (Ī), Ỉ (Ỉ), Ȉ (Ȉ), Ȋ (Ȋ), Ị (Ị), Ḭ (Ḭ), Ⅱ (Ⅱ), Ⅲ (Ⅲ), IJ (IJ), Ⅳ (Ⅳ), Ⅸ (Ⅸ), Ɪ (Ɪ), Ɨ (Ɨ), Ꞽ (Ꞽ), Ɩ (Ɩ), J (J), J (J), Ⓙ (Ⓙ), Ĵ (Ĵ), Ɉ (Ɉ), Ʝ (Ʝ), K (K), K (K), Ⓚ (Ⓚ), Ḱ (Ḱ), Ǩ (Ǩ), Ķ (Ķ), Ꞣ (Ꞣ), Ḳ (Ḳ), Ḵ (Ḵ), Ƙ (Ƙ), Ⱪ (Ⱪ), Ꝁ (Ꝁ), Ꝃ (Ꝃ), Ꝅ (Ꝅ), Ʞ (Ʞ), L (L), L (L), Ⅼ (Ⅼ), Ⓛ (Ⓛ), Ĺ (Ĺ), Ľ (Ľ), Ļ (Ļ), Ł (Ł), Ḷ (Ḷ), Ḹ (Ḹ), Ḽ (Ḽ), Ḻ (Ḻ), Ŀ (Ŀ), LJ (LJ), Ỻ (Ỻ), Ꝇ (Ꝇ), Ꝉ (Ꝉ), Ƚ (Ƚ), Ⱡ (Ⱡ), Ɫ (Ɫ), Ɬ (Ɬ), Ꞁ (Ꞁ), M (M), M (M), Ⅿ (Ⅿ), Ḿ (Ḿ), Ṁ (Ṁ), Ṃ (Ṃ), Ɱ (Ɱ), N (N), N (N), Ⓝ (Ⓝ), Ń (Ń) too many values to show | |||
Titlecase_Mapping | Show Values | |||
Titlecase_Mappingβ | Show Values | |||
Uppercase_Mapping | Show Values | |||
Uppercase_Mappingβ | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Binary_Operatorβ | No (N), Yes (Y) | |||
IDS_Trinary_Operator | No (N), Yes (Y) | |||
IDS_Trinary_Operatorβ | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Ideographicβ | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Radicalβ | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Unified_Ideographβ | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radicalβ | Show Values | |
Equivalent_Unified_Ideographβ | Show Values | |||
kSimplifiedVariantβ | Show Values | |||
kTraditionalVariantβ | Show Values | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Extended_Pictographicβ | No (N), Yes (Y) | |||
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Basic_Emojiβ | No (No), Yes (Yes) | |||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Componentβ | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Baseβ | No (N), Yes (Y) | |||
Emoji_Modifierβ | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Presentationβ | No (N), Yes (Y) | |||
Emojiβ | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Flag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Keycap_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequenceβ | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
Regional_Indicatorβ | No (N), Yes (Y) | |||
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Alphabeticβ | No (N), Yes (Y) | |||
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Default_Ignorable_Code_Pointβ | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Deprecatedβ | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Logical_Order_Exceptionβ | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Noncharacter_Code_Pointβ | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
Variation_Selectorβ | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
White_Spaceβ | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Ageβ | Show Values | |||
Block | Show Values | |||
Blockβ | Show Values | |||
Script | Show Values | |||
Scriptβ | Show Values | |||
Enumerated | General_Category | Show Values | ||
General_Categoryβ | Show Values | |||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Hangul_Syllable_Typeβ | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Aliasβ | Show Values | |||
Named_Sequences_Provβ | ||||
Named_Sequencesβ | Show Values | |||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Nameβ | Show Values | |||
Script_Extensions | Show Values | |||
Script_Extensionsβ | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Continueβ | No (N), Yes (Y) | |||
ID_Start | No (N), Yes (Y) | |||
ID_Startβ | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_Syntaxβ | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
Pattern_White_Spaceβ | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Continueβ | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
XID_Startβ | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008β | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Statusβ | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mappingβ | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Dashβ | No (N), Yes (Y) | |||
Diacritic | No (N), Yes (Y) | |||
Diacriticβ | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Extenderβ | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Mathβ | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Quotation_Markβ | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
STermβ | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Terminal_Punctuationβ | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Positional_Categoryβ | Show Values | |||
Indic_Syllabic_Category | Show Values | |||
Indic_Syllabic_Categoryβ | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Changes_When_NFKC_Casefoldedβ | No (N), Yes (Y) | |||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Canonical_Combining_Classβ | Show Values | |||
Decomposition_Type | Show Values | |||
Decomposition_Typeβ | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFD_Quick_Checkβ | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
NFKD_Quick_Checkβ | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
NFKC_Casefoldβ | Show Values | |||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
ASCII_Hex_Digitβ | No (N), Yes (Y) | |||
Hex_Digit | No (N), Yes (Y) | |||
Hex_Digitβ | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric_Typeβ | Decimal (De), Digit (Di), None (None), Numeric (Nu) | |||
kAccountingNumericβ | Show Values | |||
kOtherNumericβ | Show Values | |||
kPrimaryNumericβ | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Numeric_Valueβ | Show Values | |||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MAβ | Show Values |
Identifier_Statusβ | Allowed (a), Restricted (r) | |||
Identifier_Typeβ | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Join_Controlβ | No (N), Yes (Y) | |||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
East_Asian_Widthβ | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |||
Grapheme_Cluster_Break | Show Values | |||
Grapheme_Cluster_Breakβ | Show Values | |||
Joining_Group | Show Values | |||
Joining_Groupβ | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Joining_Typeβ | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Line_Breakβ | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Prepended_Concatenation_Markβ | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Sentence_Breakβ | Show Values | |||
Standardized_Variantβ | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Vertical_Orientationβ | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
Word_Breakβ | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
ID_Compat_Math_Continueβ | Other | |||
ID_Compat_Math_Startβ | Other | |||
IDS_Unary_Operatorβ | Other | |||
NFKC_Simple_Casefoldβ | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;