help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Mirrored | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Paired_Bracket | Show Values | |||
Case | Binary | ICU | Case_Sensitive | No (N), Yes (Y) |
UCD | Case_Ignorable | No (N), Yes (Y) | ||
Cased | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
String | UCD | Case_Folding | Show Values | |
Lowercase_Mapping | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Titlecase_Mapping | Show Values | |||
Uppercase_Mapping | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Trinary_Operator | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radical | Show Values | |
kSimplifiedVariant | Show Values | |||
kTraditionalVariant | Show Values | |||
Emoji | Binary | UTS | Emoji | No (N), Yes (Y) |
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Flag_Sequence | No (No), Yes (Yes) | |||
Emoji_Keycap_Sequence | No (No), Yes (Yes) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Sequence | No (No), Yes (Yes) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Tag_Sequence | No (No), Yes (Yes) | |||
Emoji_Zwj_Sequence | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Block | Show Values | |||
Script | Show Values | |||
Enumerated | General_Category | Show Values | ||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Alias | Show Values | |||
Named_Sequences | Show Values | |||
Named_Sequences_Prov | ||||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Script_Extensions | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Start | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mapping | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Diacritic | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Sentence_Terminal | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Syllabic_Category | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Decomposition_Type | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Hex_Digit | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
kAccountingNumeric | Show Values | |||
kOtherNumeric | Show Values | |||
kPrimaryNumeric | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MA | Show Values |
Identifier_Status | Allowed (a), Restricted (r) | |||
Identifier_Type | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
Grapheme_Cluster_Break | Show Values | |||
Joining_Group | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Standardized_Variant | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | , 4A, 4B, 05, 05 05, 05 05 05, 05 05 05 05, 05 05 05 05 05, 05 05 05 05 05 05, 05 05 05 05 05 05 05, 05 05 05 05 05 05 05 05, 05 05 05 05 05 05 05 05 05 05 05 05 05 05 05 05 05 05, 05 05 05 05 05 AE, 05 05 05 05 AE, 05 05 05 05 AE 05, 05 05 05 AE, 05 05 05 AE 05, 05 05 05 AE 05 05, 05 05 05 B0, 05 05 05 B0 05, 05 05 90, 05 05 AE, 05 05 AE 05 05, 05 05 B0 05, 05 05 B0 05 05, 05 05 E30C, 05 05 F0F1, 05 8A, 05 8A D8, 05 8C, 05 8C 8A, 05 8C 9A, 05 8C 88, 05 8C B6, 05 8E, 05 8E 8A, 05 8E 9A, 05 8E 88, 05 8E B6, 05 9A, 05 9A 88, 05 9A 96, 05 9A A4, 05 9C, 05 9C A4, 05 9E, 05 9E 88, 05 84, 05 84 8A, 05 84 8A D8, 05 84 88, 05 84 88 D8, 05 84 94, 05 84 94 D8, 05 84 D8, 05 86, 05 86 8A, 05 86 8A D8, 05 86 88, 05 86 88 D8, 05 86 94, 05 86 94 D8, 05 86 D8, 05 88, 05 88 9C, 05 88 D8, 05 90, 05 90 9C, 05 92, 05 92 88, 05 94, 05 94 D8, 05 96, 05 96 8A, 05 96 88, 05 96 90, 05 96 94, 05 96 A4, 05 98, 05 A0, 05 A0 8C, 05 A0 88, 05 A2, 05 A2 A4, 05 A4, 05 A4 8A, 05 A4 88, 05 A4 96, 05 A8, 05 AA, 05 AC, 05 AE, 05 AE 05, 05 AE 05 05, 05 AE 05 05 05, 05 AE 05 05 05 05, 05 AE 05 05 AE 05, 05 AE 05 AE, 05 B0, 05 B0 05, 05 B0 05 05, 05 B0 05 05 05, 05 B0 05 05 05 05, 05 B0 05 05 AE, 05 B2, 05 B6, 05 B8, 05 BC, 05 BE, 05 BE 8A, 05 BE 9A, 05 BE 88, 05 BE B6, 05 BE C4, 05 C4, 05 C4 8C, 05 C4 8E, 05 C4 9C, 05 C4 A4, 05 C6, 05 C8, 05 CA, 05 CC, 05 CE, 05 D0, 05 D2, 05 D8, 05 E2A7, 05 E3B1, 05 E3D2, 05 E3D2 E3B1, 05 E3D2 E390, 05 E3F3, 05 E5A4, 05 E30C, 05 E32D, 05 E34E, 05 E390, 05 E880, 05 EB3B, 05 F0AF, 05 FB99, 7A, 7C, 8A, 8C, 8E, 9A, 9C, 9E, 46, 47, 48, 49, 70, 70 05, 70 05 88, 70 05 A4, 70 70, 73, 74, 75, 76, 78, 78 05, 78 9C, 79, 82, 84, 86, 88, 90, 92, 94, 96, 96 88, 98, A0, A2, A4, A6, A8, AA, AC, AE, B0, B2, B4, B6, B8, BA, BC, BE, C0, C2, C4, C6, C8, CA, CC, CE, D0, D2, D4, D6, D8, DA, DC, DE, E0, E2A7, E2C8, E2E9, E3B1, E3D2, E3F3, E4BB, E4DC, E4FD, E5A4, E5C5, E5E6, E6AE, E6CF, E6CF E81D, E6F0, E7B8, E7B8 E81D, E7D9, E7FA, E8A1, E8C2, E8E3, E9AB, E9CC, E9ED, E30C, E32D, E34E, E36F, E49A, E62A, E64B, E66C, E66C E81D, E68D, E81D, E81D EB3B, E83E, E85F, E98A, E202, E223, E244, E265 too many values to show | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Basic_Emoji | Other |
Equivalent_Unified_Ideograph | Other | |||
Extended_Pictographic | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 63.1; Unicode version: 12.0;