Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidirectional | Binary | UCD | Bidi_Mirrored | No (N), Yes (Y) |
Bidirectional | Enumerated | UCD | Bidi_Class | Show Values |
Bidirectional | Enumerated | UCD | Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) |
Bidirectional | String | UCD | Bidi_Mirroring_Glyph | Show Values |
Bidirectional | String | UCD | Bidi_Paired_Bracket | Show Values |
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case | Binary | UCD | Cased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Casefolded | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Casemapped | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Lowercased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Titlecased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Uppercased | No (N), Yes (Y) |
Case | Binary | UCD | Lowercase | No (N), Yes (Y) |
Case | Binary | UCD | Soft_Dotted | No (N), Yes (Y) |
Case | Binary | UCD | Uppercase | No (N), Yes (Y) |
Case | Binary | Unicode | isCased | No (N), Yes (Y) |
Case | Binary | Unicode | isCasefolded | No (N), Yes (Y) |
Case | Binary | Unicode | isLowercase | No (N), Yes (Y) |
Case | Binary | Unicode | isTitlecase | No (N), Yes (Y) |
Case | Binary | Unicode | isUppercase | No (N), Yes (Y) |
Case | Binary | X-ICU | Case_Sensitive | No (N), Yes (Y) |
Case | String | UCD | Case_Folding | Show Values |
Case | String | UCD | Lowercase_Mapping | Show Values |
Case | String | UCD | Simple_Case_Folding | Show Values |
Case | String | UCD | Simple_Lowercase_Mapping | Show Values |
Case | String | UCD | Simple_Titlecase_Mapping | Show Values |
Case | String | UCD | Simple_Uppercase_Mapping | Show Values |
Case | String | UCD | Titlecase_Mapping | Show Values |
Case | String | UCD | Uppercase_Mapping | Show Values |
Case | String | Unicode | toCasefold | Show Values |
Case | String | Unicode | toLowercase | Show Values |
Case | String | Unicode | toTitlecase | Show Values |
Case | String | Unicode | toUppercase | Show Values |
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
CJK | Binary | UCD | IDS_Trinary_Operator | No (N), Yes (Y) |
CJK | Binary | UCD | Ideographic | No (N), Yes (Y) |
CJK | Binary | UCD | Radical | No (N), Yes (Y) |
CJK | Binary | UCD | Unified_Ideograph | No (N), Yes (Y) |
CJK | Enumerated | X-Demo | HanType | Han, Hans, Hant, na |
CJK | String | UCD | CJK_Radical | Show Values |
CJK | String | UCD | Equivalent_Unified_Ideograph | Show Values |
CJK | String | UCD | kSimplifiedVariant | Show Values |
CJK | String | UCD | kTraditionalVariant | Show Values |
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Emoji | Binary | UTS | Basic_Emoji | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Component | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Modifier | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Modifier_Base | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Presentation | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji | No, Yes |
Emoji | Binary | UTS | RGI_Emoji_Flag_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Keycap_Sequence | No (No), Yes (Yes) |
Emoji | Binary | UTS | RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Tag_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) |
Emoji | Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
General | Binary | UCD | Default_Ignorable_Code_Point | No (N), Yes (Y) |
General | Binary | UCD | Deprecated | No (N), Yes (Y) |
General | Binary | UCD | Logical_Order_Exception | No (N), Yes (Y) |
General | Binary | UCD | Noncharacter_Code_Point | No (N), Yes (Y) |
General | Binary | UCD | Variation_Selector | No (N), Yes (Y) |
General | Binary | UCD | White_Space | No (N), Yes (Y) |
General | Catalog | UCD | Age | Show Values |
General | Catalog | UCD | Block | Show Values |
General | Catalog | UCD | Script | Show Values |
General | Enumerated | UCD | General_Category | Show Values |
General | Enumerated | UCD | Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) |
General | Enumerated | UCD | Name_Alias | Show Values |
General | Enumerated | UCD | Named_Sequences | Show Values |
General | Enumerated | UCD | Named_Sequences_Prov | |
General | String | Nameslist | subhead | Show Values |
General | String | UCD | Name | Show Values |
General | String | UCD | Script_Extensions | Show Values |
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
Identifiers | Binary | UCD | ID_Start | No (N), Yes (Y) |
Identifiers | Binary | UCD | Pattern_Syntax | No (N), Yes (Y) |
Identifiers | Binary | UCD | Pattern_White_Space | No (N), Yes (Y) |
Identifiers | Binary | UCD | XID_Continue | No (N), Yes (Y) |
Identifiers | Binary | UCD | XID_Start | No (N), Yes (Y) |
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
IDNA | Enumerated | UTS | Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) |
IDNA | Enumerated | UTS | idna2003 | deviation, disallowed, ignored, mapped, valid |
IDNA | Enumerated | UTS | idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED |
IDNA | Enumerated | UTS | idna2008c | deviation, disallowed, ignored, mapped, valid |
IDNA | Enumerated | UTS | uts46 | deviation, disallowed, ignored, mapped, valid |
IDNA | String | UTS | Idn_Mapping | Show Values |
IDNA | String | UTS | toIdna2003 | Show Values |
IDNA | String | UTS | toUts46n | Show Values |
IDNA | String | UTS | toUts46t | Show Values |
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Diacritic | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Extender | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Base | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Extend | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Link | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Hyphen | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Math | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Quotation_Mark | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | STerm | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Terminal_Punctuation | No (N), Yes (Y) |
Miscellaneous | Enumerated | UCD | Indic_Positional_Category | Show Values |
Miscellaneous | Enumerated | UCD | Indic_Syllabic_Category | Show Values |
Miscellaneous | Miscellaneous | UCD | ISO_Comment | Show Values |
Miscellaneous | Miscellaneous | UCD | Unicode_1_Name | Show Values |
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | NFD_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | NFKC_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | NFKD_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | isNFM | No, Yes |
Normalization | Binary | UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) |
Normalization | Binary | UCD | Full_Composition_Exclusion | No (N), Yes (Y) |
Normalization | Binary | Unicode | isNFC | No, Yes |
Normalization | Binary | Unicode | isNFD | No, Yes |
Normalization | Binary | Unicode | isNFKC | No, Yes |
Normalization | Binary | Unicode | isNFKD | No, Yes |
Normalization | Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values |
Normalization | Enumerated | ICU | Trail_Canonical_Combining_Class | Show Values |
Normalization | Enumerated | UCD | Canonical_Combining_Class | Show Values |
Normalization | Enumerated | UCD | Decomposition_Type | Show Values |
Normalization | Enumerated | UCD | NFC_Quick_Check | Maybe (M), No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFD_Quick_Check | No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFKD_Quick_Check | No (N), Yes (Y) |
Normalization | String | ICU | toNFM | Show Values |
Normalization | String | UCD | NFKC_Casefold | Show Values |
Normalization | String | Unicode | toNFC | Show Values |
Normalization | String | Unicode | toNFD | Show Values |
Normalization | String | Unicode | toNFKC | Show Values |
Normalization | String | Unicode | toNFKD | Show Values |
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Numeric | Binary | UCD | Hex_Digit | No (N), Yes (Y) |
Numeric | Enumerated | UCD | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) |
Numeric | Enumerated | UCD | kAccountingNumeric | Show Values |
Numeric | Enumerated | UCD | kOtherNumeric | Show Values |
Numeric | Enumerated | UCD | kPrimaryNumeric | Show Values |
Numeric | Numeric | UCD | Numeric_Value | Show Values |
Regex | Binary | UTS | ANY | No, Yes |
Regex | Binary | UTS | ASCII | No, Yes |
Regex | Binary | UTS | alnum | No (N), Yes (Y) |
Regex | Binary | UTS | blank | No (N), Yes (Y) |
Regex | Binary | UTS | bmp | No, Yes |
Regex | Binary | UTS | graph | No (N), Yes (Y) |
Regex | Binary | UTS | | No (N), Yes (Y) |
Regex | Binary | UTS | xdigit | No (N), Yes (Y) |
Security | Enumerated | UTS | Confusable_MA | Show Values |
Security | Enumerated | UTS | Identifier_Status | Allowed (a), Restricted (r) |
Security | Enumerated | UTS | Identifier_Type | Show Values |
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y) |
Shaping and Rendering | Enumerated | UCD | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) |
Shaping and Rendering | Enumerated | UCD | Grapheme_Cluster_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Joining_Group | Show Values |
Shaping and Rendering | Enumerated | UCD | Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) |
Shaping and Rendering | Enumerated | UCD | Line_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Prepended_Concatenation_Mark | No (N), Yes (Y) |
Shaping and Rendering | Enumerated | UCD | Sentence_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Standardized_Variant | Show Values |
Shaping and Rendering | Enumerated | UCD | Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) |
Shaping and Rendering | Enumerated | UCD | Word_Break | Show Values |
UCA | Binary | UTS | uca | Show Values |
UCA | Binary | UTS | uca2 | Show Values |
UCA | Binary | UTS | uca2.5 | Show Values |
UCA | Binary | UTS | uca3 | too many values to show |
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
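
A handful of the properties listed in the table can also be inspected programmatically. The sketch below uses Python's standard `unicodedata` module, which is an assumption of this example and not something this page itself relies on; the helper name `describe` is hypothetical. The module ships its own copy of the Unicode data (see `unicodedata.unidata_version`), so results for recently added characters may differ from the data behind this page.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Return a few UCD-style property values for a single character.

    Keys reuse the property names from the table; values come from
    Python's bundled Unicode data, not from ICU.
    """
    return {
        "Name": unicodedata.name(ch, ""),                    # "" if unnamed
        "General_Category": unicodedata.category(ch),        # short alias, e.g. "Lu"
        "Bidi_Class": unicodedata.bidirectional(ch),         # short alias, e.g. "L"
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        "Canonical_Combining_Class": unicodedata.combining(ch),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        "Numeric_Value": unicodedata.numeric(ch, None),      # None if not numeric
        "toNFD": unicodedata.normalize("NFD", ch),
        "isNFC": unicodedata.is_normalized("NFC", ch),       # Python 3.8+
    }


if __name__ == "__main__":
    for ch in ("A", "(", "\u0301", "5", "\u00e9"):
        print(f"U+{ord(ch):04X}", describe(ch))
```

Only a small subset of the table is covered this way; properties sourced from ICU or the UTS data files (for example `Idn_Status` or `Identifier_Type`) are not exposed by `unicodedata`.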
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Suggested sources of fonts with broader coverage: Noto Fonts site; Unicode Fonts for Ancient Scripts; Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0;