Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
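Bidirectional | Binary | UCD | Bidi_Mirrored | No (N), Yes (Y) |
Bidirectional | Enumerated | UCD | Bidi_Class | Show Values |
Bidirectional | Enumerated | UCD | Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) |
Bidirectional | String | UCD | Bidi_Mirroring_Glyph | Show Values |
Bidirectional | String | UCD | Bidi_Paired_Bracket | Show Values |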
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case | Binary | UCD | Cased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Casefolded | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Casemapped | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Lowercased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Titlecased | No (N), Yes (Y) |
Case | Binary | UCD | Changes_When_Uppercased | No (N), Yes (Y) |
Case | Binary | UCD | Lowercase | No (N), Yes (Y) |
Case | Binary | UCD | Soft_Dotted | No (N), Yes (Y) |
Case | Binary | UCD | Uppercase | No (N), Yes (Y) |
Case | Binary | Unicode | isCased | No (N), Yes (Y) |
Case | Binary | Unicode | isCasefolded | No (N), Yes (Y) |
Case | Binary | Unicode | isLowercase | No (N), Yes (Y) |
Case | Binary | Unicode | isTitlecase | No (N), Yes (Y) |
Case | Binary | Unicode | isUppercase | No (N), Yes (Y) |
Case | Binary | X-ICU | Case_Sensitive | No (N), Yes (Y) |
Case | String | UCD | Case_Folding | Show Values |
Case | String | UCD | Lowercase_Mapping | Show Values |
Case | String | UCD | Simple_Case_Folding | Show Values |
Case | String | UCD | Simple_Lowercase_Mapping | Show Values |
Case | String | UCD | Simple_Titlecase_Mapping | Show Values |
Case | String | UCD | Simple_Uppercase_Mapping | Show Values |
Case | String | UCD | Titlecase_Mapping | Show Values |
Case | String | UCD | Uppercase_Mapping | Show Values |
Case | String | Unicode | toCasefold | Show Values |
Case | String | Unicode | toLowercase | Show Values |
Case | String | Unicode | toTitlecase | Show Values |
Case | String | Unicode | toUppercase | Show Values |
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
CJK | Binary | UCD | IDS_Trinary_Operator | No (N), Yes (Y) |
CJK | Binary | UCD | Ideographic | No (N), Yes (Y) |
CJK | Binary | UCD | Radical | No (N), Yes (Y) |
CJK | Binary | UCD | Unified_Ideograph | No (N), Yes (Y) |
CJK | Enumerated | X-Demo | HanType | Han, Hans, Hant, na |
CJK | String | UCD | CJK_Radical | Show Values |
CJK | String | UCD | Equivalent_Unified_Ideograph | Show Values |
CJK | String | UCD | kSimplifiedVariant | Show Values |
CJK | String | UCD | kTraditionalVariant | Show Values |
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Emoji | Binary | UTS | Basic_Emoji | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Component | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Modifier | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Modifier_Base | No (N), Yes (Y) |
Emoji | Binary | UTS | Emoji_Presentation | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji | No, Yes |
Emoji | Binary | UTS | RGI_Emoji_Flag_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Keycap_Sequence | No (No), Yes (Yes) |
Emoji | Binary | UTS | RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Tag_Sequence | No (N), Yes (Y) |
Emoji | Binary | UTS | RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) |
Emoji | Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
General | Binary | UCD | Default_Ignorable_Code_Point | No (N), Yes (Y) |
General | Binary | UCD | Deprecated | No (N), Yes (Y) |
General | Binary | UCD | Logical_Order_Exception | No (N), Yes (Y) |
General | Binary | UCD | Noncharacter_Code_Point | No (N), Yes (Y) |
General | Binary | UCD | Variation_Selector | No (N), Yes (Y) |
General | Binary | UCD | White_Space | No (N), Yes (Y) |
General | Catalog | UCD | Age | Show Values |
General | Catalog | UCD | Block | Show Values |
General | Catalog | UCD | Script | Show Values |
General | Enumerated | UCD | General_Category | Show Values |
General | Enumerated | UCD | Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) |
General | Enumerated | UCD | Name_Alias | Show Values |
General | Enumerated | UCD | Named_Sequences | Show Values |
General | Enumerated | UCD | Named_Sequences_Prov | |
General | String | Nameslist | subhead | Show Values |
General | String | UCD | Name | Show Values |
General | String | UCD | Script_Extensions | Show Values |
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
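Identifiers | Binary | UCD | ID_Start | No (N), Yes (Y) |
Identifiers | Binary | UCD | Pattern_Syntax | No (N), Yes (Y) |
Identifiers | Binary | UCD | Pattern_White_Space | No (N), Yes (Y) |
Identifiers | Binary | UCD | XID_Continue | No (N), Yes (Y) |
Identifiers | Binary | UCD | XID_Start | No (N), Yes (Y) |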
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
IDNA | Enumerated | UTS | Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) |
IDNA | Enumerated | UTS | idna2003 | deviation, disallowed, ignored, mapped, valid |
IDNA | Enumerated | UTS | idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED |
IDNA | Enumerated | UTS | idna2008c | deviation, disallowed, ignored, mapped, valid |
IDNA | Enumerated | UTS | uts46 | deviation, disallowed, ignored, mapped, valid |
IDNA | String | UTS | Idn_Mapping | Show Values |
IDNA | String | UTS | toIdna2003 | Show Values |
IDNA | String | UTS | toUts46n | Show Values |
IDNA | String | UTS | toUts46t | Show Values |
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
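Miscellaneous | Binary | UCD | Diacritic | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Extender | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Base | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Extend | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Grapheme_Link | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Hyphen | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Math | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Quotation_Mark | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | STerm | No (N), Yes (Y) |
Miscellaneous | Binary | UCD | Terminal_Punctuation | No (N), Yes (Y) |
Miscellaneous | Enumerated | UCD | Indic_Positional_Category | Show Values |
Miscellaneous | Enumerated | UCD | Indic_Syllabic_Category | Show Values |
Miscellaneous | Miscellaneous | UCD | ISO_Comment | Show Values |
Miscellaneous | Miscellaneous | UCD | Unicode_1_Name | Show Values |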
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
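Normalization | Binary | ICU | NFD_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | NFKC_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | NFKD_Inert | No (N), Yes (Y) |
Normalization | Binary | ICU | isNFM | No, Yes |
Normalization | Binary | UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) |
Normalization | Binary | UCD | Full_Composition_Exclusion | No (N), Yes (Y) |
Normalization | Binary | Unicode | isNFC | No, Yes |
Normalization | Binary | Unicode | isNFD | No, Yes |
Normalization | Binary | Unicode | isNFKC | No, Yes |
Normalization | Binary | Unicode | isNFKD | No, Yes |
Normalization | Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values |
Normalization | Enumerated | ICU | Trail_Canonical_Combining_Class | Show Values |
Normalization | Enumerated | UCD | Canonical_Combining_Class | Show Values |
Normalization | Enumerated | UCD | Decomposition_Type | Show Values |
Normalization | Enumerated | UCD | NFC_Quick_Check | Maybe (M), No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFD_Quick_Check | No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) |
Normalization | Enumerated | UCD | NFKD_Quick_Check | No (N), Yes (Y) |
Normalization | String | ICU | toNFM | Show Values |
Normalization | String | UCD | NFKC_Casefold | Show Values |
Normalization | String | Unicode | toNFC | Show Values |
Normalization | String | Unicode | toNFD | Show Values |
Normalization | String | Unicode | toNFKC | Show Values |
Normalization | String | Unicode | toNFKD | Show Values |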
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Numeric | Binary | UCD | Hex_Digit | No (N), Yes (Y) |
Numeric | Enumerated | UCD | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) |
Numeric | Enumerated | UCD | kAccountingNumeric | Show Values |
Numeric | Enumerated | UCD | kOtherNumeric | Show Values |
Numeric | Enumerated | UCD | kPrimaryNumeric | Show Values |
Numeric | Numeric | UCD | Numeric_Value | Show Values |
Regex | Binary | UTS | ANY | No, Yes |
Regex | Binary | UTS | ASCII | No, Yes |
Regex | Binary | UTS | alnum | No (N), Yes (Y) |
Regex | Binary | UTS | blank | No (N), Yes (Y) |
Regex | Binary | UTS | bmp | No, Yes |
Regex | Binary | UTS | graph | No (N), Yes (Y) |
Regex | Binary | UTS | print | No (N), Yes (Y) |
Regex | Binary | UTS | xdigit | No (N), Yes (Y) |
Security | Enumerated | UTS | Confusable_MA | Show Values |
Security | Enumerated | UTS | Identifier_Status | Allowed (a), Restricted (r) |
Security | Enumerated | UTS | Identifier_Type | Show Values |
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
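Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y) |
Shaping and Rendering | Enumerated | UCD | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) |
Shaping and Rendering | Enumerated | UCD | Grapheme_Cluster_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Joining_Group | Show Values |
Shaping and Rendering | Enumerated | UCD | Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) |
Shaping and Rendering | Enumerated | UCD | Line_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Prepended_Concatenation_Mark | No (N), Yes (Y) |
Shaping and Rendering | Enumerated | UCD | Sentence_Break | Show Values |
Shaping and Rendering | Enumerated | UCD | Standardized_Variant | Show Values |
Shaping and Rendering | Enumerated | UCD | Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) |
Shaping and Rendering | Enumerated | UCD | Word_Break | Show Values |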
UCA | Binary | UTS | uca | too many values to show |
UCA | Binary | UTS | uca2 | Show Values |
UCA | Binary | UTS | uca2.5 | Show Values |
UCA | Binary | UTS | uca3 | Show Values |
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
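As a rough illustration of how the Datatypes differ in practice, the sketch below reads a handful of the properties listed in the table using only Python's standard library. The helper name `describe` is hypothetical, and the pairing of each standard-library call with a property name from the table is an approximation, not part of this page. Python's `unicodedata` module ships its own Unicode version (see `unicodedata.unidata_version`), which may differ from the version reported at the bottom of this page.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few properties of one character, grouped by Datatype.

    `describe` is a hypothetical helper; the keys reuse property names
    from the table, and the mapping to stdlib calls is an assumption.
    """
    return {
        # Binary: the stdlib returns 1 or 0, converted here to a bool
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        # Enumerated: the stdlib returns the short value aliases
        "General_Category": unicodedata.category(ch),
        "Bidi_Class": unicodedata.bidirectional(ch),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        "Canonical_Combining_Class": unicodedata.combining(ch),
        # Numeric: None when the character has no numeric value
        "Numeric_Value": unicodedata.numeric(ch, None),
        # String: names and mappings whose value is itself text
        "Name": unicodedata.name(ch, ""),
        "toNFD": unicodedata.normalize("NFD", ch),
        "toUppercase": ch.upper(),
        "toCasefold": ch.casefold(),
    }


if __name__ == "__main__":
    for ch in "(½éß":
        print(f"U+{ord(ch):04X}", describe(ch))
```

Binary properties come back as a yes/no answer, Enumerated properties as one value from a fixed set, and String properties as arbitrary text, which is why the table can list the values inline for the first two but only links to them for the third.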
The Sources that appear in the table are: UCD, Unicode, UTS, ICU, X-ICU, X-Demo, Nameslist, and Other.
Fonts and Display. If you don't have a good set of Unicode fonts (and a modern browser), you may not be able to read some of the characters. Suggested fonts for broader coverage include: the Noto Fonts site; Unicode Fonts for Ancient Scripts; and Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0;