help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
| Category | Datatype | Source | Property | Values |
|---|---|---|---|---|
| Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
| Bidi_Mirrored | No (N), Yes (Y) | |||
| Enumerated | Bidi_Class | Show Values | ||
| Bidi_Paired_Bracket_Type | Close (c), None (n), Open (o) | |||
| String | Bidi_Mirroring_Glyph | Show Values | ||
| Bidi_Paired_Bracket | Show Values | |||
| Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
| Cased | No (N), Yes (Y) | |||
| Changes_When_Casefolded | No (N), Yes (Y) | |||
| Changes_When_Casemapped | No (N), Yes (Y) | |||
| Changes_When_Lowercased | No (N), Yes (Y) | |||
| Changes_When_Titlecased | No (N), Yes (Y) | |||
| Changes_When_Uppercased | No (N), Yes (Y) | |||
| Lowercase | No (N), Yes (Y) | |||
| Soft_Dotted | No (N), Yes (Y) | |||
| Uppercase | No (N), Yes (Y) | |||
| String | Case_Folding | Show Values | ||
| Lowercase_Mapping | Show Values | |||
| Simple_Case_Folding | Show Values | |||
| Simple_Lowercase_Mapping | Show Values | |||
| Simple_Titlecase_Mapping | Show Values | |||
| Simple_Uppercase_Mapping | Show Values | |||
| Titlecase_Mapping | Show Values | |||
| Uppercase_Mapping | Show Values | |||
| Unicode | toCasefold | Show Values | ||
| toLowercase | Show Values | |||
| toTitlecase | Show Values | |||
| toUppercase | Show Values | |||
| CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
| IDS_Trinary_Operator | No (N), Yes (Y) | |||
| Ideographic | No (N), Yes (Y) | |||
| Radical | No (N), Yes (Y) | |||
| Unified_Ideograph | No (N), Yes (Y) | |||
| Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
| String | UCD | CJK_Radical | Show Values | |
| Equivalent_Unified_Ideograph | Show Values | |||
| kSimplifiedVariant | Show Values | |||
| kTraditionalVariant | Show Values | |||
| Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
| UTS | Basic_Emoji | No (N), Yes (Y) | ||
| Emoji | No (N), Yes (Y) | |||
| Emoji_Component | No (N), Yes (Y) | |||
| Emoji_Modifier | No (N), Yes (Y) | |||
| Emoji_Modifier_Base | No (N), Yes (Y) | |||
| Emoji_Presentation | No (N), Yes (Y) | |||
| RGI_Emoji | No, Yes | |||
| RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
| RGI_Emoji_Keycap_Sequence | No (N), Yes (Y) | |||
| RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
| RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
| RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
| Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
| General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
| Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
| Deprecated | No (N), Yes (Y) | |||
| Logical_Order_Exception | No (N), Yes (Y) | |||
| Noncharacter_Code_Point | No (N), Yes (Y) | |||
| Variation_Selector | No (N), Yes (Y) | |||
| White_Space | No (N), Yes (Y) | |||
| Catalog | Age | Show Values | ||
| Block | Show Values | |||
| Script | Show Values | |||
| Enumerated | General_Category | Show Values | ||
| Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
| Name_Alias | Show Values | |||
| Named_Sequences | Show Values | |||
| Named_Sequences_Prov | ||||
| String | Nameslist | subhead | Show Values | |
| UCD | Name | Show Values | ||
| Script_Extensions | Show Values | |||
| Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
| ID_Start | No (N), Yes (Y) | |||
| Pattern_Syntax | No (N), Yes (Y) | |||
| Pattern_White_Space | No (N), Yes (Y) | |||
| XID_Continue | No (N), Yes (Y) | |||
| XID_Start | No (N), Yes (Y) | |||
| IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
| Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
| idna2003 | deviation, disallowed, ignored, mapped, valid | |||
| idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
| idna2008c | deviation, disallowed, ignored, mapped, valid | |||
| uts46 | deviation, disallowed, ignored, mapped, valid | |||
| String | Idn_Mapping | Show Values | ||
| toIdna2003 | Show Values | |||
| toUts46n | Show Values | |||
| toUts46t | Show Values | |||
| Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
| Diacritic | No (N), Yes (Y) | |||
| Extender | No (N), Yes (Y) | |||
| Grapheme_Base | No (N), Yes (Y) | |||
| Grapheme_Extend | No (N), Yes (Y) | |||
| Grapheme_Link | No (N), Yes (Y) | |||
| Hyphen | No (N), Yes (Y) | |||
| Math | No (N), Yes (Y) | |||
| Quotation_Mark | No (N), Yes (Y) | |||
| STerm | No (N), Yes (Y) | |||
| Terminal_Punctuation | No (N), Yes (Y) | |||
| Enumerated | Indic_Positional_Category | Show Values | ||
| Indic_Syllabic_Category | Show Values | |||
| Miscellaneous | ISO_Comment | Show Values | ||
| Unicode_1_Name | Show Values | |||
| Normalization | Binary | ICU | isNFM | No, Yes |
| UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
| Full_Composition_Exclusion | No (N), Yes (Y) | |||
| Unicode | isNFC | No, Yes | ||
| isNFD | No, Yes | |||
| isNFKC | No, Yes | |||
| isNFKD | No, Yes | |||
| Enumerated | UCD | Canonical_Combining_Class | Show Values | |
| Decomposition_Type | Show Values | |||
| NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
| NFD_Quick_Check | No (N), Yes (Y) | |||
| NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
| NFKD_Quick_Check | No (N), Yes (Y) | |||
| String | ICU | toNFM | Show Values | |
| UCD | NFKC_Casefold | Show Values | ||
| Unicode | toNFC | Show Values | ||
| toNFD | Show Values | |||
| toNFKC | Show Values | |||
| toNFKD | Show Values | |||
| Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
| Hex_Digit | No (N), Yes (Y) | |||
| Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
| kAccountingNumeric | Show Values | |||
| kOtherNumeric | Show Values | |||
| kPrimaryNumeric | Show Values | |||
| Numeric | Numeric_Value | Show Values | ||
| Regex | Binary | UTS | ANY | No, Yes |
| ASCII | No, Yes | |||
| bmp | No, Yes | |||
| Security | Enumerated | UTS | Confusable_MA | Show Values |
| Identifier_Status | Allowed (a), Restricted (r) | |||
| Identifier_Type | Show Values | |||
| Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y) |
| Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
| Grapheme_Cluster_Break | Show Values | |||
| Joining_Group | Show Values | |||
| Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
| Line_Break | Show Values | |||
| Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
| Sentence_Break | Show Values | |||
| Standardized_Variant | Show Values | |||
| Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
| Word_Break | Show Values | |||
| UCA | Binary | UTS | uca | Show Values |
| uca2 | Show Values | |||
| uca2.5 | Show Values | |||
| uca3 | Show Values | |||
| Z-Other | Other | Other | Composition_Exclusion | Other |
| Confusable_ML | Other | |||
| Confusable_SA | Other | |||
| Confusable_SL | Other | |||
| Decomposition_Mapping | Other | |||
| Do_Not_Emit_Preferred | Other | |||
| Do_Not_Emit_Type | Other | |||
| Emoji_DCM | Other | |||
| Emoji_KDDI | Other | |||
| Emoji_SB | Other | |||
| exemplar | Other | |||
| exemplar_aux | Other | |||
| exemplar_punct | Other | |||
| Expands_On_NFC | Other | |||
| Expands_On_NFD | Other | |||
| Expands_On_NFKC | Other | |||
| Expands_On_NFKD | Other | |||
| FC_NFKC_Closure | Other | |||
| ID_Compat_Math_Continue | Other | |||
| ID_Compat_Math_Start | Other | |||
| IDS_Unary_Operator | Other | |||
| Indic_Conjunct_Break | Other | |||
| Jamo_Short_Name | Other | |||
| kAlternateTotalStrokes | Other | |||
| kBigFive | Other | |||
| kCangjie | Other | |||
| kCantonese | Other | |||
| kCCCII | Other | |||
| kCheungBauer | Other | |||
| kCheungBauerIndex | Other | |||
| kCihaiT | Other | |||
| kCNS1986 | Other | |||
| kCNS1992 | Other | |||
| kCompatibilityVariant | Other | |||
| kCowles | Other | |||
| kDaeJaweon | Other | |||
| kDefinition | Other | |||
| kEACC | Other | |||
| kEH_Cat | Other | |||
| kEH_Core | Other | |||
| kEH_Desc | Other | |||
| kEH_Func | Other | |||
| kEH_FVal | Other | |||
| kEH_HG | Other | |||
| kEH_IFAO | Other | |||
| kEH_JSesh | Other | |||
| kEH_NoMirror | Other | |||
| kEH_NoRotate | Other | |||
| kEH_UniK | Other | |||
| kFanqie | Other | |||
| kFenn | Other | |||
| kFennIndex | Other | |||
| kFourCornerCode | Other | |||
| kFrequency | Other | |||
| kGB0 | Other | |||
| kGB1 | Other | |||
| kGB3 | Other | |||
| kGB5 | Other | |||
| kGB7 | Other | |||
| kGB8 | Other | |||
| kGradeLevel | Other | |||
| kGSR | Other | |||
| kHangul | Other | |||
| kHanYu | Other | |||
| kHanyuPinlu | Other | |||
| kHanyuPinyin | Other | |||
| kHDZRadBreak | Other | |||
| kHKGlyph | Other | |||
| kHKSCS | Other | |||
| kIBMJapan | Other | |||
| kIICore | Other | |||
| kIRG_GSource | Other | |||
| kIRG_HSource | Other | |||
| kIRG_JSource | Other | |||
| kIRG_KPSource | Other | |||
| kIRG_KSource | Other | |||
| kIRG_MSource | Other | |||
| kIRG_SSource | Other | |||
| kIRG_TSource | Other | |||
| kIRG_UKSource | Other | |||
| kIRG_USource | Other | |||
| kIRG_VSource | Other | |||
| kIRGDaeJaweon | Other | |||
| kIRGDaiKanwaZiten | Other | |||
| kIRGHanyuDaZidian | Other | |||
| kIRGKangXi | Other | |||
| kJa | Other | |||
| kJapanese | Other | |||
| kJapaneseKun | Other | |||
| kJapaneseOn | Other | |||
| kJinmeiyoKanji | Other | |||
| kJis0 | Other | |||
| kJIS0213 | Other | |||
| kJis1 | Other | |||
| kJoyoKanji | Other | |||
| kKangXi | Other | |||
| kKarlgren | Other | |||
| kKorean | Other | |||
| kKoreanEducationHanja | Other | |||
| kKoreanName | Other | |||
| kKPS0 | Other | |||
| kKPS1 | Other | |||
| kKSC0 | Other | |||
| kKSC1 | Other | |||
| kLau | Other | |||
| kMainlandTelegraph | Other | |||
| kMandarin | Other | |||
| kMatthews | Other | |||
| kMeyerWempe | Other | |||
| kMojiJoho | Other | |||
| kMorohashi | Other | |||
| kNelson | Other | |||
| kPhonetic | Other | |||
| kPseudoGB1 | Other | |||
| kReading | Other | |||
| kRSAdobe_Japan1_6 | Other | |||
| kRSJapanese | Other | |||
| kRSKangXi | Other | |||
| kRSKanWa | Other | |||
| kRSKorean | Other | |||
| kRSTUnicode | Other | |||
| kRSUnicode | Other | |||
| kSBGY | Other | |||
| kSemanticVariant | Other | |||
| kSMSZD2003Index | Other | |||
| kSMSZD2003Readings | Other | |||
| kSpecializedSemanticVariant | Other | |||
| kSpoofingVariant | Other | |||
| kSrc_NushuDuben | Other | |||
| kStrange | Other | |||
| kTaiwanTelegraph | Other | |||
| kTang | Other | |||
| kTGH | Other | |||
| kTGHZ2013 | Other | |||
| kTGT_MergedSrc | Other | |||
| kTotalStrokes | Other | |||
| kUnihanCore2020 | Other | |||
| kVietnamese | Other | |||
| kVietnameseNumeric | Other | |||
| kXerox | Other | |||
| kXHC1983 | Other | |||
| kZhuangNumeric | Other | |||
| kZVariant | Other | |||
| Modifier_Combining_Mark | Other | |||
| NFKC_Simple_Casefold | Other | |||
| Other_Alphabetic | Other | |||
| Other_Default_Ignorable_Code_Point | Other | |||
| Other_Grapheme_Extend | Other | |||
| Other_ID_Continue | Other | |||
| Other_ID_Start | Other | |||
| Other_Joining_Type | Other | |||
| Other_Lowercase | Other | |||
| Other_Math | Other | |||
| Other_Uppercase | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0;