Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.
help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Controlβ | No (N), Yes (Y) | |||
Bidi_Mirrored | No (N), Yes (Y) | |||
Bidi_Mirroredβ | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Classβ | Show Values | |||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
Bidi_Paired_Bracket_Typeβ | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Mirroring_Glyphβ | Show Values | |||
Bidi_Paired_Bracket | Show Values | |||
Bidi_Paired_Bracketβ | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case_Ignorableβ | No (N), Yes (Y) | |||
Cased | No (N), Yes (Y) | |||
Casedβ | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casefoldedβ | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Casemappedβ | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Lowercasedβ | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Titlecasedβ | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Changes_When_Uppercasedβ | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Lowercaseβ | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Soft_Dottedβ | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Uppercaseβ | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Case_Foldingβ | Show Values | |||
Lowercase_Mapping | Show Values | |||
Lowercase_Mappingβ | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Case_Foldingβ | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Lowercase_Mappingβ | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Titlecase_Mappingβ | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Simple_Uppercase_Mappingβ | Show Values | |||
Titlecase_Mapping | Show Values | |||
Titlecase_Mappingβ | Show Values | |||
Uppercase_Mapping | Show Values | |||
Uppercase_Mappingβ | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Binary_Operatorβ | No (N), Yes (Y) | |||
IDS_Trinary_Operator | No (N), Yes (Y) | |||
IDS_Trinary_Operatorβ | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Ideographicβ | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Radicalβ | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Unified_Ideographβ | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radicalβ | Show Values | |
Equivalent_Unified_Ideographβ | Show Values | |||
kSimplifiedVariantβ | Show Values | |||
kTraditionalVariantβ | Show Values | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Extended_Pictographicβ | No (N), Yes (Y) | |||
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Basic_Emojiβ | No (No), Yes (Yes) | |||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Componentβ | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Baseβ | No (N), Yes (Y) | |||
Emoji_Modifierβ | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Presentationβ | No (N), Yes (Y) | |||
Emojiβ | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Flag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Keycap_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequenceβ | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
Regional_Indicatorβ | No (N), Yes (Y) | |||
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Alphabeticβ | No (N), Yes (Y) | |||
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Default_Ignorable_Code_Pointβ | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Deprecatedβ | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Logical_Order_Exceptionβ | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Noncharacter_Code_Pointβ | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
Variation_Selectorβ | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
White_Spaceβ | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Ageβ | Show Values | |||
Block | Show Values | |||
Blockβ | Show Values | |||
Script | Show Values | |||
Scriptβ | Adlam (Adlm), Ahom (Ahom), Anatolian_Hieroglyphs (Hluw), Arabic (Arab), Armenian (Armn), Avestan (Avst), Balinese (Bali), Bamum (Bamu), Bassa_Vah (Bass), Batak (Batk), Bengali (Beng), Bhaiksuki (Bhks), Bopomofo (Bopo), Brahmi (Brah), Braille (Brai), Buginese (Bugi), Buhid (Buhd), Canadian_Aboriginal (Cans), Carian (Cari), Caucasian_Albanian (Aghb), Chakma (Cakm), Cham (Cham), Cherokee (Cher), Chorasmian (Chrs), Common (Zyyy), Coptic (Copt), Cuneiform (Xsux), Cypriot (Cprt), Cypro_Minoan (Cpmn), Cyrillic (Cyrl), Deseret (Dsrt), Devanagari (Deva), Dives_Akuru (Diak), Dogra (Dogr), Duployan (Dupl), Egyptian_Hieroglyphs (Egyp), Elbasan (Elba), Elymaic (Elym), Ethiopic (Ethi), Georgian (Geor), Glagolitic (Glag), Gothic (Goth), Grantha (Gran), Greek (Grek), Gujarati (Gujr), Gunjala_Gondi (Gong), Gurmukhi (Guru), Han (Hani), Hangul (Hang), Hanifi_Rohingya (Rohg), Hanunoo (Hano), Hatran (Hatr), Hebrew (Hebr), Hiragana (Hira), Imperial_Aramaic (Armi), Inherited (Zinh), Inscriptional_Pahlavi (Phli), Inscriptional_Parthian (Prti), Javanese (Java), Kaithi (Kthi), Kannada (Knda), Katakana (Kana), Kawi (Kawi), Kayah_Li (Kali), Kharoshthi (Khar), Khitan_Small_Script (Kits), Khmer (Khmr), Khojki (Khoj), Khudawadi (Sind), Lao (Laoo), Latin (Latn), Lepcha (Lepc), Limbu (Limb), Linear_A (Lina), Linear_B (Linb), Lisu (Lisu), Lycian (Lyci), Lydian (Lydi), Mahajani (Mahj), Makasar (Maka), Malayalam (Mlym), Mandaic (Mand), Manichaean (Mani), Marchen (Marc), Masaram_Gondi (Gonm), Medefaidrin (Medf), Meetei_Mayek (Mtei), Mende_Kikakui (Mend), Meroitic_Cursive (Merc), Meroitic_Hieroglyphs (Mero), Miao (Plrd), Modi (Modi), Mongolian (Mong), Mro (Mroo), Multani (Mult), Myanmar (Mymr), Nabataean (Nbat), Nag_Mundari (Nagm), Nandinagari (Nand), New_Tai_Lue (Talu), Newa (Newa), Nko (Nkoo), Nushu (Nshu), Nyiakeng_Puachue_Hmong (Hmnp), Ogham (Ogam), Ol_Chiki (Olck), Old_Hungarian (Hung), Old_Italic (Ital), Old_North_Arabian (Narb), Old_Permic (Perm), Old_Persian (Xpeo), Old_Sogdian (Sogo), Old_South_Arabian (Sarb), Old_Turkic (Orkh), Old_Uyghur (Ougr), Oriya (Orya), Osage (Osge), Osmanya (Osma), Pahawh_Hmong (Hmng), Palmyrene (Palm), Pau_Cin_Hau (Pauc), Phags_Pa (Phag), Phoenician (Phnx), Psalter_Pahlavi (Phlp), Rejang (Rjng), Runic (Runr), Samaritan (Samr), Saurashtra (Saur), Sharada (Shrd), Shavian (Shaw), Siddham (Sidd), SignWriting (Sgnw), Sinhala (Sinh), Sogdian (Sogd), Sora_Sompeng (Sora), Soyombo (Soyo), Sundanese (Sund), Syloti_Nagri (Sylo), Syriac (Syrc), Tagalog (Tglg), Tagbanwa (Tagb), Tai_Le (Tale), Tai_Tham (Lana), Tai_Viet (Tavt), Takri (Takr), Tamil (Taml), Tangsa (Tnsa), Tangut (Tang), Telugu (Telu), Thaana (Thaa), Thai (Thai), Tibetan (Tibt), Tifinagh (Tfng), Tirhuta (Tirh), Toto (Toto), Ugaritic (Ugar), Unknown (Zzzz), Vai (Vaii), Vithkuqi (Vith), Wancho (Wcho), Warang_Citi (Wara), Yezidi (Yezi), Yi (Yiii), Zanabazar_Square (Zanb) | |||
Enumerated | General_Category | Show Values | ||
General_Categoryβ | Show Values | |||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Hangul_Syllable_Typeβ | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Aliasβ | Show Values | |||
Named_Sequences_Provβ | ||||
Named_Sequencesβ | Show Values | |||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Nameβ | Show Values | |||
Script_Extensions | Show Values | |||
Script_Extensionsβ | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Continueβ | No (N), Yes (Y) | |||
ID_Start | No (N), Yes (Y) | |||
ID_Startβ | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_Syntaxβ | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
Pattern_White_Spaceβ | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Continueβ | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
XID_Startβ | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008β | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Statusβ | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mappingβ | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Dashβ | No (N), Yes (Y) | |||
Diacritic | No (N), Yes (Y) | |||
Diacriticβ | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Extenderβ | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Mathβ | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Quotation_Markβ | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
STermβ | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Terminal_Punctuationβ | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Positional_Categoryβ | Show Values | |||
Indic_Syllabic_Category | Show Values | |||
Indic_Syllabic_Categoryβ | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Changes_When_NFKC_Casefoldedβ | No (N), Yes (Y) | |||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Canonical_Combining_Classβ | Show Values | |||
Decomposition_Type | Show Values | |||
Decomposition_Typeβ | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFD_Quick_Checkβ | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
NFKD_Quick_Checkβ | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
NFKC_Casefoldβ | Show Values | |||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
ASCII_Hex_Digitβ | No (N), Yes (Y) | |||
Hex_Digit | No (N), Yes (Y) | |||
Hex_Digitβ | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric_Typeβ | Decimal (De), Digit (Di), None (None), Numeric (Nu) | |||
kAccountingNumericβ | Show Values | |||
kOtherNumericβ | Show Values | |||
kPrimaryNumericβ | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Numeric_Valueβ | Show Values | |||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MAβ | Show Values |
Identifier_Statusβ | Allowed (a), Restricted (r) | |||
Identifier_Typeβ | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Join_Controlβ | No (N), Yes (Y) | |||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
East_Asian_Widthβ | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |||
Grapheme_Cluster_Break | Show Values | |||
Grapheme_Cluster_Breakβ | Show Values | |||
Joining_Group | Show Values | |||
Joining_Groupβ | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Joining_Typeβ | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Line_Breakβ | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Prepended_Concatenation_Markβ | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Sentence_Breakβ | Show Values | |||
Standardized_Variantβ | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Vertical_Orientationβ | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
Word_Breakβ | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
ID_Compat_Math_Continueβ | Other | |||
ID_Compat_Math_Startβ | Other | |||
IDS_Unary_Operatorβ | Other | |||
NFKC_Simple_Casefoldβ | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;