Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.
help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Controlβ | No (N), Yes (Y) | |||
Bidi_Mirrored | No (N), Yes (Y) | |||
Bidi_Mirroredβ | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Classβ | Show Values | |||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
Bidi_Paired_Bracket_Typeβ | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Mirroring_Glyphβ | Show Values | |||
Bidi_Paired_Bracket | Show Values | |||
Bidi_Paired_Bracketβ | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case_Ignorableβ | No (N), Yes (Y) | |||
Cased | No (N), Yes (Y) | |||
Casedβ | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casefoldedβ | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Casemappedβ | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Lowercasedβ | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Titlecasedβ | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Changes_When_Uppercasedβ | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Lowercaseβ | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Soft_Dottedβ | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Uppercaseβ | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Case_Foldingβ | Show Values | |||
Lowercase_Mapping | Show Values | |||
Lowercase_Mappingβ | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Case_Foldingβ | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Lowercase_Mappingβ | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Titlecase_Mappingβ | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Simple_Uppercase_Mappingβ | Show Values | |||
Titlecase_Mapping | Show Values | |||
Titlecase_Mappingβ | Show Values | |||
Uppercase_Mapping | Show Values | |||
Uppercase_Mappingβ | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Binary_Operatorβ | No (N), Yes (Y) | |||
IDS_Trinary_Operator | No (N), Yes (Y) | |||
IDS_Trinary_Operatorβ | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Ideographicβ | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Radicalβ | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Unified_Ideographβ | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radicalβ | Show Values | |
Equivalent_Unified_Ideographβ | Show Values | |||
kSimplifiedVariantβ | Show Values | |||
kTraditionalVariantβ | 万萬 (万萬), 丑醜 (丑醜), 丟丢 (丟丢), 两兩 (两兩), 並併并 (並併并), 𠁔 (𠁔), 个個 (个個), 丰豐 (丰豐), 义義 (义義), 么幺麼麽 (么幺麼麽), 乐樂 (乐樂), 乔喬 (乔喬), 乱亂 (乱亂), 乾 (乾), 乾干幹 (乾干幹), 了瞭 (了瞭), 争爭 (争爭), 于於 (于於), 亏虧 (亏虧), 云雲 (云雲), 亞 (亞), 仅僅 (仅僅), 仆僕 (仆僕), 从從 (从從), 仪儀 (仪儀), 价價 (价價), 众眾衆 (众眾衆), 优優 (优優), 伙夥 (伙夥), 会會 (会會), 伤傷 (伤傷), 佇 (佇), 体體 (体體), 余餘 (余餘), 佣傭 (佣傭), 來 (來), 侖 (侖), 侠俠 (侠俠), 侩儈 (侩儈), 侶 (侶), 俁 (俁), 係系繫 (係系繫), 俓 (俓), 俔 (俔), 俥 (俥), 俦儔 (俦儔), 倀 (倀), 倆 (倆), 倈 (倈), 倉 (倉), 們 (們), 借藉 (借藉), 倫 (倫), 倲 (倲), 偉 (偉), 偑 (偑), 偩 (偩), 偬傯 (偬傯), 側 (側), 偵 (偵), 偻僂 (偻僂), 偽僞 (偽僞), 㑮 (㑮), 㑯 (㑯), 𰂠 (𰂠), 傌 (傌), 傑杰 (傑杰), 傖 (傖), 傘 (傘), 備 (備), 傢家 (傢家), 㑳 (㑳), 㑶 (㑶), 𠌥 (𠌥), 𪝖 (𪝖), 傪 (傪), 傱 (傱), 傳 (傳), 傴 (傴), 債 (債), 傾 (傾), 僀 (僀), 僆 (僆), 僉 (僉), 働 (働), 僑 (僑), 僓 (僓), 僗 (僗), 僤 (僤), 僥 (僥), 僨 (僨), 僩 (僩), 僴 (僴), 𠎅 (𠎅), 𠎒 (𠎒), 僾 (僾), 儁 (儁), 儂 (儂), 億 (億), 儅 (儅), 儉 (儉), 㒓 (㒓), 𠏢 (𠏢), 𰂴 (𰂴), 儐 (儐), 儕 (儕), 儖 (儖), 儘尽盡 (儘尽盡), 㒜 (㒜), 𠏮 (𠏮), 𠐇 (𠐇), 償 (償), 儢 (儢), 儣 (儣), 儥 (儥), 儩 (儩), 𠐊 (𠐊), 𠐍 (𠐍), 𪝵 (𪝵), 𫣴 (𫣴), 儰 (儰), 儱 (儱), 儲 (儲), 𠐮 (𠐮), 㒣 (㒣), 𠐽 (𠐽), 𠑇 (𠑇), 儷 (儷), 儸 (儸), 儹 (儹), 儺 (儺), 𠑙 (𠑙), 儻 (儻), 儼 (儼), 𠑲 (𠑲), 儿兒 (儿兒), 克剋 (克剋), 兌兑 (兌兑), 兗 (兗), 党黨 (党黨), 內内 (內内), 冊册 (冊册), 㒿 (㒿), 㓄 (㓄), 冪幂 (冪幂), 𠖫 (𠖫), 𰃴 (𰃴), 冬鼕 (冬鼕), 冲沖衝 (冲沖衝), 况況 (况況), 准準 (准準), 凈 (凈), 凉涼 (凉涼), 凍 (凍), 减減 (减減), 凑湊 (凑湊), 𭂖 (𭂖), 凔 (凔), 㓖 (㓖), 𠗿 (𠗿), 凙 (凙), 凜 (凜), 凟 (凟), 𫥝 (𫥝), 𠘥 (𠘥), 几幾 (几幾), 凤鳳 (凤鳳), 凭憑 (凭憑), 凱 (凱), 出齣 (出齣), 划劃 (划劃), 刘劉 (刘劉), 删刪 (删刪), 別别彆 (別别彆), 刮颳 (刮颳), 制製 (制製), 刹剎 (刹剎), 剄 (剄), 則 (則), 㓨刾 (㓨刾), 剗 (剗), 剛 (剛), 剝 (剝), 剧劇 (剧劇), 𠜲 (𠜲), 剮 (剮), 剴 (剴), 創 (創), 𠝿 (𠝿), 𠞆 (𠞆), 剸 (剸), 剾 (剾), 𠞭 (𠞭), 𫦔 (𫦔), 㔃 (㔃), 㔅 (㔅), 𫦙 (𫦙), 劊 (劊), 劌 (劌), 劍 (劍), 劏 (劏), 𠟪 (𠟪), 劑 (劑), 㔋 (㔋), 𪟖 (𪟖), 𠠎 (𠠎), 𠠏 (𠠏), 𠠝 (𠠝), 劗 (劗), 𠠫 (𠠫), 劚 (劚), 劝勸 (劝勸), 办辦 (办辦), 动動 (动動), 励勵 (励勵), 劳勞 (劳勞), 势勢 (势勢), 勁 (勁), 勑 (勑), 㔝 (㔝), 務 (務), 勛 (勛), 勝胜 (勝胜), 勣 (勣), 勩 (勩), 㔢 (㔢), 𫦸 (𫦸), 勱 (勱), 勴 (勴), 勻匀 (勻匀), 匭 (匭), 匯彙 (匯彙), 匰 (匰), 匱 (匱), 匵 (匵), 𫧝 (𫧝), 区區 (区區), 医醫 (医醫), 千韆 (千韆), 協 (協), 单單 (单單), 卜蔔 (卜蔔), 卨 (卨), 却卻 (却卻), 卷捲 (卷捲), 厂廠 (厂廠), 历曆歷 (历曆歷), 厉厲 (厉厲), 压壓 (压壓), 厘釐 (厘釐), 厙 (厙), 𠩘 (𠩘), 𠩬 (𠩬), 厠廁 (厠廁), 厩廄 (厩廄) too many values to show | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Extended_Pictographicβ | No (N), Yes (Y) | |||
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Basic_Emojiβ | No (No), Yes (Yes) | |||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Componentβ | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Baseβ | No (N), Yes (Y) | |||
Emoji_Modifierβ | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Presentationβ | No (N), Yes (Y) | |||
Emojiβ | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Flag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Keycap_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequenceβ | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
Regional_Indicatorβ | No (N), Yes (Y) | |||
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Alphabeticβ | No (N), Yes (Y) | |||
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Default_Ignorable_Code_Pointβ | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Deprecatedβ | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Logical_Order_Exceptionβ | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Noncharacter_Code_Pointβ | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
Variation_Selectorβ | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
White_Spaceβ | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Ageβ | Show Values | |||
Block | Show Values | |||
Blockβ | Show Values | |||
Script | Show Values | |||
Scriptβ | Show Values | |||
Enumerated | General_Category | Show Values | ||
General_Categoryβ | Show Values | |||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Hangul_Syllable_Typeβ | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Aliasβ | Show Values | |||
Named_Sequences_Provβ | ||||
Named_Sequencesβ | Show Values | |||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Nameβ | Show Values | |||
Script_Extensions | Show Values | |||
Script_Extensionsβ | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Continueβ | No (N), Yes (Y) | |||
ID_Start | No (N), Yes (Y) | |||
ID_Startβ | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_Syntaxβ | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
Pattern_White_Spaceβ | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Continueβ | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
XID_Startβ | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008β | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Statusβ | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mappingβ | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Dashβ | No (N), Yes (Y) | |||
Diacritic | No (N), Yes (Y) | |||
Diacriticβ | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Extenderβ | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Mathβ | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Quotation_Markβ | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
STermβ | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Terminal_Punctuationβ | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Positional_Categoryβ | Show Values | |||
Indic_Syllabic_Category | Show Values | |||
Indic_Syllabic_Categoryβ | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Changes_When_NFKC_Casefoldedβ | No (N), Yes (Y) | |||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Canonical_Combining_Classβ | Show Values | |||
Decomposition_Type | Show Values | |||
Decomposition_Typeβ | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFD_Quick_Checkβ | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
NFKD_Quick_Checkβ | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
NFKC_Casefoldβ | Show Values | |||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
ASCII_Hex_Digitβ | No (N), Yes (Y) | |||
Hex_Digit | No (N), Yes (Y) | |||
Hex_Digitβ | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric_Typeβ | Decimal (De), Digit (Di), None (None), Numeric (Nu) | |||
kAccountingNumericβ | Show Values | |||
kOtherNumericβ | Show Values | |||
kPrimaryNumericβ | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Numeric_Valueβ | Show Values | |||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MAβ | Show Values |
Identifier_Statusβ | Allowed (a), Restricted (r) | |||
Identifier_Typeβ | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Join_Controlβ | No (N), Yes (Y) | |||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
East_Asian_Widthβ | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |||
Grapheme_Cluster_Break | Show Values | |||
Grapheme_Cluster_Breakβ | Show Values | |||
Joining_Group | Show Values | |||
Joining_Groupβ | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Joining_Typeβ | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Line_Breakβ | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Prepended_Concatenation_Markβ | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Sentence_Breakβ | Show Values | |||
Standardized_Variantβ | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Vertical_Orientationβ | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
Word_Breakβ | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
ID_Compat_Math_Continueβ | Other | |||
ID_Compat_Math_Startβ | Other | |||
IDS_Unary_Operatorβ | Other | |||
NFKC_Simple_Casefoldβ | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;