help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Mirrored | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Paired_Bracket | Show Values | |||
Case | Binary | ICU | Case_Sensitive | No (N), Yes (Y) |
UCD | Case_Ignorable | No (N), Yes (Y) | ||
Cased | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
String | UCD | Case_Folding | Show Values | |
Lowercase_Mapping | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Titlecase_Mapping | Show Values | |||
Uppercase_Mapping | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Trinary_Operator | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radical | Show Values | |
kSimplifiedVariant | Show Values | |||
kTraditionalVariant | 丟 (丟), 並併 (並併), 乾幹 (乾幹), 亂 (亂), 亞 (亞), 佇 (佇), 余餘 (余餘), 來 (來), 侖 (侖), 侶 (侶), 俁 (俁), 係繫 (係繫), 俔 (俔), 俠 (俠), 俥 (俥), 倀 (倀), 倆 (倆), 倈 (倈), 倉 (倉), 個 (個), 們 (們), 倫 (倫), 倲 (倲), 偉 (偉), 偑 (偑), 側 (側), 偵 (偵), 偽僞 (偽僞), 㑯 (㑯), 傑 (傑), 傖 (傖), 傘 (傘), 備 (備), 㑳 (㑳), 𠌥 (𠌥), 傭 (傭), 傯 (傯), 傳 (傳), 傴 (傴), 債 (債), 傷 (傷), 傾 (傾), 僂 (僂), 僅 (僅), 僉 (僉), 僑 (僑), 僕 (僕), 僥 (僥), 僨 (僨), 價 (價), 儀 (儀), 儂 (儂), 億 (億), 儈 (儈), 儉 (儉), 㒓 (㒓), 𠏢 (𠏢), 儐 (儐), 儔 (儔), 儕 (儕), 儘盡 (儘盡), 償 (償), 儣 (儣), 優 (優), 儲 (儲), 儷 (儷), 儸 (儸), 儺 (儺), 儻 (儻), 儼 (儼), 兌 (兌), 兒 (兒), 兗 (兗), 內 (內), 兩 (兩), 冊 (冊), 冪 (冪), 凈 (凈), 凍 (凍), 凙 (凙), 凜 (凜), 凱 (凱), 別 (別), 刪 (刪), 剄 (剄), 則 (則), 剋 (剋), 剎 (剎), 㓨 (㓨), 剗 (剗), 剛 (剛), 剝 (剝), 剮 (剮), 剴 (剴), 創 (創), 𠞆 (𠞆), 剾 (剾), 劃 (劃), 劇 (劇), 劉 (劉), 劊 (劊), 劌 (劌), 劍 (劍), 劏 (劏), 劑 (劑), 𠠎 (𠠎), 劚 (劚), 勁 (勁), 動 (動), 務 (務), 勛 (勛), 勝 (勝), 勞 (勞), 勢 (勢), 勩 (勩), 勱 (勱), 勵 (勵), 勸 (勸), 勻 (勻), 匭 (匭), 匯彙 (匯彙), 匱 (匱), 區 (區), 協 (協), 卻 (卻), 厙 (厙), 厭 (厭), 厲 (厲), 厴 (厴), 參 (參), 叄 (叄), 叢 (叢), 台檯臺颱 (台檯臺颱), 同衕 (同衕), 后後 (后後), 吒 (吒), 吳 (吳), 吶 (吶), 呂 (呂), 咼 (咼), 員 (員), 哯 (哯), 唄 (唄), 唚 (唚), 問 (問), 啞 (啞), 啢 (啢), 喎 (喎), 喚 (喚), 喪 (喪), 喬 (喬), 單 (單), 喲 (喲), 噅 (噅), 嗆 (嗆), 嗇 (嗇), 嗊 (嗊), 嗎 (嗎), 嗚 (嗚), 嗩 (嗩), 嗶 (嗶), 嗹 (嗹), 嘆 (嘆), 嘍 (嘍), 嘓 (嘓), 嘔 (嘔), 嘖 (嘖), 嘗 (嘗), 嘜 (嘜), 噓 (噓), 嘩 (嘩), 嘮 (嘮), 嘯 (嘯), 嘰 (嘰), 嘵 (嘵), 嘸 (嘸), 嘽 (嘽), 噚 (噚), 噝 (噝), 噴 (噴), 㗲 (㗲), 噠 (噠), 噥 (噥), 噦 (噦), 噯 (噯), 噲 (噲), 噸 (噸), 噹當 (噹當), 嚀 (嚀), 嚇 (嚇), 嚌 (嚌), 嚕 (嚕), 嚙 (嚙), 嚦 (嚦), 嚨 (嚨), 嚲 (嚲), 嚳 (嚳), 嚴 (嚴), 嚶 (嚶), 𡄔 (𡄔), 𡄣 (𡄣), 囀 (囀), 囁 (囁), 囂 (囂), 𡅏 (𡅏), 囅 (囅), 囈 (囈), 囉 (囉), 㘚 (㘚), 囑 (囑), 囪 (囪), 圇 (圇), 國 (國), 圍 (圍), 園 (園), 圓 (圓), 圖 (圖), 團 (團), 圞 (圞), 垵 (垵), 埡 (埡), 埰採 (埰採), 執 (執), 堅 (堅), 堊 (堊), 堖 (堖), 堝 (堝), 堯 (堯), 報 (報), 場 (場), 塊 (塊), 塋 (塋), 塏 (塏), 塒 (塒), 塗 (塗), 塢 (塢), 塤 (塤), 塵 (塵), 塹 (塹), 墊 (墊), 墜 (墜), 墮 (墮), 墳 (墳), 墾 (墾), 壇罈 (壇罈), 壈 (壈), 壋 (壋), 𡑭 (𡑭), 壓 (壓), 壘 (壘), 壙 (壙), 壚 (壚), 壞 (壞), 壟 (壟), 壠 (壠) too many values to show | |||
Emoji | Binary | UTS | Emoji | No (N), Yes (Y) |
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Flag_Sequence | No (No), Yes (Yes) | |||
Emoji_Keycap_Sequence | No (No), Yes (Yes) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Sequence | No (No), Yes (Yes) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Tag_Sequence | No (No), Yes (Yes) | |||
Emoji_Zwj_Sequence | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Block | Show Values | |||
Script | Show Values | |||
Enumerated | General_Category | Show Values | ||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Alias | Show Values | |||
Named_Sequences | Show Values | |||
Named_Sequences_Prov | ||||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Script_Extensions | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Start | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mapping | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Diacritic | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Sentence_Terminal | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Syllabic_Category | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Decomposition_Type | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Hex_Digit | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
kAccountingNumeric | Show Values | |||
kOtherNumeric | Show Values | |||
kPrimaryNumeric | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MA | Show Values |
Identifier_Status | Allowed (a), Restricted (r) | |||
Identifier_Type | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
Grapheme_Cluster_Break | Show Values | |||
Joining_Group | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Standardized_Variant | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Basic_Emoji | Other |
Equivalent_Unified_Ideograph | Other | |||
Extended_Pictographic | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 63.1; Unicode version: 12.0;