Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.
help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Controlβ | No (N), Yes (Y) | |||
Bidi_Mirrored | No (N), Yes (Y) | |||
Bidi_Mirroredβ | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Classβ | Show Values | |||
Bidi_Paired_Bracket_Type | Close (C), None (N), Open (O) | |||
Bidi_Paired_Bracket_Typeβ | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Mirroring_Glyphβ | Show Values | |||
Bidi_Paired_Bracket | Show Values | |||
Bidi_Paired_Bracketβ | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Case_Ignorableβ | No (N), Yes (Y) | |||
Cased | No (N), Yes (Y) | |||
Casedβ | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casefoldedβ | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Casemappedβ | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Lowercasedβ | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Titlecasedβ | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Changes_When_Uppercasedβ | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Lowercaseβ | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Soft_Dottedβ | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Uppercaseβ | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Case_Foldingβ | Show Values | |||
Lowercase_Mapping | Show Values | |||
Lowercase_Mappingβ | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Case_Foldingβ | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Lowercase_Mappingβ | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Titlecase_Mappingβ | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Simple_Uppercase_Mappingβ | Show Values | |||
Titlecase_Mapping | Show Values | |||
Titlecase_Mappingβ | Show Values | |||
Uppercase_Mapping | Show Values | |||
Uppercase_Mappingβ | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Binary_Operatorβ | No (N), Yes (Y) | |||
IDS_Trinary_Operator | No (N), Yes (Y) | |||
IDS_Trinary_Operatorβ | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Ideographicβ | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Radicalβ | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Unified_Ideographβ | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radicalβ | Show Values | |
Equivalent_Unified_Ideographβ | Show Values | |||
kSimplifiedVariantβ | Show Values | |||
kTraditionalVariantβ | Show Values | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
Extended_Pictographicβ | No (N), Yes (Y) | |||
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Basic_Emojiβ | No (No), Yes (Yes) | |||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Componentβ | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Modifier_Baseβ | No (N), Yes (Y) | |||
Emoji_Modifierβ | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
Emoji_Presentationβ | No (N), Yes (Y) | |||
Emojiβ | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Flag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Keycap_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequenceβ | No (No), Yes (Yes) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequenceβ | No (No), Yes (Yes) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
Regional_Indicatorβ | No (N), Yes (Y) | |||
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Alphabeticβ | No (N), Yes (Y) | |||
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Default_Ignorable_Code_Pointβ | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Deprecatedβ | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Logical_Order_Exceptionβ | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Noncharacter_Code_Pointβ | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
Variation_Selectorβ | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
White_Spaceβ | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Ageβ | Show Values | |||
Block | Show Values | |||
Blockβ | Show Values | |||
Script | Show Values | |||
Scriptβ | Show Values | |||
Enumerated | General_Category | Show Values | ||
General_Categoryβ | Show Values | |||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Hangul_Syllable_Typeβ | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Aliasβ | ACK; ACKNOWLEDGE, ALERT; BEL, ALM, APC; APPLICATION PROGRAM COMMAND, ARABIC SMALL HIGH LIGATURE ALEF WITH YEH BARREE, BACKSPACE; BS, BOM; BYTE ORDER MARK; ZWNBSP, BPH; BREAK PERMITTED HERE, BYZANTINE MUSICAL SYMBOL FTHORA SKLIRON CHROMA VASIS, CAN; CANCEL, CANCEL CHARACTER; CCH, CARRIAGE RETURN; CR, CGJ, CHARACTER TABULATION SET; HORIZONTAL TABULATION SET; HTS, CHARACTER TABULATION WITH JUSTIFICATION; HORIZONTAL TABULATION WITH JUSTIFICATION; HTJ, CHARACTER TABULATION; HORIZONTAL TABULATION; HT; TAB, CONTROL SEQUENCE INTRODUCER; CSI, CUNEIFORM SIGN NU11 OVER NU11 BUR OVER BUR, CUNEIFORM SIGN NU11 TENU, DATA LINK ESCAPE; DLE, DC1; DEVICE CONTROL ONE, DC2; DEVICE CONTROL TWO, DC3; DEVICE CONTROL THREE, DC4; DEVICE CONTROL FOUR, DCS; DEVICE CONTROL STRING, DEL; DELETE, EM; END OF MEDIUM; EOM, END OF GUARDED AREA; END OF PROTECTED AREA; EPA, END OF LINE; EOL; LF; LINE FEED; NEW LINE; NL, END OF SELECTED AREA; ESA, END OF TEXT; ETX, END OF TRANSMISSION BLOCK; ETB, END OF TRANSMISSION; EOT, ENQ; ENQUIRY, ESC; ESCAPE, FF; FORM FEED, FILE SEPARATOR; FS; INFORMATION SEPARATOR FOUR, FSI, FVS1, FVS2, FVS3, FVS4, GROUP SEPARATOR; GS; INFORMATION SEPARATOR THREE, HANGUL JONGSEONG SSANGYESIEUNG, HANGUL JONGSEONG YESIEUNG-KHIEUKH, HANGUL JONGSEONG YESIEUNG-KIYEOK, HANGUL JONGSEONG YESIEUNG-SSANGKIYEOK, HENTAIGANA LETTER E-1, HIGH OCTET PRESET; HOP, IND; INDEX, INFORMATION SEPARATOR ONE; UNIT SEPARATOR; US, INFORMATION SEPARATOR TWO; RECORD SEPARATOR; RS, KANNADA LETTER LLLA, LAO LETTER FO FAY, LAO LETTER FO FON, LAO LETTER LO, LAO LETTER RO, LATIN CAPITAL LETTER GHA, LATIN SMALL LETTER GHA, LEFTWARDS TRIANGLE-HEADED ARROW WITH DOUBLE VERTICAL STROKE, LINE TABULATION SET; VERTICAL TABULATION SET; VTS, LINE TABULATION; VERTICAL TABULATION; VT, LOCKING-SHIFT ONE; SHIFT OUT; SO, LOCKING-SHIFT ZERO; SHIFT IN; SI, LRE, LRI, LRM, LRO, MEDEFAIDRIN CAPITAL LETTER H, MEDEFAIDRIN CAPITAL LETTER NG, MEDEFAIDRIN SMALL LETTER H, MEDEFAIDRIN SMALL LETTER NG, MESSAGE WAITING; MW, MICR DASH SYMBOL, MICR ON US SYMBOL, MMSP, MVS, MYANMAR LETTER KHAMTI LLA, NAK; NEGATIVE ACKNOWLEDGE, NBH; NO BREAK HERE, NBSP, NEL; NEXT LINE, NNBSP, NUL; NULL, OPERATING SYSTEM COMMAND; OSC, PAD; PADDING CHARACTER, PARTIAL LINE BACKWARD; PARTIAL LINE UP; PLU, PARTIAL LINE DOWN; PARTIAL LINE FORWARD; PLD, PDF, PDI, PM; PRIVACY MESSAGE, PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRACKET, PRIVATE USE ONE; PRIVATE USE-1; PU1, PRIVATE USE TWO; PRIVATE USE-2; PU2, REVERSE INDEX; REVERSE LINE FEED; RI, RIGHTWARDS TRIANGLE-HEADED ARROW WITH DOUBLE VERTICAL STROKE, RLE, RLI, RLM, RLO, SCI; SINGLE CHARACTER INTRODUCER, SET TRANSMIT STATE; STS, SGC; SINGLE GRAPHIC CHARACTER INTRODUCER, SHY, SINGLE SHIFT THREE; SINGLE-SHIFT-3; SS3, SINGLE SHIFT TWO; SINGLE-SHIFT-2; SS2, SOH; START OF HEADING, SOS; START OF STRING, SP, SPA; START OF GUARDED AREA; START OF PROTECTED AREA, SSA; START OF SELECTED AREA, ST; STRING TERMINATOR, START OF TEXT; STX, SUB; SUBSTITUTE, SUNDANESE LETTER ARCHAIC I, SYN; SYNCHRONOUS IDLE, SYRIAC SUBLINEAR COLON SKEWED LEFT, TIBETAN MARK BKA- SHOG GI MGO RGYAN, VS1, VS2, VS3, VS4, VS5, VS6, VS7, VS8, VS9, VS10, VS11, VS12, VS13, VS14, VS15, VS16, VS17, VS18, VS19, VS20, VS21, VS22, VS23, VS24, VS25, VS26, VS27, VS28, VS29, VS30, VS31, VS32, VS33, VS34, VS35, VS36, VS37, VS38, VS39, VS40, VS41, VS42, VS43, VS44, VS45, VS46, VS47, VS48, VS49, VS50, VS51, VS52, VS53, VS54, VS55, VS56, VS57, VS58, VS59, VS60, VS61, VS62, VS63, VS64, VS65, VS66, VS67, VS68, VS69, VS70, VS71, VS72, VS73, VS74, VS75, VS76, VS77, VS78, VS79, VS80, VS81, VS82, VS83, VS84, VS85, VS86, VS87, VS88, VS89, VS90, VS91, VS92, VS93, VS94, VS95, VS96, VS97, VS98, VS99, VS100, VS101, VS102, VS103, VS104, VS105, VS106, VS107, VS108, VS109, VS110, VS111, VS112, VS113, VS114, VS115, VS116, VS117, VS118, VS119, VS120, VS121, VS122, VS123, VS124, VS125, VS126, VS127, VS128, VS129, VS130, VS131, VS132, VS133, VS134, VS135, VS136, VS137 too many values to show | |||
Named_Sequences_Provβ | ||||
Named_Sequencesβ | Show Values | |||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Nameβ | Show Values | |||
Script_Extensions | Show Values | |||
Script_Extensionsβ | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Continueβ | No (N), Yes (Y) | |||
ID_Start | No (N), Yes (Y) | |||
ID_Startβ | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_Syntaxβ | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
Pattern_White_Spaceβ | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Continueβ | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
XID_Startβ | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008β | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Statusβ | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mappingβ | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Dashβ | No (N), Yes (Y) | |||
Diacritic | No (N), Yes (Y) | |||
Diacriticβ | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Extenderβ | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Mathβ | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
Quotation_Markβ | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
STermβ | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Terminal_Punctuationβ | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Positional_Categoryβ | Show Values | |||
Indic_Syllabic_Category | Show Values | |||
Indic_Syllabic_Categoryβ | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | NFC_Inert | No (N), Yes (Y) |
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
isNFM | No, Yes | |||
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Changes_When_NFKC_Casefoldedβ | No (N), Yes (Y) | |||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | ICU | Lead_Canonical_Combining_Class | Show Values | |
Trail_Canonical_Combining_Class | Show Values | |||
UCD | Canonical_Combining_Class | Show Values | ||
Canonical_Combining_Classβ | Show Values | |||
Decomposition_Type | Show Values | |||
Decomposition_Typeβ | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFD_Quick_Checkβ | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKC_Quick_Checkβ | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
NFKD_Quick_Checkβ | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
NFKC_Casefoldβ | Show Values | |||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
ASCII_Hex_Digitβ | No (N), Yes (Y) | |||
Hex_Digit | No (N), Yes (Y) | |||
Hex_Digitβ | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric_Typeβ | Decimal (De), Digit (Di), None (None), Numeric (Nu) | |||
kAccountingNumericβ | Show Values | |||
kOtherNumericβ | Show Values | |||
kPrimaryNumericβ | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Numeric_Valueβ | Show Values | |||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
Security | Enumerated | UTS | Confusable_MAβ | Show Values |
Identifier_Statusβ | Allowed (a), Restricted (r) | |||
Identifier_Typeβ | Show Values | |||
Shaping and Rendering | Binary | ICU | Segment_Starter | No (N), Yes (Y) |
UCD | Join_Control | No (N), Yes (Y) | ||
Join_Controlβ | No (N), Yes (Y) | |||
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
East_Asian_Widthβ | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |||
Grapheme_Cluster_Break | Show Values | |||
Grapheme_Cluster_Breakβ | Show Values | |||
Joining_Group | Show Values | |||
Joining_Groupβ | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Joining_Typeβ | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Line_Breakβ | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Prepended_Concatenation_Markβ | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Sentence_Breakβ | Show Values | |||
Standardized_Variantβ | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Vertical_Orientationβ | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
Word_Breakβ | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Emoji_Keycap_Sequence | Other |
ID_Compat_Math_Continueβ | Other | |||
ID_Compat_Math_Startβ | Other | |||
IDS_Unary_Operatorβ | Other | |||
NFKC_Simple_Casefoldβ | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;