Unicode Utilities: Character Property Index

Properties use ICU for Unicode V15.0; the beta properties support Unicode V15.1β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

CategoryDatatypeSourcePropertyValues
BidirectionalBinaryUCDBidi_ControlNo (N),
Yes (Y)
Bidi_ControlβNo (N),
Yes (Y)
Bidi_MirroredNo (N),
Yes (Y)
Bidi_MirroredβNo (N),
Yes (Y)
EnumeratedBidi_ClassShow Values
Bidi_ClassβShow Values
Bidi_Paired_Bracket_TypeClose (C),
None (N),
Open (O)
Bidi_Paired_Bracket_TypeβClose (c),
None (n),
Open (o)
StringBidi_Mirroring_GlyphShow Values
Bidi_Mirroring_GlyphβShow Values
Bidi_Paired_BracketShow Values
Bidi_Paired_BracketβShow Values
CaseBinaryUCDCase_IgnorableNo (N),
Yes (Y)
Case_IgnorableβNo (N),
Yes (Y)
CasedNo (N),
Yes (Y)
CasedβNo (N),
Yes (Y)
Changes_When_CasefoldedNo (N),
Yes (Y)
Changes_When_CasefoldedβNo (N),
Yes (Y)
Changes_When_CasemappedNo (N),
Yes (Y)
Changes_When_CasemappedβNo (N),
Yes (Y)
Changes_When_LowercasedNo (N),
Yes (Y)
Changes_When_LowercasedβNo (N),
Yes (Y)
Changes_When_TitlecasedNo (N),
Yes (Y)
Changes_When_TitlecasedβNo (N),
Yes (Y)
Changes_When_UppercasedNo (N),
Yes (Y)
Changes_When_UppercasedβNo (N),
Yes (Y)
LowercaseNo (N),
Yes (Y)
LowercaseβNo (N),
Yes (Y)
Soft_DottedNo (N),
Yes (Y)
Soft_DottedβNo (N),
Yes (Y)
UppercaseNo (N),
Yes (Y)
UppercaseβNo (N),
Yes (Y)
UnicodeisCasedNo (N),
Yes (Y)
isCasefoldedNo (N),
Yes (Y)
isLowercaseNo (N),
Yes (Y)
isTitlecaseNo (N),
Yes (Y)
isUppercaseNo (N),
Yes (Y)
X-ICUCase_SensitiveNo (N),
Yes (Y)
StringUCDCase_FoldingShow Values
Case_FoldingβShow Values
Lowercase_MappingShow Values
Lowercase_MappingβShow Values
Simple_Case_FoldingShow Values
Simple_Case_FoldingβShow Values
Simple_Lowercase_MappingShow Values
Simple_Lowercase_MappingβShow Values
Simple_Titlecase_MappingShow Values
Simple_Titlecase_MappingβShow Values
Simple_Uppercase_MappingShow Values
Simple_Uppercase_MappingβShow Values
Titlecase_MappingShow Values
Titlecase_MappingβShow Values
Uppercase_MappingShow Values
Uppercase_MappingβShow Values
UnicodetoCasefoldShow Values
toLowercaseShow Values
toTitlecaseShow Values
toUppercaseShow Values
CJKBinaryUCDIDS_Binary_OperatorNo (N),
Yes (Y)
IDS_Binary_OperatorβNo (N),
Yes (Y)
IDS_Trinary_OperatorNo (N),
Yes (Y)
IDS_Trinary_OperatorβNo (N),
Yes (Y)
IdeographicNo (N),
Yes (Y)
IdeographicβNo (N),
Yes (Y)
RadicalNo (N),
Yes (Y)
RadicalβNo (N),
Yes (Y)
Unified_IdeographNo (N),
Yes (Y)
Unified_IdeographβNo (N),
Yes (Y)
EnumeratedX-DemoHanTypeHan, Hans, Hant,
na
StringUCDCJK_RadicalβShow Values
Equivalent_Unified_IdeographβShow Values
kSimplifiedVariantβShow Values
kTraditionalVariantβShow Values
EmojiBinaryUCDExtended_PictographicNo (N),
Yes (Y)
Extended_PictographicβNo (N),
Yes (Y)
UTSBasic_EmojiNo (N),
Yes (Y)
Basic_EmojiβNo (No),
Yes (Yes)
EmojiNo (N),
Yes (Y)
Emoji_ComponentNo (N),
Yes (Y)
Emoji_ComponentβNo (N),
Yes (Y)
Emoji_ModifierNo (N),
Yes (Y)
Emoji_Modifier_BaseNo (N),
Yes (Y)
Emoji_Modifier_BaseβNo (N),
Yes (Y)
Emoji_ModifierβNo (N),
Yes (Y)
Emoji_PresentationNo (N),
Yes (Y)
Emoji_PresentationβNo (N),
Yes (Y)
EmojiβNo (N),
Yes (Y)
RGI_EmojiNo,
Yes
RGI_Emoji_Flag_SequenceNo (N),
Yes (Y)
RGI_Emoji_Flag_SequenceβNo (No),
Yes (Yes)
RGI_Emoji_Keycap_SequenceβNo (No),
Yes (Yes)
RGI_Emoji_Modifier_SequenceNo (N),
Yes (Y)
RGI_Emoji_Modifier_SequenceβNo (No),
Yes (Yes)
RGI_Emoji_Tag_SequenceNo (N),
Yes (Y)
RGI_Emoji_Tag_SequenceβNo (No),
Yes (Yes)
RGI_Emoji_Zwj_SequenceNo (N),
Yes (Y)
RGI_Emoji_Zwj_SequenceβNo (No),
Yes (Yes)
EnumeratedUCDRegional_IndicatorNo (N),
Yes (Y)
Regional_IndicatorβNo (N),
Yes (Y)
GeneralBinaryUCDAlphabeticNo (N),
Yes (Y)
AlphabeticβNo (N),
Yes (Y)
Default_Ignorable_Code_PointNo (N),
Yes (Y)
Default_Ignorable_Code_PointβNo (N),
Yes (Y)
DeprecatedNo (N),
Yes (Y)
DeprecatedβNo (N),
Yes (Y)
Logical_Order_ExceptionNo (N),
Yes (Y)
Logical_Order_ExceptionβNo (N),
Yes (Y)
Noncharacter_Code_PointNo (N),
Yes (Y)
Noncharacter_Code_PointβNo (N),
Yes (Y)
Variation_SelectorNo (N),
Yes (Y)
Variation_SelectorβNo (N),
Yes (Y)
White_SpaceNo (N),
Yes (Y)
White_SpaceβNo (N),
Yes (Y)
CatalogAgeShow Values
AgeβShow Values
BlockShow Values
BlockβShow Values
ScriptShow Values
ScriptβShow Values
EnumeratedGeneral_CategoryShow Values
General_CategoryβShow Values
Hangul_Syllable_TypeLeading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT),
Not_Applicable (NA),
Trailing_Jamo (T),
Vowel_Jamo (V)
Hangul_Syllable_TypeβLeading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT),
Not_Applicable (NA),
Trailing_Jamo (T),
Vowel_Jamo (V)
Name_AliasβACK; ACKNOWLEDGE, ALERT; BEL, ALM, APC; APPLICATION PROGRAM COMMAND, ARABIC SMALL HIGH LIGATURE ALEF WITH YEH BARREE,
BACKSPACE; BS, BOM; BYTE ORDER MARK; ZWNBSP, BPH; BREAK PERMITTED HERE, BYZANTINE MUSICAL SYMBOL FTHORA SKLIRON CHROMA VASIS,
CAN; CANCEL, CANCEL CHARACTER; CCH, CARRIAGE RETURN; CR, CGJ, CHARACTER TABULATION SET; HORIZONTAL TABULATION SET; HTS, CHARACTER TABULATION WITH JUSTIFICATION; HORIZONTAL TABULATION WITH JUSTIFICATION; HTJ, CHARACTER TABULATION; HORIZONTAL TABULATION; HT; TAB, CONTROL SEQUENCE INTRODUCER; CSI, CUNEIFORM SIGN NU11 OVER NU11 BUR OVER BUR, CUNEIFORM SIGN NU11 TENU,
DATA LINK ESCAPE; DLE, DC1; DEVICE CONTROL ONE, DC2; DEVICE CONTROL TWO, DC3; DEVICE CONTROL THREE, DC4; DEVICE CONTROL FOUR, DCS; DEVICE CONTROL STRING, DEL; DELETE,
EM; END OF MEDIUM; EOM, END OF GUARDED AREA; END OF PROTECTED AREA; EPA, END OF LINE; EOL; LF; LINE FEED; NEW LINE; NL, END OF SELECTED AREA; ESA, END OF TEXT; ETX, END OF TRANSMISSION BLOCK; ETB, END OF TRANSMISSION; EOT, ENQ; ENQUIRY, ESC; ESCAPE,
FF; FORM FEED, FILE SEPARATOR; FS; INFORMATION SEPARATOR FOUR, FSI, FVS1, FVS2, FVS3, FVS4,
GROUP SEPARATOR; GS; INFORMATION SEPARATOR THREE,
HANGUL JONGSEONG SSANGYESIEUNG, HANGUL JONGSEONG YESIEUNG-KHIEUKH, HANGUL JONGSEONG YESIEUNG-KIYEOK, HANGUL JONGSEONG YESIEUNG-SSANGKIYEOK, HENTAIGANA LETTER E-1, HIGH OCTET PRESET; HOP,
IND; INDEX, INFORMATION SEPARATOR ONE; UNIT SEPARATOR; US, INFORMATION SEPARATOR TWO; RECORD SEPARATOR; RS,
KANNADA LETTER LLLA,
LAO LETTER FO FAY, LAO LETTER FO FON, LAO LETTER LO, LAO LETTER RO, LATIN CAPITAL LETTER GHA, LATIN SMALL LETTER GHA, LEFTWARDS TRIANGLE-HEADED ARROW WITH DOUBLE VERTICAL STROKE, LINE TABULATION SET; VERTICAL TABULATION SET; VTS, LINE TABULATION; VERTICAL TABULATION; VT, LOCKING-SHIFT ONE; SHIFT OUT; SO, LOCKING-SHIFT ZERO; SHIFT IN; SI, LRE, LRI, LRM, LRO,
MEDEFAIDRIN CAPITAL LETTER H, MEDEFAIDRIN CAPITAL LETTER NG, MEDEFAIDRIN SMALL LETTER H, MEDEFAIDRIN SMALL LETTER NG, MESSAGE WAITING; MW, MICR DASH SYMBOL, MICR ON US SYMBOL, MMSP, MVS, MYANMAR LETTER KHAMTI LLA,
NAK; NEGATIVE ACKNOWLEDGE, NBH; NO BREAK HERE, NBSP, NEL; NEXT LINE, NNBSP, NUL; NULL,
OPERATING SYSTEM COMMAND; OSC,
PAD; PADDING CHARACTER, PARTIAL LINE BACKWARD; PARTIAL LINE UP; PLU, PARTIAL LINE DOWN; PARTIAL LINE FORWARD; PLD, PDF, PDI, PM; PRIVACY MESSAGE, PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRACKET, PRIVATE USE ONE; PRIVATE USE-1; PU1, PRIVATE USE TWO; PRIVATE USE-2; PU2,
REVERSE INDEX; REVERSE LINE FEED; RI, RIGHTWARDS TRIANGLE-HEADED ARROW WITH DOUBLE VERTICAL STROKE, RLE, RLI, RLM, RLO,
SCI; SINGLE CHARACTER INTRODUCER, SET TRANSMIT STATE; STS, SGC; SINGLE GRAPHIC CHARACTER INTRODUCER, SHY, SINGLE SHIFT THREE; SINGLE-SHIFT-3; SS3, SINGLE SHIFT TWO; SINGLE-SHIFT-2; SS2, SOH; START OF HEADING, SOS; START OF STRING, SP, SPA; START OF GUARDED AREA; START OF PROTECTED AREA, SSA; START OF SELECTED AREA, ST; STRING TERMINATOR, START OF TEXT; STX, SUB; SUBSTITUTE, SUNDANESE LETTER ARCHAIC I, SYN; SYNCHRONOUS IDLE, SYRIAC SUBLINEAR COLON SKEWED LEFT,
TIBETAN MARK BKA- SHOG GI MGO RGYAN,
VS1, VS2, VS3, VS4, VS5, VS6, VS7, VS8, VS9, VS10, VS11, VS12, VS13, VS14, VS15, VS16, VS17, VS18, VS19, VS20, VS21, VS22, VS23, VS24, VS25, VS26, VS27, VS28, VS29, VS30, VS31, VS32, VS33, VS34, VS35, VS36, VS37, VS38, VS39, VS40, VS41, VS42, VS43, VS44, VS45, VS46, VS47, VS48, VS49, VS50, VS51, VS52, VS53, VS54, VS55, VS56, VS57, VS58, VS59, VS60, VS61, VS62, VS63, VS64, VS65, VS66, VS67, VS68, VS69, VS70, VS71, VS72, VS73, VS74, VS75, VS76, VS77, VS78, VS79, VS80, VS81, VS82, VS83, VS84, VS85, VS86, VS87, VS88, VS89, VS90, VS91, VS92, VS93, VS94, VS95, VS96, VS97, VS98, VS99, VS100, VS101, VS102, VS103, VS104, VS105, VS106, VS107, VS108, VS109, VS110, VS111, VS112, VS113, VS114, VS115, VS116, VS117, VS118, VS119, VS120, VS121, VS122, VS123, VS124, VS125, VS126, VS127, VS128, VS129, VS130, VS131, VS132, VS133, VS134, VS135, VS136, VS137 too many values to show
Named_Sequences_Provβ
Named_SequencesβShow Values
StringNameslistsubheadShow Values
UCDNameShow Values
NameβShow Values
Script_ExtensionsShow Values
Script_ExtensionsβShow Values
IdentifiersBinaryUCDID_ContinueNo (N),
Yes (Y)
ID_ContinueβNo (N),
Yes (Y)
ID_StartNo (N),
Yes (Y)
ID_StartβNo (N),
Yes (Y)
Pattern_SyntaxNo (N),
Yes (Y)
Pattern_SyntaxβNo (N),
Yes (Y)
Pattern_White_SpaceNo (N),
Yes (Y)
Pattern_White_SpaceβNo (N),
Yes (Y)
XID_ContinueNo (N),
Yes (Y)
XID_ContinueβNo (N),
Yes (Y)
XID_StartNo (N),
Yes (Y)
XID_StartβNo (N),
Yes (Y)
IDNAEnumeratedUTSIdn_2008βna (na),
NV8 (nv8),
XV8 (xv8)
Idn_Statusβdeviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v),
ignored (i),
mapped (m),
valid (v)
idna2003deviation, disallowed,
ignored,
mapped,
valid
idna2008CONTEXTJ, CONTEXTO,
DISALLOWED,
PVALID,
UNASSIGNED
idna2008cdeviation, disallowed,
ignored,
mapped,
valid
uts46deviation, disallowed,
ignored,
mapped,
valid
StringIdn_MappingβShow Values
toIdna2003Show Values
toUts46nShow Values
toUts46tShow Values
MiscellaneousBinaryUCDDashNo (N),
Yes (Y)
DashβNo (N),
Yes (Y)
DiacriticNo (N),
Yes (Y)
DiacriticβNo (N),
Yes (Y)
ExtenderNo (N),
Yes (Y)
ExtenderβNo (N),
Yes (Y)
Grapheme_BaseNo (N),
Yes (Y)
Grapheme_ExtendNo (N),
Yes (Y)
Grapheme_LinkNo (N),
Yes (Y)
HyphenNo (N),
Yes (Y)
MathNo (N),
Yes (Y)
MathβNo (N),
Yes (Y)
Quotation_MarkNo (N),
Yes (Y)
Quotation_MarkβNo (N),
Yes (Y)
STermNo (N),
Yes (Y)
STermβNo (N),
Yes (Y)
Terminal_PunctuationNo (N),
Yes (Y)
Terminal_PunctuationβNo (N),
Yes (Y)
EnumeratedIndic_Positional_CategoryShow Values
Indic_Positional_CategoryβShow Values
Indic_Syllabic_CategoryShow Values
Indic_Syllabic_CategoryβShow Values
MiscellaneousISO_CommentShow Values
Unicode_1_NameShow Values
NormalizationBinaryICUNFC_InertNo (N),
Yes (Y)
NFD_InertNo (N),
Yes (Y)
NFKC_InertNo (N),
Yes (Y)
NFKD_InertNo (N),
Yes (Y)
isNFMNo,
Yes
UCDChanges_When_NFKC_CasefoldedNo (N),
Yes (Y)
Changes_When_NFKC_CasefoldedβNo (N),
Yes (Y)
Full_Composition_ExclusionNo (N),
Yes (Y)
UnicodeisNFCNo,
Yes
isNFDNo,
Yes
isNFKCNo,
Yes
isNFKDNo,
Yes
EnumeratedICULead_Canonical_Combining_ClassShow Values
Trail_Canonical_Combining_ClassShow Values
UCDCanonical_Combining_ClassShow Values
Canonical_Combining_ClassβShow Values
Decomposition_TypeShow Values
Decomposition_TypeβShow Values
NFC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFC_Quick_CheckβMaybe (M),
No (N),
Yes (Y)
NFD_Quick_CheckNo (N),
Yes (Y)
NFD_Quick_CheckβNo (N),
Yes (Y)
NFKC_Quick_CheckMaybe (M),
No (N),
Yes (Y)
NFKC_Quick_CheckβMaybe (M),
No (N),
Yes (Y)
NFKD_Quick_CheckNo (N),
Yes (Y)
NFKD_Quick_CheckβNo (N),
Yes (Y)
StringICUtoNFMShow Values
UCDNFKC_CasefoldShow Values
NFKC_CasefoldβShow Values
UnicodetoNFCShow Values
toNFDShow Values
toNFKCShow Values
toNFKDShow Values
NumericBinaryUCDASCII_Hex_DigitNo (N),
Yes (Y)
ASCII_Hex_DigitβNo (N),
Yes (Y)
Hex_DigitNo (N),
Yes (Y)
Hex_DigitβNo (N),
Yes (Y)
EnumeratedNumeric_TypeDecimal (De), Digit (Di),
None (None), Numeric (Nu)
Numeric_TypeβDecimal (De), Digit (Di),
None (None), Numeric (Nu)
kAccountingNumericβShow Values
kOtherNumericβShow Values
kPrimaryNumericβShow Values
NumericNumeric_ValueShow Values
Numeric_ValueβShow Values
RegexBinaryUTSANYNo,
Yes
ASCIINo,
Yes
alnumNo (N),
Yes (Y)
blankNo (N),
Yes (Y)
bmpNo,
Yes
graphNo (N),
Yes (Y)
printNo (N),
Yes (Y)
xdigitNo (N),
Yes (Y)
SecurityEnumeratedUTSConfusable_MAβShow Values
Identifier_StatusβAllowed (a),
Restricted (r)
Identifier_TypeβShow Values
Shaping and RenderingBinaryICUSegment_StarterNo (N),
Yes (Y)
UCDJoin_ControlNo (N),
Yes (Y)
Join_ControlβNo (N),
Yes (Y)
EnumeratedEast_Asian_WidthAmbiguous (A),
Fullwidth (F),
Halfwidth (H),
Narrow (Na), Neutral (N),
Wide (W)
East_Asian_WidthβAmbiguous (A),
Fullwidth (F),
Halfwidth (H),
Narrow (Na), Neutral (N),
Wide (W)
Grapheme_Cluster_BreakShow Values
Grapheme_Cluster_BreakβShow Values
Joining_GroupShow Values
Joining_GroupβShow Values
Joining_TypeDual_Joining (D),
Join_Causing (C),
Left_Joining (L),
Non_Joining (U),
Right_Joining (R),
Transparent (T)
Joining_TypeβDual_Joining (D),
Join_Causing (C),
Left_Joining (L),
Non_Joining (U),
Right_Joining (R),
Transparent (T)
Line_BreakShow Values
Line_BreakβShow Values
Prepended_Concatenation_MarkNo (N),
Yes (Y)
Prepended_Concatenation_MarkβNo (N),
Yes (Y)
Sentence_BreakShow Values
Sentence_BreakβShow Values
Standardized_VariantβShow Values
Vertical_OrientationRotated (R),
Transformed_Rotated (Tr), Transformed_Upright (Tu),
Upright (U)
Vertical_OrientationβRotated (R),
Transformed_Rotated (Tr), Transformed_Upright (Tu),
Upright (U)
Word_BreakShow Values
Word_BreakβShow Values
UCABinaryUTSucaShow Values
uca2Show Values
uca2.5Show Values
uca3Show Values
Z-OtherOtherOtherEmoji_Keycap_SequenceOther
ID_Compat_Math_ContinueβOther
ID_Compat_Math_StartβOther
IDS_Unary_OperatorβOther
NFKC_Simple_CasefoldβOther

Key

The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.

The Datatypes are from UCD Table 5. Property Type Key.

The Sources are:


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0; Unicodeβ version: 15.0;