Unicode Utilities: Confusables

Properties use ICU for Unicode V13.0; the beta properties support Unicode V14.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input With this demo, you can supply an Input string and see the combinations that are confusable with it, using data collected by the Unicode consortium. You can also try different restrictions, using characters valid in different approaches to international domain names. For more info, see Data below.
  

Confusable Characters

1 I l | Ɩ ǀ Ι І Ӏ ׀ ו ן ا ١ ۱ ߊ 𐊊 𐌉 𐌠 𖼨 𝐈 𝐥 𝐼 𝑙 𝑰 𝒍 𝓁 𝓘 𝓵 𝔩 𝕀 𝕝 𝕴 𝖑 𝖨 𝗅 𝗜 𝗹 𝘐 𝘭 𝙄 𝙡 𝙸 𝚕 𝚰 𝛪 𝜤 𝝞 𝞘 𝟏 𝟙 𝟣 𝟭 𝟷 𞣇 𞸀 𞺀 🯱
00310049006C007C019601C00399040604C005C005D505DF0627066106F107CA16C12110211121132160217C222323FD2C922D4FA4F21028A103091032016F281D4081D4251D43C1D4591D4701D48D1D4C11D4D81D4F51D5291D5401D55D1D5741D5911D5A81D5C51D5DC1D5F91D6101D62D1D6441D6611D6781D6951D6B01D6EA1D7241D75E1D7981D7CF1D7D91D7E31D7ED1D7F71E8C71EE001EE801FBF1FE8DFE8EFF29FF4CFFE8
DIGIT ONELATIN CAPITAL LETTER ILATIN SMALL LETTER LVERTICAL LINELATIN CAPITAL LETTER IOTALATIN LETTER DENTAL CLICKGREEK CAPITAL LETTER IOTACYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN ICYRILLIC LETTER PALOCHKAHEBREW PUNCTUATION PASEQHEBREW LETTER VAVHEBREW LETTER FINAL NUNARABIC LETTER ALEFARABIC-INDIC DIGIT ONEEXTENDED ARABIC-INDIC DIGIT ONENKO LETTER ARUNIC LETTER ISAZ IS ISS ISCRIPT CAPITAL IBLACK-LETTER CAPITAL ISCRIPT SMALL LROMAN NUMERAL ONESMALL ROMAN NUMERAL FIFTYDIVIDESPOWER ON SYMBOLCOPTIC CAPITAL LETTER IAUDATIFINAGH LETTER YANLISU LETTER ILYCIAN LETTER JOLD ITALIC LETTER IOLD ITALIC NUMERAL ONEMIAO LETTER GHAMATHEMATICAL BOLD CAPITAL IMATHEMATICAL BOLD SMALL LMATHEMATICAL ITALIC CAPITAL IMATHEMATICAL ITALIC SMALL LMATHEMATICAL BOLD ITALIC CAPITAL IMATHEMATICAL BOLD ITALIC SMALL LMATHEMATICAL SCRIPT SMALL LMATHEMATICAL BOLD SCRIPT CAPITAL IMATHEMATICAL BOLD SCRIPT SMALL LMATHEMATICAL FRAKTUR SMALL LMATHEMATICAL DOUBLE-STRUCK CAPITAL IMATHEMATICAL DOUBLE-STRUCK SMALL LMATHEMATICAL BOLD FRAKTUR CAPITAL IMATHEMATICAL BOLD FRAKTUR SMALL LMATHEMATICAL SANS-SERIF CAPITAL IMATHEMATICAL SANS-SERIF SMALL LMATHEMATICAL SANS-SERIF BOLD CAPITAL IMATHEMATICAL SANS-SERIF BOLD SMALL LMATHEMATICAL SANS-SERIF ITALIC CAPITAL IMATHEMATICAL SANS-SERIF ITALIC SMALL LMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL IMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL LMATHEMATICAL MONOSPACE CAPITAL IMATHEMATICAL MONOSPACE SMALL LMATHEMATICAL BOLD CAPITAL IOTAMATHEMATICAL ITALIC CAPITAL IOTAMATHEMATICAL BOLD ITALIC CAPITAL IOTAMATHEMATICAL SANS-SERIF BOLD CAPITAL IOTAMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL IOTAMATHEMATICAL BOLD DIGIT ONEMATHEMATICAL DOUBLE-STRUCK DIGIT ONEMATHEMATICAL SANS-SERIF DIGIT ONEMATHEMATICAL SANS-SERIF BOLD DIGIT ONEMATHEMATICAL MONOSPACE DIGIT ONEMENDE KIKAKUI DIGIT ONEARABIC MATHEMATICAL ALEFARABIC MATHEMATICAL LOOPED ALEFSEGMENTED DIGIT ONEARABIC LETTER ALEF ISOLATED FORMARABIC LETTER ALEF FINAL FORMFULLWIDTH LATIN CAPITAL LETTER IFULLWIDTH LATIN SMALL LETTER LHALFWIDTH FORMS LIGHT VERTICAL

Total raw values: 74

Confusable Results

𝚰 𝘭 І 𝖨 𝐥 ﺍ ﺎ 𝔩 ℐ ℑ 𐊊 Ⲓ 𐌉 ℓ 𝜤 Ɩ 𝞘 Ι 𝚕 𝟏 ∣ ا I 𝗅 𝕀 1 𝙄 𝓁 𐌠 𝐼 𞸀 𞺀 ׀ 𝑰 ǀ Ӏ ᛁ 𝟭 𝕴 I ߊ l 𝛪 ⵏ 𝝞 𝕝 𝟣 ו 𞣇 𝙡 𝓘 𝗜 𝟙 𝑙 ן Ⅰ 𝘐 ١ 𝒍 𝖑 │ 🯱 𝐈 l ۱ ꓲ 𖼨 𝙸 𝟷 𝓵 | ⅼ ⏽ 𝗹

Total filtered values: 74


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions are purely on a character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.



Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 70.0; Unicode/Emoji version: 13.0; Unicodeβ version: 14.0;