Unicode Utilities: Confusables

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input With this demo, you can supply an Input string and see the combinations that are confusable with it, using data collected by the Unicode consortium. You can also try different restrictions, using characters valid in different approaches to international domain names. For more info, see Data below.
  

Confusable Characters

                            
FFFD
REPLACEMENT CHARACTER
                            
FFFD
REPLACEMENT CHARACTER
B Β В 𐊂 𐊡 𐌁 𝐁 𝐵 𝑩 𝓑 𝔅 𝔹 𝕭 𝖡 𝗕 𝘉 𝘽 𝙱 𝚩 𝛣 𝜝 𝝗 𝞑
00420392041213F415F7212CA4D0A7B410282102A1103011D4011D4351D4691D4D11D5051D5391D56D1D5A11D5D51D6091D63D1D6711D6A91D6E31D71D1D7571D791FF22
LATIN CAPITAL LETTER BGREEK CAPITAL LETTER BETACYRILLIC CAPITAL LETTER VECHEROKEE LETTER YVCANADIAN SYLLABICS CARRIER KHESCRIPT CAPITAL BLISU LETTER BALATIN CAPITAL LETTER BETALYCIAN LETTER BCARIAN LETTER P2OLD ITALIC LETTER BEMATHEMATICAL BOLD CAPITAL BMATHEMATICAL ITALIC CAPITAL BMATHEMATICAL BOLD ITALIC CAPITAL BMATHEMATICAL BOLD SCRIPT CAPITAL BMATHEMATICAL FRAKTUR CAPITAL BMATHEMATICAL DOUBLE-STRUCK CAPITAL BMATHEMATICAL BOLD FRAKTUR CAPITAL BMATHEMATICAL SANS-SERIF CAPITAL BMATHEMATICAL SANS-SERIF BOLD CAPITAL BMATHEMATICAL SANS-SERIF ITALIC CAPITAL BMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL BMATHEMATICAL MONOSPACE CAPITAL BMATHEMATICAL BOLD CAPITAL BETAMATHEMATICAL ITALIC CAPITAL BETAMATHEMATICAL BOLD ITALIC CAPITAL BETAMATHEMATICAL SANS-SERIF BOLD CAPITAL BETAMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL BETAFULLWIDTH LATIN CAPITAL LETTER B
B Β В 𐊂 𐊡 𐌁 𝐁 𝐵 𝑩 𝓑 𝔅 𝔹 𝕭 𝖡 𝗕 𝘉 𝘽 𝙱 𝚩 𝛣 𝜝 𝝗 𝞑
00420392041213F415F7212CA4D0A7B410282102A1103011D4011D4351D4691D4D11D5051D5391D56D1D5A11D5D51D6091D63D1D6711D6A91D6E31D71D1D7571D791FF22
LATIN CAPITAL LETTER BGREEK CAPITAL LETTER BETACYRILLIC CAPITAL LETTER VECHEROKEE LETTER YVCANADIAN SYLLABICS CARRIER KHESCRIPT CAPITAL BLISU LETTER BALATIN CAPITAL LETTER BETALYCIAN LETTER BCARIAN LETTER P2OLD ITALIC LETTER BEMATHEMATICAL BOLD CAPITAL BMATHEMATICAL ITALIC CAPITAL BMATHEMATICAL BOLD ITALIC CAPITAL BMATHEMATICAL BOLD SCRIPT CAPITAL BMATHEMATICAL FRAKTUR CAPITAL BMATHEMATICAL DOUBLE-STRUCK CAPITAL BMATHEMATICAL BOLD FRAKTUR CAPITAL BMATHEMATICAL SANS-SERIF CAPITAL BMATHEMATICAL SANS-SERIF BOLD CAPITAL BMATHEMATICAL SANS-SERIF ITALIC CAPITAL BMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL BMATHEMATICAL MONOSPACE CAPITAL BMATHEMATICAL BOLD CAPITAL BETAMATHEMATICAL ITALIC CAPITAL BETAMATHEMATICAL BOLD ITALIC CAPITAL BETAMATHEMATICAL SANS-SERIF BOLD CAPITAL BETAMATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL BETAFULLWIDTH LATIN CAPITAL LETTER B
. ٠ ۰ ܁ ܂ 𐩐 𝅭                    
002E066006F0070107022024A4F8A60E10A501D16D
FULL STOPARABIC-INDIC DIGIT ZEROEXTENDED ARABIC-INDIC DIGIT ZEROSYRIAC SUPRALINEAR FULL STOPSYRIAC SUBLINEAR FULL STOPONE DOT LEADERLISU LETTER TONE MYA TIVAI FULL STOPKHAROSHTHI PUNCTUATION DOTMUSICAL SYMBOL COMBINING AUGMENTATION DOT
a ɑ α а 𝐚 𝑎 𝒂 𝒶 𝓪 𝔞 𝕒 𝖆 𝖺 𝗮 𝘢 𝙖 𝚊 𝛂 𝛼 𝜶 𝝰 𝞪      
0061025103B10430237A1D41A1D44E1D4821D4B61D4EA1D51E1D5521D5861D5BA1D5EE1D6221D6561D68A1D6C21D6FC1D7361D7701D7AAFF41
LATIN SMALL LETTER ALATIN SMALL LETTER ALPHAGREEK SMALL LETTER ALPHACYRILLIC SMALL LETTER AAPL FUNCTIONAL SYMBOL ALPHAMATHEMATICAL BOLD SMALL AMATHEMATICAL ITALIC SMALL AMATHEMATICAL BOLD ITALIC SMALL AMATHEMATICAL SCRIPT SMALL AMATHEMATICAL BOLD SCRIPT SMALL AMATHEMATICAL FRAKTUR SMALL AMATHEMATICAL DOUBLE-STRUCK SMALL AMATHEMATICAL BOLD FRAKTUR SMALL AMATHEMATICAL SANS-SERIF SMALL AMATHEMATICAL SANS-SERIF BOLD SMALL AMATHEMATICAL SANS-SERIF ITALIC SMALL AMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL AMATHEMATICAL MONOSPACE SMALL AMATHEMATICAL BOLD SMALL ALPHAMATHEMATICAL ITALIC SMALL ALPHAMATHEMATICAL BOLD ITALIC SMALL ALPHAMATHEMATICAL SANS-SERIF BOLD SMALL ALPHAMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHAFULLWIDTH LATIN SMALL LETTER A
t 𝐭 𝑡 𝒕 𝓉 𝓽 𝔱 𝕥 𝖙 𝗍 𝘁 𝘵 𝙩 𝚝                
00741D42D1D4611D4951D4C91D4FD1D5311D5651D5991D5CD1D6011D6351D6691D69D
LATIN SMALL LETTER TMATHEMATICAL BOLD SMALL TMATHEMATICAL ITALIC SMALL TMATHEMATICAL BOLD ITALIC SMALL TMATHEMATICAL SCRIPT SMALL TMATHEMATICAL BOLD SCRIPT SMALL TMATHEMATICAL FRAKTUR SMALL TMATHEMATICAL DOUBLE-STRUCK SMALL TMATHEMATICAL BOLD FRAKTUR SMALL TMATHEMATICAL SANS-SERIF SMALL TMATHEMATICAL SANS-SERIF BOLD SMALL TMATHEMATICAL SANS-SERIF ITALIC SMALL TMATHEMATICAL SANS-SERIF BOLD ITALIC SMALL TMATHEMATICAL MONOSPACE SMALL T

Total raw values: 2,825,760

Too many raw items to process.


Data

Confusable characters are those that may be confused with others (in some common UI fonts), such as the Latin letter "o" and the Greek letter omicron "ο". Fonts make a difference: for example, the Hebrew character "ס" looks confusingly similar to "o" in some fonts (such as Arial Hebrew), but not in others. See also unaccented Latin Characters..

The data for confusables and restrictions is from UTS39. You can suggest additions or changes to the Unicode data for future versions of that standard.

For more information on the use of the data, see proposed updates Unicode Security Mechanisms and Unicode Security Considerations.

The restrictions are purely on a character level. For a more detailed view, see idna.

Caveats

The Unicode data is designed for testing, not enumerating, so not all combinations are generated in this demo; In particular, where a character is confusable with a sequence, not all combinations are generated.



Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0;