Unicode Utilities: Unicode Language Identifiers and BCP47



Status

Source: en-cmn-Hant-HK

Canonical Form: en-Hant-HK

Type      Code    Name                                                  Replacement
Language  en-cmn  invalid extlang code - would be valid base-lang code
Script    Hant    Traditional Han
Region    HK      Hong Kong SAR China
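
The breakdown above can be approximated in code. The following is a minimal Python sketch, not the tool's actual implementation: it splits a tag into language, extlang, script, and region subtags purely by shape, following the langtag grammar of RFC 5646 (BCP 47), and then flags an extlang whose registered prefix does not match the base language. The names `split_tag`, `invalid_extlangs`, and `EXTLANG_PREFIX` are hypothetical, and the prefix table is a one-entry illustrative excerpt (the IANA registry lists `zh` as the prefix for `cmn`), not the full registry. Variants, extensions, and private-use subtags are not handled.

```python
import re

# Shape-based pattern for language, extlang, script and region subtags.
# Variants, extensions and private-use subtags are left out of this sketch.
_TAG = re.compile(
    r"^(?P<language>[A-Za-z]{2,3})"
    r"(?P<extlang>(?:-[A-Za-z]{3}){0,3})"
    r"(?:-(?P<script>[A-Za-z]{4}))?"
    r"(?:-(?P<region>[A-Za-z]{2}|[0-9]{3}))?$"
)

# Illustrative excerpt only (assumption: a real checker would load the
# full IANA Language Subtag Registry). "cmn" is registered with prefix "zh".
EXTLANG_PREFIX = {"cmn": "zh"}


def split_tag(tag):
    """Split a tag into subtags by shape; return None if it does not match."""
    m = _TAG.match(tag)
    if m is None:
        return None
    script = m.group("script")
    region = m.group("region")
    return {
        "language": m.group("language").lower(),
        # The extlang group looks like "-cmn"; drop the empty pieces.
        "extlang": [s.lower() for s in m.group("extlang").split("-") if s],
        "script": script.title() if script else None,
        "region": region.upper() if region else None,
    }


def invalid_extlangs(parts):
    """Return extlang subtags whose known prefix is not the base language."""
    return [
        e for e in parts["extlang"]
        if EXTLANG_PREFIX.get(e) != parts["language"]
    ]
```

With the source tag from this page, `split_tag("en-cmn-Hant-HK")` yields language `en`, extlang `cmn`, script `Hant`, and region `HK`, and `invalid_extlangs` reports `cmn` because its prefix is `zh` rather than `en`. The same check accepts `zh-cmn-Hant-HK`.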

Samples

Notes


Fonts and Display. If you don't have a good set of Unicode fonts and a modern browser, you may not be able to read some of the characters. Suggested fonts that you can add for coverage include: Noto Fonts site; Unicode Fonts for Ancient Scripts; Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 72.0; Unicode/Emoji version: 15.0;