Unicode Utilities: BIDI (UBA)

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Shows processing by the UBA (Unicode Bidi Algorithm), which is used to display all Unicode Arabic and Hebrew text. For more information, see bidi info.

This Java implementation supports UBA rules and character properties as of Unicode 6.2. To test current UBA rules, or intermediate versions from Unicode 6.3 forward, see the C Reference demo.

Sample      

Paragraph 1

Base Level 1 = RTLheuristic

Source

Memory Position 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35
Character ‎ה‎ ‎ו‎ ‎א‎ ‎א‎ ‎מ‎ ‎ר‎ " C a r i n H e b r e w i s ‎מ‎ ‎כ‎ ‎ו‎ ‎נ‎ ‎י‎ ‎ת‎ " .
Bidi Class R R R WS R R R WS ON LRE L L L WS L L WS L L L L L L WS L L WS R R R R R R PDF ON CS
Rules Applied N1R
N2R
N2R
I2LRE
N1L
N1L
N1L
N2L
I2L
N2R
W6ON
N2R
I2PDF
I2R
I2R
Resulting Level
L1

L1

L1

L1

L1

L1

L1

L1

L1

L1


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2


L2



L3



L3



L3



L3



L3



L3



L3

L1

L1

Reordered

Display Position 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35
Memory Position 35 34 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 33 32 31 30 29 28 27 9 8 7 6 5 4 3 2 1 0
Character . " C a r i n H e b r e w i s ‎ת‎ ‎י‎ ‎נ‎ ‎ו‎ ‎כ‎ ‎מ‎ " ‎ר‎ ‎מ‎ ‎א‎ ‎א‎ ‎ו‎ ‎ה‎

Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 70.0; Unicode/Emoji version: 14.0;