Support

Unicode Blocks

  • ASCII
  • Latin-9
  • Some additional Latin alphabets, see below for details
  • IPA (U+0250 - U+029F)
  • U+02C6 - U+02CF
  • Monotonic Greek (lacking some bare diacritics)
  • Cyrillic (only Russian, Ukrainian and pre-1918 Russian currently, see below for upcoming updates)
  • Armenian
  • Hebrew (basic letters only, no full Yiddish support yet)
  • Georgian (upcoming in version 0.4)
  • Cherokee (upcoming in version 0.4)
  • Runes
  • Tai Le
  • New Tai Lüe (upcoming)
  • Roman numerals (U+2160 - U+2165)
  • Bagua symbols
  • Astronomical symbols
  • Zodiac symbols
  • Coptic
  • CJK radicals supplement (partial complete)
  • Kangxi radicals (except 9 characters “⼢⼮⿋⿎⿏⿐⿒⿔⿕”)
  • CJK symbols and punctuation (、。々〇「」)
  • “Hangzhou” numerals
  • Hiragana (missing 2 characters “ぱぽ”)
  • Katakana (missing 1 character “パ”)
  • Bopomofo
  • Hangul compatibility jamo (done for modern jamos)
  • CJKV Ideographs (see below for details)
  • Lisu (Fraser alphabet)
  • Phags-pa (upcoming in version 0.4)
  • Full-width ASCII
  • Half-width katakana
  • Half-width Hangul
  • Various currency symbols (partial complete, ¢£¥€¢£¥)
  • Gothic (upcoming?)

EU Official Languages

  • Bulgarian (done in basic Cyrillics)
  • Croatian (no tone marks yet)
  • Czech
  • Danish (done in Latin-9, except Ǿǿ, which is done in 0.3)
  • Dutch (done in Latin-9, except digraph IJij, which is done in 0.3)
  • English (done in ASCII)
  • Estonian (done in Latin-9)
  • Finnish (done in Latin-9)
  • French (done in Latin-9)
  • German (done in Latin-9, except uppercase ẞ, which is done in 0.3)
  • Greek (currently monotonic only)
  • Hungarian (upcoming in version 0.4)
  • Irish (done in Latin-9 for modern orthography, old orthography upcoming in version 0.4)
  • Italian (done in Latin-9)
  • Latvian
  • Lithuanian
  • Maltese (upcoming in version 0.4)
  • Polish
  • Portuguese (done in Latin-9)
  • Romanian (upcoming in version 0.4)
  • Slovak
  • Slovene (no tone marks yet)
  • Spanish (done in Latin-9)
  • Swedish (done in Latin-9)

Latin Alphabets

For the official languages of the European Union, refer to the list above.

  • Serbo – Croatian (Latin) (no tone marks yet)
  • Esperanto

Romanization Systems

  • Hànyǔ Pīnyīn

Cyrillic Alphabets

  • Russian
  • Ukrainian (Ґґ Єє Іі Її)
  • Pre-1918 Russian (four extra letters: Іі Ѣѣ Ѳѳ Ѵѵ)
  • Belarusian (Іі, Ўў)
  • Bulgarian
  • Dungan (Cyrillic) (Җҗ Ңң Әә Ўў Үү, upcoming in version 0.4)
  • Macedonian (Ѓѓ Ѐѐ Ѕѕ Ѝѝ Јј Љљ Њњ Ќќ Џџ)
  • Mongolian (Cyrillic) (Өө Үү)
  • Kazakh (Cyrillic) (Әә Ғғ Ққ Ңң Өө Ұұ Үү Һһ Іі)
  • Kyrgyz (Ңң Өө Үү)
  • Serbo – Croatian (Cyrillic) (Ђђ Јј Љљ Њњ Ћћ Џџ, no tone marks yet)
  • Tajik (Ғғ Ӣӣ Ққ Ӯӯ Ҳҳ Ҷҷ)
  • Tatar (Cyrillic) (Әә Җҗ Ңң Өө Үү Һһ, upcoming in version 0.4)
  • Tuvan (Ңң Өө Үү)
  • Uzbek (Cyrillic) (Ўў Ққ Ғғ Ҳҳ)

CJKV Ideographs

As of version 2023m27a, we currently have about 516 (?) unrepeated glyphs, 606 if counted the repeated ones.

㘯一丁七万丈三上下不丐丑丙丨个丫
中丰丶丸丹主丽丿乃久么义之乙九乞
也习乡书亅了予二于亏云互五井亜亞
亠亡亢亥京人亿什仁仃今介仑仓代令
以任会伝伟何你來侖修倉個偉傳億儿
兀兄兆光克党入八六兰共关其典冂円
冖冫冬几凡凵出刀刁刃力加劳労勇勝
勞勹勺匕北匚匸区區十千午华南卜卝
卤卩卯厂原厡厶去县又叉友发口可台
史司合向吕君吾呂告命和問善営囗囚
四回圆圓土在地场坂基堀場塲士壬声
夂处复夏夕多夜大天女威子孑孓存学
學宀它安宏官宝实実宠室宫宮家寅實
寵寶寸小少尔尢尸尹尼尽尾屮山川工
己已巳巴巷巾布帜幟干平年幸幺广広
庆库庚庫廠廣廴廾廿弋弓弥彌彐彡彥
彦彳復心志忘思恩恭恵悅惠慶戈戊戌
成我戶户戸所手才支攴攵政文斉斗斤
方施无日早时春昭昼時晝智暢曜曰書
會月木未本朱村来松某栄梁梅森榮欠
止正歹殳毅毋母比毛氏民气氣水汉污
河治沼沿法洽流浩海淸清港漢火点無
營爪父爻爾爿片牙牛犬玄玉王瓜瓦甘
生用田由甲申画畅畫疋疒癶癸發白百
皇皮皿盡目直県眞真矛矢知石示祭禸
禾私秋穴空立童竹範籐米糸紀細維網
縣纪细维缶网羊美羔義羽習翠老而耒
耳聲聿肉胜臣自至臺臼舌舛舟艮色艸
艹花草荣菊華营萬藤蘭虍處虧虫蟲血
行衣複襾西覆見见角言詩語論謠謡識
论识语谷豆豊豐豕豸貝贝赤走越足身
車転轉车转辛辰辱辵达过近送透造過
道達邑邓郷鄉鄕鄧酉酒釆里重金長长
門関闗關门问阜阳阴降陰陽隆隶隹雄
難雨雪靑青非面革韋韦韭音頁須页须
風风飛飞食首香馬馱駄马驮骨高髟鬥
鬯鬲鬼魔魚鯉鱼鲤鳥鸟鹵鹿麗麥麦麻
黃黄黍黑黒點黨黽黾鼎齊齐龍龙