SPECIAL CHARACTER MATRIX II
Matrix II continues the “copy-ready” sets from Matrix I, focusing on living script families, historic systems, and phonetic/diacritic inventories developers actually touch. Click a tile to copy the glyph. If a symbol looks blank, it’s almost always font coverage—try a broad Unicode family (e.g., Noto).
Heads-up: some of these pages are heavy (large blocks, lots of tiles). Expect a little render heat on lower-end machines or if you’ve got 500 tabs open already.
East Asia Core (Kana & Hangul)
Everyday Japanese and Korean writing systems. This covers native kana syllabaries (including historical/small forms) and Hangul from compositional Jamo to modern precomposed syllables—plus the halfwidth set used for legacy encodings and UI alignment.
-
Hiragana —
U+3040–U+309F
Native Japanese syllabary used for grammar, particles, and many native words; pairs cleanly with kanji. -
Katakana —
U+30A0–U+30FF(+ Phonetic ExtU+31F0–U+31FF)
Loanwords, onomatopoeia, emphasis; includes small vowel/ya-yu-yo forms and long sound mark. -
Kana Supplement & Extensions —
U+1B000–U+1B2FF(various ranges)
Archaic/rare kana and small forms for precise transcription and lexicographic use. -
Hangul Jamo —
U+1100–U+11FF, Ext-AU+A960–U+A97F, Ext-BU+D7B0–U+D7FF
Initial/medial/final letters for composing syllables; extensions add historical and analysis forms. -
Hangul Syllables —
U+AC00–U+D7AF
Precomposed modern syllables covering everyday Korean text (best font coverage). -
Hangul Compatibility Jamo & Halfwidth Katakana —
U+3130–U+318F,U+FF00–U+FFEF
Legacy/width-aligned forms used for round-trip mapping and layout-tight UIs.
East Asia Historic & Adjacent
Historic and adjacent systems from the Sinosphere and neighbors. Useful for scholarly labels, examples, and tooling demos without resorting to images.
-
Nüshu —
U+1B170–U+1B2FF
Women’s script from Hunan; narrow, flowing letters often used alongside Chinese text. -
Khitan Small Script —
U+18B00–U+18CFF
Liao-dynasty logographic/syllabic system; specialist fonts recommended. -
Tangut —
U+17000–U+187FF; ComponentsU+18800–U+18AFF
Large historic set with structural components for dictionary/analysis work. -
Lisu —
U+A4D0–U+A4FF
Modern living alphabet derived from a romanization; block caps-like forms. -
Yi — Syllables
U+A000–U+A48F; RadicalsU+A490–U+A4CF
Syllabary used in Southwest China; radicals aid lexicographic indexing.
Europe (Historic)
Early European alphabets used for inscriptions and textual reconstruction. Handy for maps, epigraphy notes, and educational examples.
-
Runic —
U+16A0–U+16FF
Elder/Younger Futhark and Anglo-Saxon variants; directional runes and punctuation included. -
Ogham —
U+1680–U+169F
Notches/lines script used in early medieval Ireland and Britain; includes space mark and forfeda. -
Old Italic —
U+10300–U+1032F
Etruscan and related Italic alphabets; right-to-left layout in many fonts. -
Gothic —
U+10330–U+1034F
Bishop Wulfila’s alphabet for Gothic; includes punctuation and numbers. -
Old Hungarian —
U+10C80–U+10CFF
Historic runiform script; directionality and ligatures vary by font. -
Glagolitic —
U+2C00–U+2C5F
Oldest known Slavic alphabet; rounded/angular letterforms by tradition. -
Old Permic —
U+10350–U+1037F
Uralic family script used for Komi; distinctive tall letters.
Greek, Coptic & Cyrillic
Core European families and their scholarly extensions. Good coverage across modern OSes; extended ranges include diacritics and historic numerals.
-
Greek & Greek Extended —
U+0370–U+03FF,U+1F00–U+1FFF
Base set plus polytonic diacritics, numeral signs, and punctuation for classical texts. -
Coptic —
U+2C80–U+2CFF; Epact NumbersU+102E0–U+102FF
Late Egyptian descendant used in Christian liturgy; dedicated numeric notation included. -
Cyrillic & Supplements —
U+0400–U+04FF,U+0500–U+052F, Ext-AU+2DE0–U+2DFF, Ext-BU+A640–U+A69F, Ext-CU+1C80–U+1C8F
Modern Slavic alphabets plus historic/extended letters used in minority and scholarly contexts.
South Asia (Indic)
Brahmic-derived scripts across the subcontinent, with Vedic and numeric adjuncts. Use for localized UI, typography tests, and transliteration demos.
-
Devanagari —
U+0900–U+097F; ExtendedU+A8E0–U+A8FF
Hindi/Marathi/Nepali and Sanskrit; Vedic marks enable classical notation. -
Bengali —
U+0980–U+09FF
Bangla/Assamese; includes dependent vowel signs and conjunct behavior. -
Gurmukhi —
U+0A00–U+0A7F
Punjabi; clear syllabic structure with independent/dependent vowels. -
Gujarati —
U+0A80–U+0AFF
Western India; distinctive looped letterforms and conjunct rules. -
Odia (Oriya) —
U+0B00–U+0B7F
Rounded headline and stacked consonant clusters common in print. -
Tamil —
U+0B80–U+0BFF; Telugu —U+0C00–U+0C7F; Kannada —U+0C80–U+0CFF; Malayalam —U+0D00–U+0D7F; Sinhala —U+0D80–U+0DFF
Dravidian family plus Sinhala; each with unique shaping/diacritic behavior in text engines. -
Vedic Extensions —
U+1CD0–U+1CFF; Indic Siyaq Numbers —U+1EC70–U+1ECBF
Chant marks and historical accounting numerals used in scholarly contexts.
Latin, Phonetics & Diacritics
Extended Latin plus IPA/phonetic inventories and combining mark sets. Great for scientific names, transliteration, and linguistic UI.
-
Latin Extended Blocks — A
U+0100–U+017F, BU+0180–U+024F, AdditionalU+1E00–U+1EFF, DU+A720–U+A7FF, EU+AB30–U+AB6F
Adds diacritics and letter variants used across Europe, Africa, and scholarly orthographies. -
IPA Extensions —
U+0250–U+02AF; Spacing Modifier Letters —U+02B0–U+02FF
Phonetic symbols and superscript modifiers for precise pronunciation guides. -
Combining Diacritical Marks —
U+0300–U+036F; SupplementalU+1DC0–U+1DFF
Attach-to-base accents/marks for building precomposed-equivalent glyphs on the fly. -
Phonetic Extensions —
U+1D00–U+1D7F; SupplementU+1D80–U+1DBF
Additional letters and hooks used in linguistic transcription and research.
Other Living Scripts
Active writing systems that appear in modern content, signage, and UI—outside the big “families” above.
-
Armenian —
U+0530–U+058F
Distinct cursive-capable alphabet with historic numeral use and punctuation. -
Georgian —
U+10A0–U+10FF; Mkhedruli (Ext)U+1C90–U+1CBF; Asomtavruli/NuskhuriU+2D00–U+2D2F
Multiple historic/modern styles; wide daily use with strong font support. -
Mongolian —
U+1800–U+18AF
Vertical script; shaping and selection vary by engine—test with capable fonts. -
Tibetan —
U+0F00–U+0FFF
Complex stacks and signs for liturgical and modern content; needs shaping support.
The Philippines
Indigenous scripts used for cultural works, signage, and educational materials—compact sets with straightforward diacritic behavior.
-
Tagalog (Baybayin) —
U+1700–U+171F
Syllabary with vowel diacritics; revival use in public art and branding. -
Hanunóo —
U+1720–U+173F
South Mindoro; slanted letterforms, marks indicate vowels and finals. -
Buhid —
U+1740–U+175F
Mindoro script with discrete vowel marks; simple baseline presentation. -
Tagbanwa —
U+1760–U+177F
Palawan; compact letter inventory and minimal punctuation.
SE Asia & Indonesia
Brahmic families of mainland Southeast Asia and the Indonesian archipelago. Common in educational content and localized UI where proper shaping matters.
-
Thai —
U+0E00–U+0E7F; Lao —U+0E80–U+0EFF
Abugidas with tone marks and complex shaping; strong OS-level support. -
Khmer —
U+1780–U+17FF; SymbolsU+19E0–U+19FF
Wide glyphs, dependent vowels, and a dedicated symbol set for dates/numbers. -
Myanmar —
U+1000–U+109F; Ext-AU+AA60–U+AA7F, Ext-BU+A9E0–U+A9FF
Stacked consonants and kinzi behavior; multiple languages share the block. -
Balinese —
U+1B00–U+1B7F; Sundanese —U+1B80–U+1BBF(+ SupplementU+1CC0–U+1CCF)
Island scripts with calendar and liturgical usage; supplemental marks extend punctuation. -
Javanese —
U+A980–U+A9DF; Rejang —U+A930–U+A95F; Buginese —U+1A00–U+1A1F; Cham —U+AA00–U+AA5F
Regional systems with distinctive conjunct and numeral behavior; test rendering engines. -
Tai Tham / New Tai Lue / Tai Viet —
U+1A20–U+1AAF/U+1980–U+19DF/U+AA80–U+AADF
Tai family scripts with alt digit sets and tone/diacritic patterns.
Hebrew, Arabic & Middle Eastern
Right-to-left script families and neighbors. Includes presentation forms and extended blocks for complete shaping and diacritic coverage.
-
Hebrew —
U+0590–U+05FF
Consonantal alphabet with niqqud and cantillation marks; modern and liturgical usage. -
Arabic —
U+0600–U+06FF; SupplementU+0750–U+077F; Extended A/B/CU+08A0–U+08FF/U+0870–U+089F/U+0860–U+086F
Contextual shaping with extensive diacritics and Quranic marks; many languages share this family. -
Arabic Presentation Forms — A
U+FB50–U+FDFF, BU+FE70–U+FEFF
Compatibility forms for fixed-shape glyphs and legacy encodings; not for plain-text authoring. -
Syriac & Syriac Supplement —
U+0700–U+074F,U+0860–U+086F
Liturgical/heritage script with multiple vocalization traditions and punctuation. -
Samaritan / Mandaic —
U+0800–U+083F/U+0840–U+085F
Distinct right-to-left alphabets with limited but active liturgical use.