SPECIAL CHARACTER MATRIX II

Matrix II continues the “copy-ready” sets from Matrix I, focusing on living script families, historic systems, and phonetic/diacritic inventories developers actually touch. Click a tile to copy the glyph. If a symbol looks blank, it’s almost always font coverage—try a broad Unicode family (e.g., Noto).

Heads-up: some of these pages are heavy (large blocks, lots of tiles). Expect a little render heat on lower-end machines or if you’ve got 500 tabs open already.

East Asia Core (Kana & Hangul)

Everyday Japanese and Korean writing systems. This covers native kana syllabaries (including historical/small forms) and Hangul from compositional Jamo to modern precomposed syllables—plus the halfwidth set used for legacy encodings and UI alignment.

Hiragana — U+3040–U+309F
Native Japanese syllabary used for grammar, particles, and many native words; pairs cleanly with kanji.
Katakana — U+30A0–U+30FF (+ Phonetic Ext U+31F0–U+31FF)
Loanwords, onomatopoeia, emphasis; includes small vowel/ya-yu-yo forms and long sound mark.
Kana Supplement & Extensions — U+1B000–U+1B2FF (various ranges)
Archaic/rare kana and small forms for precise transcription and lexicographic use.
Hangul Jamo — U+1100–U+11FF, Ext-A U+A960–U+A97F, Ext-B U+D7B0–U+D7FF
Initial/medial/final letters for composing syllables; extensions add historical and analysis forms.
Hangul Syllables — U+AC00–U+D7AF
Precomposed modern syllables covering everyday Korean text (best font coverage).
Hangul Compatibility Jamo & Halfwidth Katakana — U+3130–U+318F, U+FF00–U+FFEF
Legacy/width-aligned forms used for round-trip mapping and layout-tight UIs.

East Asia Historic & Adjacent

Historic and adjacent systems from the Sinosphere and neighbors. Useful for scholarly labels, examples, and tooling demos without resorting to images.

Nüshu — U+1B170–U+1B2FF
Women’s script from Hunan; narrow, flowing letters often used alongside Chinese text.
Khitan Small Script — U+18B00–U+18CFF
Liao-dynasty logographic/syllabic system; specialist fonts recommended.
Tangut — U+17000–U+187FF; Components U+18800–U+18AFF
Large historic set with structural components for dictionary/analysis work.
Lisu — U+A4D0–U+A4FF
Modern living alphabet derived from a romanization; block caps-like forms.
Yi — Syllables U+A000–U+A48F; Radicals U+A490–U+A4CF
Syllabary used in Southwest China; radicals aid lexicographic indexing.

Europe (Historic)

Early European alphabets used for inscriptions and textual reconstruction. Handy for maps, epigraphy notes, and educational examples.

Runic — U+16A0–U+16FF
Elder/Younger Futhark and Anglo-Saxon variants; directional runes and punctuation included.
Ogham — U+1680–U+169F
Notches/lines script used in early medieval Ireland and Britain; includes space mark and forfeda.
Old Italic — U+10300–U+1032F
Etruscan and related Italic alphabets; right-to-left layout in many fonts.
Gothic — U+10330–U+1034F
Bishop Wulfila’s alphabet for Gothic; includes punctuation and numbers.
Old Hungarian — U+10C80–U+10CFF
Historic runiform script; directionality and ligatures vary by font.
Glagolitic — U+2C00–U+2C5F
Oldest known Slavic alphabet; rounded/angular letterforms by tradition.
Old Permic — U+10350–U+1037F
Uralic family script used for Komi; distinctive tall letters.

Greek, Coptic & Cyrillic

Core European families and their scholarly extensions. Good coverage across modern OSes; extended ranges include diacritics and historic numerals.

Greek & Greek Extended — U+0370–U+03FF, U+1F00–U+1FFF
Base set plus polytonic diacritics, numeral signs, and punctuation for classical texts.
Coptic — U+2C80–U+2CFF; Epact Numbers U+102E0–U+102FF
Late Egyptian descendant used in Christian liturgy; dedicated numeric notation included.
Cyrillic & Supplements — U+0400–U+04FF, U+0500–U+052F, Ext-A U+2DE0–U+2DFF, Ext-B U+A640–U+A69F, Ext-C U+1C80–U+1C8F
Modern Slavic alphabets plus historic/extended letters used in minority and scholarly contexts.

South Asia (Indic)

Brahmic-derived scripts across the subcontinent, with Vedic and numeric adjuncts. Use for localized UI, typography tests, and transliteration demos.

Devanagari — U+0900–U+097F; Extended U+A8E0–U+A8FF
Hindi/Marathi/Nepali and Sanskrit; Vedic marks enable classical notation.
Bengali — U+0980–U+09FF
Bangla/Assamese; includes dependent vowel signs and conjunct behavior.
Gurmukhi — U+0A00–U+0A7F
Punjabi; clear syllabic structure with independent/dependent vowels.
Gujarati — U+0A80–U+0AFF
Western India; distinctive looped letterforms and conjunct rules.
Odia (Oriya) — U+0B00–U+0B7F
Rounded headline and stacked consonant clusters common in print.
Tamil — U+0B80–U+0BFF; Telugu — U+0C00–U+0C7F; Kannada — U+0C80–U+0CFF; Malayalam — U+0D00–U+0D7F; Sinhala — U+0D80–U+0DFF
Dravidian family plus Sinhala; each with unique shaping/diacritic behavior in text engines.
Vedic Extensions — U+1CD0–U+1CFF; Indic Siyaq Numbers — U+1EC70–U+1ECBF
Chant marks and historical accounting numerals used in scholarly contexts.

Latin, Phonetics & Diacritics

Extended Latin plus IPA/phonetic inventories and combining mark sets. Great for scientific names, transliteration, and linguistic UI.

Latin Extended Blocks — A U+0100–U+017F, B U+0180–U+024F, Additional U+1E00–U+1EFF, D U+A720–U+A7FF, E U+AB30–U+AB6F
Adds diacritics and letter variants used across Europe, Africa, and scholarly orthographies.
IPA Extensions — U+0250–U+02AF; Spacing Modifier Letters — U+02B0–U+02FF
Phonetic symbols and superscript modifiers for precise pronunciation guides.
Combining Diacritical Marks — U+0300–U+036F; Supplemental U+1DC0–U+1DFF
Attach-to-base accents/marks for building precomposed-equivalent glyphs on the fly.
Phonetic Extensions — U+1D00–U+1D7F; Supplement U+1D80–U+1DBF
Additional letters and hooks used in linguistic transcription and research.

Other Living Scripts

Active writing systems that appear in modern content, signage, and UI—outside the big “families” above.

Armenian — U+0530–U+058F
Distinct cursive-capable alphabet with historic numeral use and punctuation.
Georgian — U+10A0–U+10FF; Mkhedruli (Ext) U+1C90–U+1CBF; Asomtavruli/Nuskhuri U+2D00–U+2D2F
Multiple historic/modern styles; wide daily use with strong font support.
Mongolian — U+1800–U+18AF
Vertical script; shaping and selection vary by engine—test with capable fonts.
Tibetan — U+0F00–U+0FFF
Complex stacks and signs for liturgical and modern content; needs shaping support.

The Philippines

Indigenous scripts used for cultural works, signage, and educational materials—compact sets with straightforward diacritic behavior.

Tagalog (Baybayin) — U+1700–U+171F
Syllabary with vowel diacritics; revival use in public art and branding.
Hanunóo — U+1720–U+173F
South Mindoro; slanted letterforms, marks indicate vowels and finals.
Buhid — U+1740–U+175F
Mindoro script with discrete vowel marks; simple baseline presentation.
Tagbanwa — U+1760–U+177F
Palawan; compact letter inventory and minimal punctuation.

SE Asia & Indonesia

Brahmic families of mainland Southeast Asia and the Indonesian archipelago. Common in educational content and localized UI where proper shaping matters.

Thai — U+0E00–U+0E7F; Lao — U+0E80–U+0EFF
Abugidas with tone marks and complex shaping; strong OS-level support.
Khmer — U+1780–U+17FF; Symbols U+19E0–U+19FF
Wide glyphs, dependent vowels, and a dedicated symbol set for dates/numbers.
Myanmar — U+1000–U+109F; Ext-A U+AA60–U+AA7F, Ext-B U+A9E0–U+A9FF
Stacked consonants and kinzi behavior; multiple languages share the block.
Balinese — U+1B00–U+1B7F; Sundanese — U+1B80–U+1BBF (+ Supplement U+1CC0–U+1CCF)
Island scripts with calendar and liturgical usage; supplemental marks extend punctuation.
Javanese — U+A980–U+A9DF; Rejang — U+A930–U+A95F; Buginese — U+1A00–U+1A1F; Cham — U+AA00–U+AA5F
Regional systems with distinctive conjunct and numeral behavior; test rendering engines.
Tai Tham / New Tai Lue / Tai Viet — U+1A20–U+1AAF / U+1980–U+19DF / U+AA80–U+AADF
Tai family scripts with alt digit sets and tone/diacritic patterns.

Hebrew, Arabic & Middle Eastern

Right-to-left script families and neighbors. Includes presentation forms and extended blocks for complete shaping and diacritic coverage.

Hebrew — U+0590–U+05FF
Consonantal alphabet with niqqud and cantillation marks; modern and liturgical usage.
Arabic — U+0600–U+06FF; Supplement U+0750–U+077F; Extended A/B/C U+08A0–U+08FF / U+0870–U+089F / U+0860–U+086F
Contextual shaping with extensive diacritics and Quranic marks; many languages share this family.
Arabic Presentation Forms — A U+FB50–U+FDFF, B U+FE70–U+FEFF
Compatibility forms for fixed-shape glyphs and legacy encodings; not for plain-text authoring.
Syriac & Syriac Supplement — U+0700–U+074F, U+0860–U+086F
Liturgical/heritage script with multiple vocalization traditions and punctuation.
Samaritan / Mandaic — U+0800–U+083F / U+0840–U+085F
Distinct right-to-left alphabets with limited but active liturgical use.