SPECIAL CHARACTER MATRIX I
As of Unicode version 16.0, there are 292,531 assigned characters with code points, covering 168 modern and historical scripts, as well as multiple symbol sets. It is not technically possible to list all of these characters in a single web page, so this series of list pages is limited to those I’ve decided would be most useful for the average developer. Clicking on a character tile on these pages will copy that character to your clipboard through the magic of JavaScript.
One other thing to keep in mind, particularly if you peruse the exhaustive language character pages: This ain’t the best site for a directory. Think about all the CSS and .js it took to make this site happen. There is a lot of overhead on my site, and dropping an encyclopedia of Unicode into a page is going to cause the pages to lag, particularly on lower-end systems. Some of these pages will max my server memory. Gogela.com is brutal when it comes to bandwidth and resource requirements. This place is more about testing than entertainment. Some of these pages will dial up your browser to 11. …just be aware.
Arrows & Emoticons
Arrows and emotive faces span the earliest days of technical typography and ASCII smileys through to modern emoji. As Unicode grew, the single “Arrows” block gained multiple supplements so authors could express direction, state transitions, mappings, and vector notation without custom artwork. Emoticons evolved from text smileys to standardized code points so feelings and tone could survive copy/paste across platforms.
This page mixes navigation/diagramming workhorses (arrows, box/line art, block fill, braille, OCR/control pictures, currency marks) with classic symbol sets (dingbats, chess pieces) and modern smileys. Everything here is standardized in Unicode so it renders and copy-pastes reliably across platforms.
-
Arrows —
U+2190–U+21FF
Directional, double-headed, curved, and modifier arrows for flows, navigation, and mappings. -
Basic Latin Punctuation & Symbols —
U+0021–U+002F,U+003A–U+0040,U+005B–U+0060,U+007B–U+007E
(subset of Basic Latin)
The ASCII punctuation/symbols you use every day: ! ” # $ % & ‘ ( ) * + , – . / : ; < = > ? @ [ \ ] ^ _ { | } ~ -
Block Elements —
U+2580–U+259F
Half/quarter blocks and shades for progress bars, meters, and compact visual fills. -
Box Drawing —
U+2500–U+257F
Line/tee/corner characters for grid UIs, ASCII tables, terminals, and schematic frames. -
Braille Patterns —
U+2800–U+28FF
6-dot braille cells for accessibility examples, teaching materials, or stylized UI. (Rendering needs a font with braille coverage.) -
Chess Pieces (Classic) —
U+2654–U+265F
White/black ♔–♟ piece set from Miscellaneous Symbols—great for chess notation, puzzles, and icons. -
Chess Symbols (Extended) —
U+1FA00–U+1FA6F
Expanded symbols: fairy pieces, boards, annotation marks, and UI glyphs used by modern chess software. -
Control Pictures & OCR —
U+2400–U+243F(Control Pictures),U+2440–U+245F(OCR)
Printable icons for control codes (␀–␡) and optical-character-recognition symbols used in scanning/finance contexts. -
Currency Symbols —
U+20A0–U+20CF(plus common marks in Latin-1 like ¢ U+00A2, £ U+00A3, ¥ U+00A5)
Historic and modern currency signs for internationalized pricing and financial UI. -
Dingbats —
U+2700–U+27BF
Bullets, ornaments, checkmarks, pointers, and decorative marks—handy for lists, keys, and callouts. -
Emoticons —
U+1F600–U+1F64F
Standardized smileys and expressions. Many support a color emoji presentation; variation selectors can force text/emoji style when needed.
- Notes: Some arrows carry distinct semantics (e.g., implication vs. mapping). Emoji appearance varies by OS/app; variation selectors can force text/emoji style where needed.
Alphanumerics & Mathematical Basics
This page bundles the everyday “typography workhorses” for technical writing and UI polish: enclosed numbers/letters for callouts, letter-like symbols with long print traditions, geometric shapes for keys and legends, the Latin-1 additions that browsers render everywhere, the crucial (often invisible) punctuation controls, and the core mathematical operators used across STEM. Everything here round-trips cleanly via Unicode, so you don’t need images or custom fonts for common notations.
-
Enclosed Alphanumerics —
U+2460–U+24FF
Classic circled/squared numbers and letters (①, ②, ⑴, Ⓐ, Ⓑ) for step markers, diagram labels, and list callouts. -
Enclosed Alphanumeric Supplement —
U+1F100–U+1F1FF
Modern “enclosed” expansions: negative-circled, squared, and parenthesized Latin letters/digits and abbreviations—handy for badges and compact UI labels. -
Enclosed Ideographic Supplement —
U+1F200–U+1F2FF
Enclosed CJK symbols and kana/ideograph markers (e.g., 🈁, 🈯). Useful for internationalized legends and signage-style labels. -
General Punctuation —
U+2000–U+206F
The “text glue” set: en/em dashes, smart quotes, bullets (• U+2022), ellipsis (… U+2026), figure/thin/hair spaces, and control/format characters
(e.g., zero-width joiner U+200D, word-joiner U+2060). Many are intentionally invisible but copy/paste correctly. -
Geometric Shapes —
U+25A0–U+25FF
Filled/outlined squares, circles, diamonds, triangles, lozenges, and pointers—great for keys, bullets, and visual indicators. -
Latin-1 Supplement —
U+0080–U+00FF
Western diacritics (á, ñ, ö), inverted punctuation (¡, ¿), common symbols (¢, £, ¥, §, ©, ®), math signs (×, ÷), and the non-breaking space (U+00A0). -
Letterlike Symbols —
U+2100–U+214F
Print-tradition letter-forms used symbolically: ℂ, ℍ, ℕ, ℙ, ℚ, ℝ, ℤ, ℵ; commercial/legal marks (℠, ™, ℗, №), ℓ, ℮, ℧, and more. -
Mahjong Tiles (Extended) —
U+1F000–U+1F02F
The mahjong tile set (🀄 U+1F004 et al.). Often rendered with color/emoji styling on mobile; useful for examples, games, and iconography. -
Mathematical Operators —
U+2200–U+22FF
Core math toolkit: quantifiers (∀, ∃), arithmetic (√, ∞, ±), comparisons (≤, ≥), sums/products (∑, ∏), integrals (∫), set/logic basics, and more.
- Notes: Some items in General Punctuation are spacing/format controls, so they may appear “blank” but still copy correctly.
Advanced Math & Transport/Map Symbols
Beyond the core operators, Unicode added deeper inventories to cover modern mathematics: specialized relationals, big operators, geometric marks, delimiters, and arrows with algebraic meaning—so proofs and textbooks could be represented faithfully. On the pictographic side, Transport/Map symbols standardized signage and wayfinding icons for use in maps, routing UIs, and legend keys.
This page is the “big toolbox” beyond basic operators: extended mathematics (geometry marks, big operators, specialized relationals, structured brackets, and arrow variants) plus practical pictographs used in signage, mapping, and UI legends. Many of these blocks entered Unicode as math publishing, STIX fonts, and mobile mapping matured—so technical documents and wayfinding keys could be copied, searched, and rendered consistently across platforms.
- Miscellaneous Mathematical Symbols-A —
U+27C0–U+27EF
Geometry and relation symbols, circled operators, and assorted math controls. - Miscellaneous Mathematical Symbols-B —
U+2980–U+29FF
Paired delimiters (⟦⟧, ⟨⟩, ⟪⟫), additional relationals, angle/arc notation, and algebraic arrow variants. - Miscellaneous Symbols —
U+2600–U+26FF
Weather/astrology, dingbats, caution signs, religious/emblematic symbols often used in legends. - Miscellaneous Symbols and Arrows —
U+2B00–U+2BFF
Extra arrows, triangles, geometric pointers, and UI-style nav indicators. - Miscellaneous Technical —
U+2300–U+23FF
Control pictures, measurement/engineering marks, keyboard/terminal glyphs, and diagram helpers. - Number Forms —
U+2150–U+218F
Vulgar fractions, Roman numerals, and numeric marks used in tables and lists. - Playing Cards —
U+1F0A0–U+1F0FF
Full deck, suit symbols, and jokers—useful for games, probability examples, and UI badges. - Supplemental Arrows-A —
U+27F0–U+27FF
Curved, looped, and specialized arrows for mappings and flow diagrams. - Supplemental Arrows-B —
U+2900–U+297F
Heavy/harpoon/multisegment arrows and relation-arrow hybrids common in math texts. - Supplemental Symbols and Pictographs —
U+1F900–U+1F9FF
Modern UI/semantic pictographs (objects, hands, activity symbols). Many have emoji presentation on mobile. - Symbols & Pictographs Extended-A —
U+1FA70–U+1FAFF
Additional modern pictographs (tools, UI marks, game pieces, etc.) expanding the 1F9xx set. - Transport and Map Symbols —
U+1F680–U+1F6FF
Vehicles, transit, infrastructure, and wayfinding icons for maps and signage. Often rendered in color on mobile.
Tip: If a glyph looks “missing,” it’s sometimes a font-coverage issue. I’ve tried to use the most inclusive font for these pages, but I don’t want it to look shitty, so maybe I cut corners where I shouldn’t have. The character is there, even if you can’t see it. Try a broad Unicode family font that might be better maintained (e.g., Noto Sans Symbols / Symbols2) to see the full set.
African Scripts — Living & Historic
From Geʽez-based Ethiopian writing to Tifinagh of the Amazigh, Africa spans living literary traditions and revived/deciphered systems. Many blocks include historical extensions or number systems added in recent Unicode versions.
This page focuses on living use (Ethiopic, Tifinagh, N’Ko, Vai, Adlam, etc.) plus important historic sets (Bamum, Meroitic, Old Nubian, Coptic). Each item links to the examples/tiles on this page.
-
Ethiopic (Geʽez) —
U+1200–U+137F; SupplementU+1380–U+139F; ExtendedU+2D80–U+2DDF; Extended-AU+AB00–U+AB2F; Extended-BU+1E7E0–U+1E7FF
Base for Amharic, Tigrinya, etc.; includes tonal marks, syllables, and historic numerals. -
Tifinagh —
U+2D30–U+2D7F
Amazigh (Berber) consonantary; modern Neo-Tifinagh used in signage/UI. -
N’Ko —
U+07C0–U+07FF
Manding languages across West Africa; right-to-left with digits and tone marks. -
Vai —
U+A500–U+A63F
Indigenous syllabary from Liberia; full set of syllables and digits. -
Bassa Vah —
U+16AD0–U+16AFF
Bassa language of Liberia/Sierra Leone; letters with diacritics and digits. -
Adlam —
U+1E900–U+1E95F
Fulani/Fulfulde alphabet; includes casing, digits, and punctuation. -
Bamum —
U+A6A0–U+A6FF; SupplementU+16800–U+16A3F
Cameroon (historic stages A–G) with syllabic/logographic forms. -
Mende Kikakui —
U+1E800–U+1E8DF
Sierra Leone; syllabary with numerals and punctuation. -
Osmanya —
U+10480–U+104AF
Historic Somali script (20th-century). -
Old Nubian —
U+10C80–U+10CFF
Medieval Nile Valley texts; Coptic/Greek influence. -
Meroitic — Hieroglyphs
U+10980–U+1099F; CursiveU+109A0–U+109FF
Kingdom of Kush; late Egyptian-period writing system. -
Coptic —
U+2C80–U+2CFF; Coptic Epact NumbersU+102E0–U+102FF
Late Egyptian descendant used for Christian liturgy; numeric notation set.
The Americas & Adjacent
Native North American syllabics, 19th–20th-century phonetic alphabets devised in the US/UK, and modern community-born sets used in Canada and Alaska.
These blocks cover living writing systems (Cherokee, UCAS) and “adjacent” constructed sets like Deseret, Shavian, and Canadian/Alaskan numerals and shorthands.
-
Cherokee —
U+13A0–U+13FF; SupplementU+AB70–U+ABBF
Syllabary for Cherokee (Tsalagi); historic & modern forms. -
Unified Canadian Aboriginal Syllabics (UCAS) —
U+1400–U+167F; ExtendedU+18B0–U+18FF
Inuktitut, Cree, Ojibwe, Dene, etc.; core and extensions. -
Osage —
U+104B0–U+104FF
Oklahoma; dedicated alphabet with case, diacritics, digits. -
Deseret —
U+10400–U+1044F
19th-century English phonemic alphabet; uppercase/lowercase set. -
Shavian —
U+10450–U+1047F
English phonemic script; balanced letter inventory for sound mapping. -
Duployan —
U+1BC00–U+1BC9F
Shorthand used in Canada/France; includes stenographic marks and affixes. -
Kaktovik Numerals —
U+1D2C0–U+1D2DF
Base-20 numerals developed by Iñupiat students in Alaska; used alongside Latin. -
Sutton SignWriting —
U+1D800–U+1DAAF
Visual writing for sign languages; handshapes, movements, facial cues.
Ancient Near East Scripts
From Bronze-Age syllabaries to imperial alphabets, these blocks cover the written record of Mesopotamia, the Levant, and neighboring regions.
Great for labeling artifacts, showing transliteration contrasts, or building teaching aids with authentic code points (no images required).
-
Cuneiform —
U+12000–U+123FF; Numbers & PunctuationU+12400–U+1247F; Early DynasticU+12480–U+1254F
Sumerian/Akkadian logograms, numerals, and period marks. -
Ugaritic —
U+10380–U+1039F;
Old Persian —U+103A0–U+103DF
Alphabetic cuneiform (Ugarit); monumental Achaemenid inscriptions (Old Persian). -
Phoenician —
U+10900–U+1091F;
Imperial Aramaic —U+10840–U+1085F
West-Semitic line; ancestor to many later alphabets of the region. -
Old South Arabian —
U+10A60–U+10A7F;
Old North Arabian —U+10A80–U+10A9F
Ancient inscriptions across the Arabian Peninsula. -
Anatolian Hieroglyphs —
U+14400–U+1467F
Luwian hieroglyphic tradition; monumental and seal inscriptions. -
Egyptian Hieroglyphs —
U+13000–U+1342F
Formal signs; great for cartouches/labels (note font support). -
Cypriot, Linear A/B — Cypriot
U+10800–U+1083F; Linear AU+10600–U+1077F; Linear BU+10000–U+100FF
Aegean syllabaries adjacent to the Near East trade sphere.
China / Japan / Korea Symbols & Ideographs — Part 1
Foundational symbol blocks used across CJK texts: punctuation, radicals, description characters, strokes, enclosed forms, and halfwidth/fullwidth variants.
Use these for structural notation (radicals/strokes), layout-friendly enclosed forms, and width-compatible UI.
-
CJK Symbols & Punctuation —
U+3000–U+303F
Core CJK punctuation: ideographic space, corner/angle brackets, ellipsis, wave dash, iteration marks. -
CJK Radicals Supplement —
U+2E80–U+2EFF
Indexing radicals used for dictionaries and lookup; not full characters. -
Kangxi Radicals —
U+2F00–U+2FDF
The classic 214 Kangxi radicals encoded as standalone symbols. -
Ideographic Description Characters —
U+2FF0–U+2FFF
Composition operators (⿰, ⿱, etc.) to describe how an ideograph is built. -
CJK Strokes —
U+31C0–U+31EF
Basic stroke shapes for pedagogical/lexicographic use. -
Enclosed CJK Letters & Months —
U+3200–U+32FF
Circled/squared kana/ideographs and month labels for UI, calendars, badges. -
Enclosed Ideographic Supplement —
U+1F200–U+1F2FF
Extra enclosed forms (e.g., 🈵, 🈺) used in signage and labels. -
Halfwidth and Fullwidth Forms —
U+FF00–U+FFEF
Compatibility glyphs matching legacy encodings; width-aligned Latin/kana.
China / Japan / Korea Symbols & Ideographs — Part 2
Japanese kana (classic + modern small forms) and Chinese phonetic Bopomofo sets, including recent extensions used in dictionaries and educational texts.
These cover text-presentation kana and lookup/teaching phonetics.
-
Hiragana —
U+3040–U+309F
Native Japanese syllabary; core text presentation for grammar/endings. -
Katakana —
U+30A0–U+30FF(plus Phonetic ExtU+31F0–U+31FF)
Loanwords, onomatopoeia, emphasis; small vowel/ya/yu/yo forms included. -
Kana Supplement / Extensions —
U+1B000–U+1B0FF,U+1B100–U+1B12F,U+1B130–U+1B16F,U+1B170–U+1B2FF
Archaic/rare kana, small kana, and extended sets for precise phonetics. -
Bopomofo —
U+3100–U+312F; ExtendedU+31A0–U+31BF
Zhuyin phonetic symbols used for teaching/pronunciation in Chinese.
China / Japan / Korea Symbols & Ideographs — Part 3
Core Hangul blocks: Jamo for decomposition, modern syllables, compatibility Jamo, and extended Jamo used in specialized analysis.
Great for showing canonical decomposition vs. precomposed syllables and halfwidth forms.
-
Hangul Jamo —
U+1100–U+11FF, Ext-AU+A960–U+A97F, Ext-BU+D7B0–U+D7FF
Initial/medial/final letters used to compose syllables; includes extended sets. -
Hangul Syllables —
U+AC00–U+D7AF
Precomposed modern syllables covering contemporary Korean text. -
Hangul Compatibility Jamo —
U+3130–U+318F
Compatibility forms for round-trip mapping and legacy display.
China / Japan / Korea Symbols & Ideographs — Part 4
The core unified ideographs and first extension, plus compatibility ideographs used for round-trip mapping.
Use these for general CJK textual content with broad font coverage.
-
CJK Unified Ideographs —
U+4E00–U+9FFF
The main modern-use han characters used across Chinese, Japanese, Korean. -
Extension A —
U+3400–U+4DBF
Additional less-common characters; often names, dialect, historic forms. -
Compatibility Ideographs —
U+F900–U+FAFF
Duplicates for compatibility with older standards/encodings.
China / Japan / Korea Symbols & Ideographs — Part 5
Rare/archival characters and surname/variant forms in mid extensions.
These blocks require CJK Extension fonts; great for historical or name dictionaries.
-
Extension B —
U+20000–U+2A6DF
Large historic/rare set; requires CJK-Ext capable fonts. -
Extension C —
U+2A700–U+2B73F
Additional uncommon characters from historic sources. -
Extension D —
U+2B740–U+2B81F
Further rare additions (names, place forms, archival usage).
China / Japan / Korea Symbols & Ideographs — Part 6
Later extensions with many historical/geo-regional variants, plus compatibility supplement and variation selector infrastructure for precise glyph selection.
Ideal for academic editions and IVS (Ideographic Variation Sequence) demos.
-
Extension E —
U+2B820–U+2CEAF
Expanded historic/regional variants curated from major dictionaries. -
Extension F —
U+2CEB0–U+2EBEF
Continued rare additions; strong dependency on specialized fonts. -
Compatibility Ideographs Supplement —
U+2F800–U+2FA1F
Extra compatibility forms used to preserve round-trip mappings. -
Variation Selectors Supplement (IVS) —
U+E0100–U+E01EF
Selectors for registered Ideographic Variation Sequences (precise glyph choice).
China / Japan / Korea Symbols & Ideographs — Part 7
Newest batches of rare/archival characters plus shared ideographic punctuation and iteration marks used across East Asian scripts.
Good for demonstrating coverage of modern CJK fonts and the long tail of dictionary forms.
-
Extension G —
U+30000–U+3134F
Newer tranche of rare/archival ideographs (modern Unicode additions). -
Extension H —
U+31350–U+323AF
Latest extension set; expect limited font coverage on some platforms. -
Ideographic Symbols & Punctuation —
U+16FE0–U+16FFF
Iteration/tone marks and shared symbols used with ideographic text.