Unicode/Versions
This page needs additional citations for verification. Please help improve this page by adding reliable references. Unsourced material may be challenged and removed. |
This page is about each version specification, and the differences between the versions.
Unicode 1.0
Unicode 1.0 was the first version of Unicode, released October 1991. It encoded 7,094 new characters.
“Blocks”
This version of Unicode did not formally group characters in blocks. But in comparison with version 2.0, the following “blocks” were available: U+0000-U+FFFD 51 Blocks
- Basic Latin (U+0000-U+007F), containing 128 characters.
- Latin-1 Supplement (U+0080-U+00FF), containing 128 characters.
- Latin Extended-A (U+0100-U+017F), containing 127 characters.
- Latin Extended-B (U+0180-U+01FF), containing 113 characters.
- IPA Extensions (U+0250-U+02AF), containing 89 characters.
- Spacing Modifier Letters (U+02B0-U+02FF), containing 57 characters.
- Combining Diacritical Marks (U+0300-U+036F), containing 66 characters.
- Greek and Coptic (U+0370-U+03FF), containing 112 characters.
- Cyrillic (U+0400-U+04FF), containing 192 characters.
- Armenian (U+0530-U+058F), containing 84 characters.
- Hebrew (U+0590-U+05FF), containing 52 characters.
- Arabic (U+0600-U+06FF), containing 169 characters.
- Devanagari (U+0900-U+097F), containing 104 characters.
- Bengali (U+0980-U+09FF), containing 89 characters.
- Gurmukhi (U+0A00-U+0A7F), containing 74 characters.
- Gujarati (U+0A80-U+0AFF), containing 75 characters.
- Oriya (U+0B00-U+0B7F), containing 78 characters.
- Tamil (U+0B80-U+0BFF), containing 61 characters.
- Telugu (U+0C00-U+0C7F), containing 80 characters.
- Kannada (U+0C80-U+0CFF), containing 80 characters.
- Malayalam (U+0D00-U+0D7F), containing 78 characters.
- Thai (U+0E00-U+0E7F), containing 92 characters.
- Lao (U+0E80-U+0EFF), containing 70 characters.
- Tibetan (U+1000-U+105F), containing 71 characters.
- Georgian (U+10A0-U+10FF), containing 78 characters.
- General Punctuation (U+2000-U+206F), containing 67 characters.
- Superscripts and Subscripts (U+2070-U+209F), containing 28 characters.
- Currency Symbols (U+20A0-U+20CF), containing 11 characters.
- Combining Marks for Symbols (U+20D0-U+20FF), containing 18 characters.
- Letterlike Symbols (U+2100-U+214F), containing 57 characters.
- Number Forms (U+2150-U+218F), containing 48 characters.
- Arrows (U+2190-U+21FF), containing 91 characters.
- Mathematical Operators (U+2200-U+22FF), containing 242 characters.
- Miscellaneous Technical (U+2300-U+23FF), containing 43 characters.
- Control Pictures (U+2400-U+243F), containing 37 characters.
- Optical Character Recognition (U+2440-U+245F), containing 11 characters.
- Enclosed Alphanumerics (U+2460-U+24FF), containing 139 characters.
- Box Drawing (U+2500-U+257F), containing 128 characters.
- Block Elements (U+2580-U+259F), containing 22 characters.
- Geometric Shapes (U+25A0-U+25FF), containing 79 characters.
- Miscellaneous Symbols (U+2600-U+26FF), containing 106 characters.
- Dingbats (U+2700-U+27BF), containing 160 characters.
- CJK Symbols and Punctuation (U+3000-U+303F), containing 56 characters.
- Hiragana (U+3040-U+309F), containing 90 characters.
- Katakana (U+30A0-U+30FF), containing 90 characters.
- Bopomofo (U+3100-U+312F), containing 40 characters.
- Hangul Compatibility Jamo (U+3130-U+318F), containing 94 characters.
- Kanbun (U+3190-U+31FF), containing 16 characters.
- Enclosed CJK Letters and Months (U+3200-U+32FF), containing 191 characters.
- CJK Compatibility (U+3300-U+33FF), containing 187 characters.
- Hangul (U+3400-U+3D2D), containing 2,350 characters.
- Private Use Area (U+E000-U+FDFF), reserved for 5,632 characters.
- CJK Compatibility Forms (U+FE30-U+FE4F), containing 28 characters.
- Small Form Variants (U+FE50-U+FE6F), containing 26 characters.
- Arabic Presentation Forms-B (U+FE70-U+FEFF), containing 140 characters.
- Halfwidth and Fullwidth Forms (U+FF00-U+FFEF), containing 216 characters.
- Specials (U+FFF0-U+FFFF), containing 1 character.
Unicode 1.0.1
Unicode 1.0.1 was released June 1992. It encoded 28,292 characters, adding 21,204 new characters and removing 6 characters, for a net increase of 21,198 characters.
New blocks
- CJK Unified Ideographs (U+4E00-U+9FFF), containing 20,902 Han Ideographs for Chinese, Japanese and Korean, was added.
- CJK Compatibility Ideographs (U+F900-U+FAFF), containing 302 Han Ideographs for compatibility with existing character sets, was added.
Removed characters
- Letters Ka and Kha with Ogonek (total 4 characters) were removed from Cyrillic. (U+04C5-U+04C6 and U+04C9-U+04CA)
- APL Compose Operator and APL Out (total 2 characters) were removed from Miscellaneous Technical. (U+2300-U+2301)
Rearranged characters
- A Japanese Industrial Standard symbol (〄) was moved from Enclosed CJK Letters and Months (U+32FF) to CJK Symbols and Punctuation. (U+3004)
- Circled Katakana: The characters well be arranged in modern order: e.g., A, I, U, E, O, KA, KI (U+32D0-U+32FE)
- Basic Glyphs For Arabic Language: The character shapes will be arranged in different order: Isolate, Final, Initial and Medial (U+FE80-FEFC)
Characters with semantics changed
- Zero Width Non-Joiner [ZWNJ] (U+20DC)
- Zero Width Joiner [ZWJ] (U+20DD)
Unicode 1.1
Unicode 1.1 was released June 1993. It encoded 34,168 characters, adding 5,969 new characters and removing 93 characters, for a net increase of 5,876 characters. It finalized the long anticipated Han Unification.
New blocks
- Hangul Jamo (U+1100-U+11FF), containing 240 jamo for the Hangul script, was added.
- Latin Extended Additional (U+1E00-U+1EFF), containing 245 precomposed characters for transliteration and Vietnamese, was added.
- Greek Extended (U+1F00-U+1FFF), containing 233 precomposed characters for polytonic Greek, was added.
- Hangul Supplementary-A (U+3D2E-U+44B7), containing 1,930 precomposed syllables for the Hangul script, was added.
- Hangul Supplementary-B (U+44B8-U+4DFF), containing 2,376 precomposed syllables for the Hangul script, was added.
- Alphabetic Presentation Forms (U+FB00-U+FB4F), containing 57 precomposed characters and ligatures, was added.
- Arabic Presentation Forms-A (U+FB50-U+FDFF), containing 593 combinations of Arabic letters, was added.
- Combining Half Marks (U+FE20-U+FE2F), containing 4 halves of diacritical marks, was added.
Extended blocks
- The long S (ſ) (total 1 character) was added to Latin Extended-A. (U+017F)
- The Hungarian Dz, characters for transliteration purposes and precomposed characters with double grave and inverted breve (total 35 characters) were added to Latin Extended-B (U+01F1-U+01F5 and U+01FA-U+0217). The block was expanded from (U+0180-U+01FF) to (U+0180-U+024F)
- Diacritics for polytonic Greek and double width diacritics (total 6 characters) were added to Combining Diacritical Marks. (U+0342-U+0345 and U+0360-U+0361)
- Compatibility character now deprecated, Ano Teleia, and other characters (total 5 characters) were added to Greek and Coptic (U+0374-U+0375, U+037A, U+037E and U+0387).
- Additional characters for non-Slavic languages (total 38 characters) were added to Cyrillic. (U+04D0-U+04EB, U+04EE-U+04F5 and U+04F8-U+04F9)
- A ligature of Ech and Yiwn (և) (total 1 character) was added to Armenian. (U+0587)
- One deprecated compatibility character and several characters for biblical texts (total 25 characters) were added to Arabic. (U+066D and U+06D6-U+06ED)
- A sign Virama (total 1 character) was added to Gurmukhi (U+0A4D).
- Letters Candra O and E (total 3 characters) were added to Gujarati. (U+0A8D, U+0A91 and U+0AC9)
- An Ai Length mark (total 1 character) was added to Oriya. (U+0B56)
- An undertie, a pair of brackets and six formatting characters now deprecated (total 9 characters) were added to General Punctuation. (U+203F, U+2045-U+2046 and U+206A-U+206F)
- Some additional symbols and the complete set of APL functional symbols (total 79 characters) were added to Miscellaneous Technical. (U+2300 and U+232D-U+237A)
- A large circle (◯) (total 1 character) was added to Geometric Shapes. (U+25EF)
- The ideographic telegraph line feed separator symbol (〷) (total 1 character) was added to CJK Symbols and Punctuation. (U+3037)
- Four Katakana letters not in use since 1945 (total 4 characters) were added to Katakana. (U+30F7-U+30FA)
- Ideographic telegraph symbols for the twelve months (total 12 characters) were added to Enclosed CJK Letters and Months. (U+32C0-U+32CB)
- Ideographic telegraph symbols for hours and days and six additional measure units (total 62 characters) were added to CJK Compatibility. (U+3358-U+3376 and U+33E0-U+33FE)
- Some more space (total 2,304 characters) was added to the Private Use Area.
- Seven halfwidth geometric shapes (total 7 characters) were added to Halfwidth and Fullwidth Forms. (U+FFE8-U+FFEE)
Removed blocks
- Tibetan, containing 71 letters for the Tibetan script, was removed from the Unicode standard.
Removed characters
- A total of 10 characters were removed from Greek and Coptic. (U+0370-U+0372, U+03D7-U+03D9, U+03DB, U+03DD, U+03DF, and U+03E1)
- Point Varika (total 1 character) was removed from Hebrew. (U+05F5)
- Phonetic Order Vowel Signs (total 5 characters) were removed from Thai. (U+0E70-U+0E74)
- Phonetic Order Vowel Signs (total 5 characters) were removed from Lao. (U+0EF0-U+0EF4)
- An Ideographic Ditto Mark (total 1 character) was removed from CJK Symbols and Punctuation (U+3004) and merged with CJK Unified Ideograph-4EDD.
Rearranged characters
- Greek character U+03F3 was changed from Spacing Tonos to Letter Yot.
- A Japanese Industrial Standard symbol (〄) was moved from Enclosed CJK Letters and Months (U+32FF) to CJK Symbols and Punctuation. (U+3004)
Unicode 2.0
Unicode 2.0 was released July 1996. It encoded 38,885 characters, adding 11,373 new characters and removing 6,656 characters, for a net increase of 4,717 characters. This was the first Unicode version to reserve blocks outside of the Basic Multilingual Plane.
New blocks
- Hangul Syllables (U+AC00-U+D7AF), containing 11,172 precomposed syllables for the Hangul script, was added.
- High Surrogates (U+D800-U+DB7F), containing 896 characters, was added.
- High Private Use Surrogates (U+DB80-U+DBFF), containing 128 characters, was added.
- Low Surrogates (U+DC00-U+DFFF), containing 1,024 characters, was added.
- Supplementary Private Use Area-A (U+F0000-U+FFFFF), reserving 65,534 characters for private use, was added.
- Supplementary Private Use Area-B (U+100000-U+10FFFF), reserving 65,534 characters for private use, was added.
Reinstated blocks
- Tibetan (U+0F00-U+0FFF), now containing 168 characters for the Tibetan script including religious signs, was readded.
Removed blocks
- Hangul, containing 2,350 precomposed syllables for the Hangul script, was removed from the Unicode standard.
- Hangul Supplementary-A, containing 1,930 precomposed syllables for the Hangul script, was removed from the Unicode standard.
- Hangul Supplementary-B, containing 2,376 precomposed syllables for the Hangul script, was removed from the Unicode standard.
Extended blocks
- Cantillation marks for use in religious texts (total 31 characters) were added to Hebrew. (U+0591-U+05A1, U+05A3-U+05AF and U+05C4)
- A long S with Dot Above (total 1 character) was added to Latin Extended Additional. (U+1E9B)
- A Vietnamese Dong sign (total 1 character) was added to Currency Symbols. (U+20AB)
Unicode 2.1
Unicode 2.1 was released May 1998. It encoded 38,887 characters, adding only 2 new characters.
Extended blocks
- A Euro sign (total 1 character) was added to Currency Symbols. (U+20AC)
- An Object Replacement Character (total 1 character) was added to Specials. (U+FFFC)
Unicode 3.0
Unicode 3.0 was released September 1999. It was a big update and encoded 49,194 characters, adding 10,307 new characters.
New blocks
- Syriac (U+0700-U+074F), containing 71 characters used for writing in Syriac script, was added.
- Thaana (U+0780-U+07BF), containing 49 characters used for writing in Thaana script, was added.
- Sinhala (U+0D80-U+0DFF), containing 80 characters for the Sinhala script, was added.
- Myanmar (U+1000-U+109F), containing 78 characters for the Burmese script, was added.
- Ethiopic (U+1200-U+137F), containing 345 syllables and punctuation marks for the Ethiopic script, was added.
- Cherokee (U+13A0-U+13FF), containing 85 syllables for the Cherokee script, was added.
- Unified Canadian Aboriginal Syllabics (U+1400-U+167F), containing 630 syllables and punctuation marks for writing in aboriginal languages of Canada, was added.
- Ogham (U+1680-U+169F), containing 29 characters for the ancient Ogham script, was added.
- Runic (U+16A0-U+16FF), containing 81 characters for the Germanic runes, was added.
- Khmer (U+1780-U+17FF), containing 103 characters for the Khmer script, was added.
- Mongolian (U+1800-U+18AF), containing 155 characters for the classical Mongolian script, was added.
- Braille Patterns (U+2800-U+28FF), containing 256 Braille letters, was added.
- CJK Radicals Supplement (U+2E80-U+2EFF), containing 115 non-Kangxi radicals, was added.
- Kangxi Radicals (U+2F00-U+2FDF), containing 214 radicals from the Kangxi dictionary, was added.
- Ideographic Description Characters (U+2FF0-U+2FFF), containing 12 characters used to describe a Han ideograph not available in the font, was added.
- Bopomofo Extended (U+31A0-U+31BF), containing 24 characters used for phonetic transcription of minority languages of Taiwan, was added.
- CJK Unified Ideographs Extension A (U+3400-U+4DBF), containing 6,582 additional Han Ideographs, was added.
- Yi Syllables (U+A000-U+A48F), containing 1,165 syllables of the modern Yi script, was added.
- Yi Radicals (U+A490-U+A4CF), containing 50 radicals of Yi Syllables, was added.
Extended blocks
- Additional precomposed characters, letters and capital letters of lowercase-only letters (total 30 characters) were added to Latin Extended-B. (U+01F6-U+01F9, U+0218-U+021F and U+0222-U+0233)
- Extensions for disordered speech (total 5 characters) were added to IPA Extensions. (U+02A9-U+02AD)
- Some additional modifier letters (total 6 characters) were added to Spacing Modifier Letters. (U+02DF and U+02EA-U+02EE)
- Additional combining diacritics for IPA (total 10 characters) were added to Combining Diacritical Marks. (U+0346-U+034E and U+0362)
- Lowercase versions of archaic letters and the Kai symbol (total 5 characters) were added to Greek and Coptic. (U+03D7, U+03DB, U+03DD, U+03DF and U+03E1)
- Nonstandard letters for Macedonian, combining numeral signs and three letters for Kildin Sami (total 12 characters) were added to Cyrillic. (U+0400, U+040D, U+0450, U+045D, U+0488-U+0489, U+048C-U+048F and U+04EC-U+04ED)
- A Hyphen (total 1 character) was added to Armenian. (U+058A)
- Combining hamza and maddah and nine additional Arabic characters (total 12 characters) were added to Arabic. (U+0653-U+0655, U+06B8-U+06B9, U+06BF, U+06CF and U+06FA-U+06FE)
- Additional letters and religious symbols (total 25 characters) were added to Tibetan. (U+0F6A, U+0F96, U+0FAE-U+0FB0, U+0FB8, U+0FBA-U+0FBC, U+0FBE-U+0FCC and U+0FCF)
- A narrow no-break space and 6 additional punctuation marks (total 7 characters) were added to General Punctuation. (U+202F and U+2048-U+204D)
- The Kip, Tugrik and Drachma sign (total 3 characters) were added to Currency Symbols. (U+20AD-U+20AF)
- An enclosing screen and an enclosing key (total 2 characters) were added to Combining Diacritical Marks for Symbols. (U+20E2-U+20E3)
- The information symbol and a rotated Q (total 2 characters) were added to Letterlike Symbols. (U+2139-U+213A)
- A mirrored Roman capital numeral hundred (Ↄ) (total 1 character) was added to Number Forms. (U+2183)
- Some additional arrows (total 9 characters) were added to Arrows. (U+21EB-U+21F3)
- Some additional technical symbols, including common keys on a 101 keyboard (total 33 characters) were added to Miscellaneous Technical. (U+2301, U+237B and U+237D-U+239A)
- Two additional control pictures (total 2 characters) were added to Control Pictures. (U+2425-U+2426)
- Squares and circles with quadrants (total 8 characters) were added to Geometric Shapes. (U+25F0-U+25F7)
- Two Syriac crosses and a signature mark (total 3 characters) were added to Miscellaneous Symbols. (U+2619 and U+2670-U+2671)
- Three Hangzhou numerals and a variation indicator (total 4 characters) were added to CJK Symbols and Punctuation. (U+3038-U+303A and U+303E)
- A ligature Yod with Hiriq (יִ) (total 1 character) was added to Alphabetic Presentation Forms. (U+FB1D)
- Three additional control characters for ruby markup (total 3 characters) were added to Specials. (U+FFF9-U+FFFB)
Unicode 3.1
Unicode 3.1 was released March 2001. It encoded 94,140 characters, adding 44,946 new characters, and mainly focused on blocks outside of the Basic Multilingual Plane.
New blocks
- Old Italic (U+10300-U+1032F), containing 35 letters for the Etruscan script, was added.
- Gothic (U+10330-U+1034F), containing 27 letters for the Gothic script, was added.
- Deseret (U+10400-U+1044F), containing 76 letters for the constructed Deseret script, was added.
- Byzantine Musical Symbols (U+1D000-U+1D0FF), containing 246 symbols for musical notation in Byzantine, was added.
- Musical Symbols (U+1D100-U+1D1FF), containing 219 characters for current musical notation, was added.
- Mathematical Alphanumeric Symbols (U+1D400-U+1D7FF), containing 991 Latin and Greek letters in serif, sans-serif, bold, italic, double-struck, script and Fraktur/Blackletter, was added.
- CJK Unified Ideographs Extension B (U+20000-U+2A6DF), containing 42,711 additional Chinese Ideographs, was added.
- CJK Compatibility Ideographs Supplement (U+2F800-U+2FA1F), containing 542 additional Chinese Ideographs for compatibility purposes, was added.
- Tags, containing 97 language tags, was added. (U+E0000-U+E007F)
Extended noncharacters
- The Noncharacters range: U+FDD0..U+FDEF were added to Arabic Presentation Forms-A.
Extended blocks
- The capital Theta symbol and the Lunate Epsilon symbol (total 2 characters) were added to Greek and Coptic. (U+03F4-U+03F5)
Characters and Scripts Under Investigation or Rejected
- Khmer Sign Laak Was Rejected. (U+17DD) From Khmer.
- Georgian Letter U-Brjuu Was Rejected. From Georgian.
Unicode 3.2
Unicode 3.2 was released March 2002. It encoded 95,156 characters, adding 1,016 new characters.
New blocks
- Cyrillic Supplement (U+0500-U+052F), containing 16 characters used for the Komi language, was added.
- Tagalog (U+1700-U+171F), containing 20 characters for the Baybayin script, was added.
- Hanunoo (U+1720-U+173F), containing 23 characters and punctuation for the Hanunoo script, was added.
- Buhid (U+1740-U+175F), containing 20 characters for the Buhid script, was added.
- Tagbanwa (U+1760-U+177F), containing 18 characters for the Tagbanwa script, was added.
- Miscellaneous Mathematical Symbols-A (U+27C0-U+27EF), containing 28 symbols used in math notation, was added.
- Supplemental Arrows-A (U+27F0-U+27FF), containing 16 additional arrows, was added.
- Supplemental Arrows-B (U+2900-U+297F), containing 128 special arrows, was added.
- Miscellaneous Mathematical Symbols-B (U+2980-U+29FF), containing 128 additional mathematical symbols, was added.
- Supplemental Mathematical Operators (U+2A00-U+2AFF), containing 256 additional mathematical operators, was added.
- Katakana Phonetic Extensions (U+31F0-U+31FF), containing 16 Katakana letters used for Ainu, was added.
- Variation Selectors (U+FE00-U+FE0F), containing 16 symbols used for indicating variations, was added.
Extended blocks
- A capital letter N with Long Right Leg (total 1 character) was added to Latin Extended-B. (U+0220)
- The combining grapheme joiner and combining Latin letters used in medieval texts (total 14 characters) were added to Combining Diacritical Marks. (U+034F and U+0363-U+036F)
- The Qoppa and a reversed lunate epsilon symbol (total 3 characters) were added to Greek and Coptic. (U+03D8-U+03D9 and U+03F6)
- Four additional letters used for the Kildin Sami language (total 8 characters) were added to Cyrillic. (U+048A-U+048B, U+04C5-U+04C6, U+04C9-U+04CA and U+04CD-U+04CE)
- A dotless Beh and a dotless Qaf (total 2 characters) were added to Arabic. (U+066E-U+066F)
- A Letter for Addu dialect (total 1 character) was added to Thaana. (U+07B1)
- The letters Yn and Elifi (total 2 characters) were added to Georgian. (U+10F7-U+10F8)
- Some additional punctuation marks and control characters (total 12 characters) were added to General Punctuation. (U+2047, U+204E-U+2052, U+2057 and U+205F-U+2063)
- A superscript letter I (total 1 character) was added to Superscripts and Subscripts. (U+2071)
- German Penny and Peso sign (total 2 characters) were added to Currency Symbols. (U+20B0-U+20B1)
- Some additional combining characters (total 7 characters) were added to Combining Diacritical Marks for Symbols. (U+20E4-U+20EA)
- Some double-struck and reversed/turned letters (total 15 characters) were added to Letterlike Symbols. (U+213D-U+214B)
- Some additional arrows (total 12 characters) were added to Arrows. (U+21F4-U+21FF)
- Some additional mathematical operators (total 14 characters) were added to Mathematical Operators. (U+22F2-U+22FF)
- Variable-width and additional symbols (total 53 characters) were added to Miscellaneous Technical. (U+237C and U+239B-U+23CE)
- Black and double circled numerals (total 20 characters) were added to Enclosed Alphanumerics. U+24EB-U+24FE)
- Quadrant elements (total 10 characters) were added to Block Elements. (U+2596-U+259F)
- Some additional triangles and squares (total 8 characters) were added to Geometric Shapes. (U+25F8-U+25FF)
- Shogi pieces ,recycling symbols, dices and dotted circles (total 24 characters) were added to Miscellaneous Symbols. (U+2616-U+2617, U+2672-U+267D and U+2680-U+2689)
- Additional parenthesis (total 14 characters) were added to Dingbats. (U+2768-U+2775)
- Three additional marks (total 3 characters) were added to CJK Symbols and Punctuation. (U+303B-U+303D)
- A digraph and two additional characters (total 3 characters) were added to Hiragana. (U+3095-U+3096 and U+309F)
- A digraph and a double hyphen (total 2 characters) were added to Katakana. (U+30A0 and U+30FF)
- Additional circled numerals (total 30 characters) were added to Enclosed CJK Letters and Months. (U+3251-U+325F and U+32B1-U+32BF
- Five missing radicals (total 5 characters) were added to Yi Radicals. (U+A4A2-U+A4A3, U+A4B4, U+A4C1, U+A4C5)
- Additional compatibility characters (total 59 characters) were added to CJK Compatibility Ideographs. (U+FA30-U+FA6A)
- A Rial sign (total 1 character) was added to Arabic Presentation Forms-A. (U+FDFC)
- Two sesame dots (total 2 characters) were added to CJK Compatibility Forms. (U+FE45-U+FE46)
- A tail fragment (total 1 character) was added to Arabic Presentation Forms-B. (U+FE73)
- A pair of double parenthesis (total 2 characters) was added to Halfwidth and Fullwidth Forms. (U+FF5F-U+FF60)
Unicode 4.0
Unicode 4.0 was released April 2003. It encoded 96,382 characters, adding 1,226 new characters.
New blocks
- Limbu, containing 66 characters for the Limbu abugida, was added.
- Tai Le, containing 35 letters for the Tai Le script, was added.
- Khmer Symbols, containing 32 symbols for the lunar calendar, was added.
- Phonetic Extensions, containing 108 letters used in phonetic transcription, was added.
- Miscellaneous Symbols and Arrows, containing 14 additional arrows, was added.
- Yijing Hexagram Symbols, containing 64 hexagrams, was added.
- Linear B Syllabary, containing 88 syllables of the ancient Linear B script, was added.
- Linear B Ideograms, containing 123 ideograms of the ancient Linear B script, was added.
- Aegean Numbers, containing 57 numerals used in the Aegean area, was added.
- Ugaritic, containing 31 characters used in Ugaritic cuneiform, was added.
- Shavian, containing 48 letters used for the artificial Shavian script, was added.
- Osmanya, containing 40 characters used in the artificial Osmanya script, was added.
- Cypriot Syllabary, containing 55 characters formerly used on Cyprus, was added.
- Tai Xuan Jing Symbols, containing 87 symbols of Tai Xuan Jing, was added.
- Variation Selectors Supplement, containing 240 additional variation selectors, was added.
Extended blocks
- Letters with curl used in Sinology (total 4 characters) were added to Latin Extended-B.
- Former IPA letters (total 2 characters) were added to IPA Extensions.
- Some additional characters (total 17 characters) were added to Spacing Modifier Letters.
- Additional combining double-width diacritics and diacritics corresponding to their spacing equivalent (total 11 characters) were added to Combining Diacritical Marks.
- The archaic letters Sho and San and the capital Lunate Sigma (total 5 characters) were added to Greek and Coptic.
- Some additional markers, biblical signs, and letters with inverted V (total 19 characters) were added to Arabic.
- Letters used for foreign words from Persian and Sogdian (total 6 characters) were added to Syriac.
- The short A (ऄ) (total 1 character) was added to Devanagari.
- The Avagraha sign (ঽ) (total 1 character) was added to Bengali.
- The Adak Bindi and Visarga signs (total 2 characters) were added to Gurmukhi.
- The vocalic l and ll and the Rupee sign (total 5 characters) were added to Gujarati.
- The letters Va and Wa (total 2 characters) were added to Oriya.
- Additional signs for date and finance environments (total 8 characters) were added to Tamil.
- The Nukta and Avagraha signs (total 2 characters) were added to Kannada.
- Some symbols and signs (total 11 characters) were added to Khmer.
- An inverted undertie and a swung dash (total 2 characters) were added to General Punctuation.
- The facsimile sign (℻) (total 1 character) was added to Letterlike Symbols.
- The eject symbol and a vertical line (total 2 characters) were added to Miscellaneous Technical.
- A black circled digit zero (⓿) (total 1 character) was added to Enclosed Alphanumerics.
- Monograms and diagrams, flags, warning and weather symbols and a cup of tea (total 12 characters) were added to Miscellaneous Symbols.
- Additional parenthesized and circled Korean characters and supplemental signs (total 9 characters) were added to Enclosed CJK Letters and Months.
- Additional measure units (total 7 characters) were added to CJK Compatibility.
- An additional Arabic sign (﷽) (total 1 character) was added to Arabic Presentation Forms-A.
- A pair of vertical parenthesis (total 2 characters) was added to CJK Compatibility Forms.
- The letters Oi and Ew (total 4 characters) were added to Deseret.
- A small script l (ℓ) (total 1 character) was added to Mathematical Alphanumeric Symbols.
Unicode 4.1
Unicode 4.1 was released March 31, 2005. It encoded 97,655 characters, adding 1,273 new characters.
New blocks
- Arabic Supplement, containing 30 characters for various languages written with the Arabic script, was added.
- Ethiopic Supplement, containing 26 characters and signs for Sebatbeit, was added.
- New Tai Lue, containing 80 characters for the New Tai Lue script, was added.
- Buginese, containing 30 characters for the Lontara script, was added.
- Phonetic Extensions Supplement, containing 64 additional letters for phonetic transcription, was added.
- Combining Diacritical Marks Supplement, containing 4 additional diacritics, was added.
- Glagolitic, containing 94 characters for the Glagolitic script, was added.
- Coptic, containing 114 characters for the Coptic script, was added.
- Georgian Supplement, containing 38 Nuskhuri letters, was added.
- Tifinagh, containing 55 characters for the Tifinagh script, was added.
- Ethiopic Extended, containing 79 additional Ethiopic syllables, was added.
- Supplemental Punctuation, containing 26 additional punctuation marks, was added.
- CJK Strokes, containing 16 strokes for Han Ideographs, was added.
- Modifier Tone Letters, containing 23 letters for Chinese tones, was added.
- Syloti Nagri, containing 44 characters for the Syloti Nagri abugida, was added.
- Vertical Forms, containing 10 punctuation marks suited for vertical text, was added.
- Ancient Greek Numbers, containing 75 numerals and signs used in Ancient Greek, was added.
- Old Persian, containing 50 characters for Old Persian cuneiform, was added.
- Kharoshthi, containing 65 characters for the Kharoshthi abugida, was added.
- Ancient Greek Musical Notation, containing 70 musical signs used in Ancient Greek, was added.
Extended blocks
- Letters for Sencoten, digraphs, letters with swash tail and other additions (total 11 characters) were added to Latin Extended-B.
- Additional diacritics for transliteration (total 5 characters) were added to Combining Diacritical Marks.
- Rho with stroke, reversed and dotted Lunate Sigma (total 4 characters) were added to Greek and Coptic.
- Ghe with descender (Ӷ) (total 2 characters) was added to Cyrillic.
- An additional biblical mark and some punctuation marks (total 4 characters) were added to Hebrew.
- Additional biblical marks, punctuation marks and the Afghani sign (total 8 characters) were added to Arabic.
- A glottal stop (ॽ) (total 1 character) was added to Devanagari.
- The Khanda Ta letter (ৎ) (total 1 character) was added to Bengali.
- The letter Sha and the digit zero (total 2 characters) were added to Tamil.
- Two marks used in Bhutan (total 2 characters) were added to Tibetan.
- Two letters and a modifier letter (total 3 characters) were added to Georgian.
- Some additional syllables (total 11 characters) were added to Ethiopic.
- Additional phonetic symbols (total 20 characters) were added to Phonetic Extensions.
- A flower and dot punctuation marks (total 9 characters) were added to General Punctuation.
- Additional subscript letters (total 5 characters) were added to Superscripts and Subscripts.
- The Guarani, Austral, Hryvnia and Cedi signs (total 4 characters) were added to Currency Symbols.
- A combining long double solidus (⃫) (total 1 character) was added to Combining Diacritical Marks for Symbols.
- The per sign and a double-struck letter Pi (total 2 characters) were added to Letterlike Symbols.
- Metrical and electrical signs (total 11 characters) were added to Miscellaneous Technical.
- Additional gender and map symbols (total 30 characters) were added to Miscellaneous Symbols.
- Some additional mathematical symbols (total 7 characters) were added to Miscellaneous Mathematical Symbols-A.
- Additional arrows and squares (total 6 characters) were added to Miscellaneous Symbols and Arrows.
- A circled Hangul character (㉾) (total 1 character) was added to Enclosed CJK Letters and Months.
- Additional Han Ideographs (total 22 characters) were added to CJK Unified Ideographs.
- Additional Compatibility Ideographs (total 106 characters) were added to CJK Compatibility Ideographs.
- Italic dotless small i and j (total 2 characters) were added to Mathematical Alphanumeric Symbols.
Unicode 5.0
Unicode 5.0 was released July 14, 2006. It encoded 99,024 characters, adding 1,369 new characters.
New blocks
- N'Ko, containing 59 characters for the N'Ko script, was added.
- Balinese, containing 121 characters and musical signs for the Balinese abugida, was added.
- Latin Extended-C, containing 17 letters for various languages, was added.
- Latin Extended-D, containing 2 characters for UPA, was added.
- Phags-pa, containing 56 characters for the Phags-pa script, was added.
- Phoenician, containing 27 letters and numerals for the Phoenician script, was added.
- Cuneiform, containing 879 signs for Sumero-Akkadian Cuneiform, was added.
- Cuneiform Numbers and Punctuation, containing 103 numerals and punctuation signs for Sumero-Akkadian Cuneiform, was added.
- Counting Rod Numerals, containing 18 numerals used with counting rods, was added.
Extended blocks
- Various letters used mainly for aboriginal languages (total 14 characters) were added to Latin Extended-B.
- Lowercase lunate sigma symbols (total 3 characters) were added to Greek and Coptic.
- Lowercase palochka and 3 letters used in Nivkh (total 7 characters) were added to Cyrillic.
- Two letters used in Khanty and other languages (total 4 characters) were added to Cyrillic Supplement.
- A specific point meant for Vav (ֺ) (total 1 character) was added to Hebrew.
- Four letters used in Sindhi (total 4 characters) were added to Devanagari.
- Four letters used in Sanskrit (total 4 characters) were added to Kannada.
- Additional IPA diacritics (total 9 characters) were added to Combining Diacritical Marks Supplement.
- Four combining arrows (total 4 characters) were added to Combining Diacritical Marks for Symbols.
- A danish symbol and a lowercase turned F (total 2 characters) were added to Letterlike Symbols.
- A lowercase reversed C (ↄ) (total 1 character) was added to Number Forms.
- Vertical parenthesis, geometric forms and electrical symbols (total 12 characters) were added to Miscellaneous Technical.
- A neuter symbol (⚲) (total 1 character) was added to Miscellaneous Symbols.
- Four additional mathematical symbols (total 4 characters) were added to Miscellaneous Mathematical Symbols-A.
- Additional squares, pentagons and hexagons (total 11 characters) were added to Miscellaneous Symbols and Arrows.
- Four additional tone letters used in Chinantec (total 4 characters) were added to Modifier Tone Letters.
- Bold Digamma (𝟊/Ϝ) (total 2 characters) was added to Mathematical Alphanumeric Symbols.
Unicode 5.1
Unicode 5.1 was released April 4, 2008. It encoded 100,648 characters, adding 1,624 new characters.
New blocks
- Sundanese, containing 55 letters for Sundanese script, was added.
- Lepcha, containing 74 letters for Lepcha script, was added.
- Ol Chiki, containing 48 letters for Ol Chiki script, was added.
- Cyrillic Extended-A, containing 32 letters for combining Cyrillic letters, was added.
- Vai, containing 300 letters for Vai script, was added.
- Cyrillic Extended-B, containing 78 letters for additional Cyrillic characters, was added.
- Saurashtra, containing 81 letters for Saurashtra script, was added.
- Kayah Li, containing 48 letters for Kayah languages, was added.
- Rejang, containing 37 letters for Rejang script, was added.
- Cham, containing 83 letters for Cham script, was added.
- Ancient Symbols, containing 12 characters for weights and measures and other Ancient symbols, was added.
- Phaistos Disc, containing 46 hieroglyphs for Phaistos, was added.
- Lycian, containing 29 letters for Lycian script, was added.
- Carian, containing 49 letters for Carian script, was added.
- Lydian, containing 27 letters for Lydian script, was added.
- Mahjong Tiles, containing 44 mahjong tiles, was added.
- Domino Tiles, containing 100 domino tiles, was added.
Extended blocks
- Archaic letters and capital kai symbol (total 7 characters) were added to Greek and Coptic.
- Combining Pokrytie (total 1 character) was added to Cyrillic.
- Mordvin, Kurdish, Aleut and Chuvash letters (total 16 characters) were added to Cyrillic Supplement.
- Radix symbols, Letterlike, punctuation, Koranic annotation signs and additions for early Persian and Azerbaijani (total 15 characters) were added to Arabic.
- Additional letters in Torwali, Burushaski and early Persian (total 18 characters) were added to Arabic Supplement.
- High spacing dot and candra a (total 2 characters) were added to Devanagari.
- Udaat and yakash signs (total 2 characters) were added to Gurmukhi.
- Vocalic rr, l and ll (total 3 characters) were added to Oriya.
- Om symbol (ௐ) (total 1 character) was added to Tamil.
- Avagraha, additional phonetic letters, vocalic l and ll, fractional signs and tuumu (total 13 characters) were added to Telugu.
- Avagraha, vocalic rr, l and ll, Malayalam numerics and fractions and chillu letters (total 17 characters) were added to Malayalam.
- Letters for Balti and various symbols (total 6 characters) were added to Tibetan.
- Characters for various languages (total 78 characters) were added to Myanmar.
- Manchu Ali Gali lha (ᢪ) (total 1 character) was added to Mongolian.
- Miscellaneous combining marks (total 28 characters) were added to Combining Diacritical Marks Supplement.
- Medievalist latin letters and miscellaneous letters (total 10 characters) were added to Latin Extended Additional.
- Invisible plus (+) (total 1 character) was added to General Punctuation.
- Combining asterisk above ( ⃰)(total 1 character) was added to Combining Diacritical Marks for Symbols.
- Symbol for Samaritan Source (⅏) (total 1 character) was added to Letterlike Symbols.
- Archaic Roman Numerals (total 4 characters) were added to Number Forms.
- Outlined white star and other signs (total 15 characters) were added to Miscellaneous Symbols.
- Long division and additional mathematical brackets (total 5 characters) were added to Miscellaneous Mathematical Symbols-A.
- Miscellaneous signs (total 51 characters) were added to Miscellaneous Symbols and Arrows.
- Additional latin letters (total 12 characters) were added to Latin Extended-C.
- Additional punctuation (total 23 characters) were added to Supplemental Punctuation.
- Letter ih (ㄭ) (total 1 character) was added to Bopomofo.
- Other strokes (total 20 characters) were added to CJK Strokes.
- Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
- Africanist tone letters (total 5 characters) were added to Modifier Tone Letters.
- Miscellaneous letters and symbols (total 112 characters) were added to Latin Extended-D.
- Continuous macrons for Coptic (total 3 characters) were added to Combining Half Marks.
- Musical symbol multiple measure rest (𝄩) (total 1 character) was added to Musical Symbols.
Unicode 5.2
Unicode 5.2 was released in October 1, 2009. It encoded 107,296 characters, adding 6,648 new characters.
New blocks
- Samaritan, containing 61 letters for Samaritan script, was added.
- Unified Canadian Aboriginal Syllabics Extended, containing 70 syllables for various cree languages, was added.
- Tai Tham, containing 127 letters for Tai Tham script, was added.
- Vedic Extensions, containing 35 characters for tone marks and signs, was added.
- Lisu, containing 48 letters for Lisu script, was added.
- Bamum, containing 88 letters for Bamum script, was added.
- Common Indic Number Forms, containing 10 fractions and marks, was added.
- Devanagari Extended, containing 28 additional marks, was added.
- Hangul Jamo Extended-A, containing 29 characters for additional old initial consonants in hangul jamo, was added.
- Javanese, containing 91 letters for Javanese script, was added.
- Myanmar Extended-A, containing 28 letters for Khamti Shan in Myanmar, was added.
- Tai Viet, containing 72 letters for Tai Viet script, was added.
- Meetei Mayek, containing 56 letters for Meetei Mayek script, was added.
- Hangul Jamo Extended-B, containing 72 characters for additional old medieval vowels and final consonants in hangul jamo, was added.
- Imperial Aramaic, containing 31 characters for Old Aramaic, was added.
- Old South Arabian, containing 32 letters and numbers for South Arabian, was added.
- Avestan, containing 61 characters for Avestan script, was added.
- Inscriptional Parthian, containing 30 characters for Inscriptional Parthian script, was added.
- Inscriptional Pahlavi, containing 27 characters for Inscriptional Pahlavi script, was added.
- Old Turkic, containing 73 characters for Orkhon script, was added.
- Rumi Numeral Symbols, containing 31 numeric characters used in Fez, Morocco, and elsewhere in North Africa and the Iberian peninsula, between the tenth and seventeenth centuries, was added.
- Kaithi, containing 66 letters for Kaithi script, was added.
- Egyptian Hieroglyphs, containing 1,071 hieroglyphs for Egyptian, was added.
- Enclosed Alphanumeric Supplement, containing 63 additional circled, parenthesized and squared alphanumerics, was added.
- Enclosed Ideographic Supplement, containing 44 squared and tortoised shell bracketed ideographs, was added.
- CJK Unified Ideographs Extension C, containing 4,149 additional Chinese Ideographs, was added.
Extended blocks
- Abhaz letters (total 2 characters) were added to Cyrillic Supplement.
- Inverted Candrabinbu and additional signs and letters (total 5 characters) were added to Devanagari.
- Ganda Mark (৻) (total 1 character) was added to Bengali.
- Religious svasti signs (total 4 characters) were added to Tibetan.
- Extensions for Khamti Shan and Alton and Phake (total 4 characters) were added to Myanmar.
- Additional old initial consonants, medival vowels, and old final consonants (total 16 characters) were added to Hangul Jamo.
- Hyphen and additional syllables (total 10 characters) were added to Unified Canadian Aboriginal Syllabics.
- Letter Sua and Tham Digit One (total 3 characters) were added to New Tai Lue.
- Combing Almost Equal to Below ( ᷽) (total 1 character) was added to Combining Diacritical Marks Supplement.
- The Live Tournosis, Spesmillo and Tenge signs (total 3 characters) were added to Currency Symbols.
- Additional vulgar fractions from ARIB STD B24 (total 4 characters) were added to Number Forms.
- Decimal exponent symbol (⏨) from ARIB STD B24 (total 1 characters) was added to Miscellaneous Technical.
- A soccer ball and symbols from ARIB STD B24 (total 59 characters) were added to Miscellaneous Symbols.
- Heavy exclamation mark symbol (❗) from ARIB STD B24 (total 1 character) was added to Dingbats.
- Traffic sign, dictionary and map symbols from ARIB STD B24 (total 5 characters) were added to Miscellaneous Symbols and Arrows.
- Capital letter turned alpha and additions for shona (total 3 characters) were added to Latin Extended-C.
- Cryptogrammic letters and combining marks (total 7 characters) were added to Coptic.
- Word separator middle dot used in Avestan (⸱) (total 1 character) was added to Supplemental Punctuation.
- Circled ideographs and numbers on black squares from ARIB STD B24 (total 12 characters) were added to Enclosed CJK Letters and Months.
- Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
- Miscellaneous additions for compatibility (total 3 characters) were added to CJK Compatibility Ideographs.
- Number two and three (total 2 characters) were added to Phoenician.
Unicode 6.0
Unicode 6.0 was released in October 11, 2010. It encoded 109,384 characters, adding 2,088 new characters.
New blocks
- Mandaic, containing 29 letters for Mandaic script, was added.
- Batak, containing 56 letters for Batak script, was added.
- Ethiopic Extended-A, containing 32 letters for Gamo-Gofa-Dawro, Basketo and Gumuz Ethiophic syllables, was added.
- Brahmi, containing 108 characters for ancient Brahmi abugida, was added.
- Bamum Supplement, containing 761 letters for additional Bamum script, was added.
- Kana Supplement, containing 2 characters for archaic katakana, was added.
- Playing Cards, containing 59 playing cards, was added.
- Miscellaneous Symbols and Pictographs, containing 529 additional symbols, was added.
- Emoticons, containing 63 faces, cat faces and gesture symbols, was added.
- Transport and Map Symbols, containing 70 transportation, traffic signs and other symbols, was added.
- Alchemical Symbols, containing 116 symbols for elements, was added.
- CJK Unified Ideographs Extension D, containing 222 miscellaneous Han ideographs, was added.
Extended blocks
- Azerbaijani letters (total 2 characters) were added to Cyrillic Supplement.
- Kashmiri Yeh and Wavy hamza below (total 2 characters) were added to Arabic.
- Dependent vowel signs and letters used in Kashmiri and Bihari (total 10 characters) were added to Devanagari.
- Fraction signs (total 6 characters) were added to Oriya.
- Letters used in scholarly only and letter dot reph (total 3 characters) were added to Malayalam.
- Leading and Trailing Mchan Rtags (total 6 characters) were added to Tibetan.
- Additional combining marks (total 2 characters) were added to Ethiopic.
- Combining Double Inverted Breve Below (᷼) (total 1 character) was added to Combining Diacritical Marks Supplement.
- Miscellaneous subscript letters (total 8 characters) were added to Superscripts and Subscripts.
- Indian Rupee Sign (₹) (total 1 character) was added to Currency Symbols.
- Pointing double triangle and additional mechanical symbols (total 11 characters) were added to Miscellaneous Technical.
- Ophiucisus, astronomical symbol for uranus and pentagrams (total 6 characters) were added to Miscellaneous Symbols.
- Additional heavy punctation marks, raised fist, raised hand, sparkles, heavy arithmetic symbols and curly loops (total 16 characters) were added to Dingbats.
- Squared logicals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A.
- Separator mark and consonant joiner (total 2 characters) were added to Tifinagh.
- Bopomofo for Hmu and Ge (total 3 characters) were added to Bopomofo Extended.
- Reversed Tse (total 2 characters) were added to Cyrillic Extended-B.
- Additional letters (total 15 characters) were added to Latin Extended-D.
- Pedagogical symbols (total 16 characters) were added to Arabic Presentation Forms-A.
- Additional squared, black circled and squared letters and regional indicator letters (total 107 characters) were added to Enclosed Alphanumeric Supplement.
- Squared katakana, squared ideographs and circled advantage and accept (total 13 characters) were added to Enclosed Ideographic Supplement.
Unicode 6.1
Unicode 6.1 was released in January 31, 2012. It encoded 110,116 characters, adding 732 new characters.
New blocks
- Arabic Extended-A (U+08A0-U+08FF), containing 39 characters, was added.
- Sundanese Supplement (U+1CC0-U+1CCF), containing 8 characters, was added.
- Meetei Mayek Extensions (U+AAE0-U+AAFF), containing 23 characters, was added.
- Meroitic Hieroglyphs (U+10980-U+1099F), containing 32 characters, was added.
- Meroitic Cursive (U+109A0-U+109FF), containing 26 characters, was added.
- Sora Sompeng (U+110D0-U+110FF), containing 35 characters, was added.
- Chakma (U+11100-U+1114F), containing 67 characters, was added.
- Sharada (U+11180-U+111DF), containing 83 characters, was added.
- Takri (U+11680-U+116CF), containing 66 characters, was added.
- Miao (U+16F00-U+16F9F), containing 133 characters, was added.
- Arabic Mathematical Alphabetic Symbols (U+1EE00-U+1EEFF), containing 143 characters, was added.
Extended blocks
- An Armenian Dram sign (total 1 character) was added to Armenian. (U+058F)
- A sign Samvat (total 1 character) was added to Arabic. (U+0604)
- An Abbreviation mark (total 1 character) was added to Gujarati. (U+0AF0)
- Letters for Khmu (total 2 characters) were added to Lao. (U+0EDE-U+0EDF)
- Capital letter Yn, letter Aen, Hard and Labial sign (total 5 characters) were added to Georgian. (U+10C7, U+10CD and U+10FD-U+10FF)
- Letters and signs for Old Sundanese (total 9 characters) were added to Sundanese. (U+1BAB-U+1BAD and U+1BBA-U+1BBF)
- Sign Rotated Ardhavisarga, Candra Above, Jihvamuliya and Uphadhmaniya (total 4 characters) were added to Vedic Extensions. (U+1CF3-U+1CF6)
- Mathematical diagonals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CB and U+27CD)
- A letter Bohairic Khei (total 2 characters) were added to Coptic. (U+2CF2-U+2CF3)
- Small letters Yn and Aen (total 2 characters) were added to Georgian Supplement. (U+2D27 and U+2D2D)
- Letters Ye and Yo (total 2 characters) were added to Tifinagh. (U+2D66-U+2D67)
- (total 10 characters) were added to Supplemental Punctuation. (U+2E32-U+2E3B)
- An additional ideograph for Kanji (total 1 character) was added to CJK Unified Ideographs. (U+9FCC)
- Combining letter for Slavonic (total 9 characters) were added to Cyrillic Extended-B. (U+A674-U+A67B and U+A69F)
- Letter C with Bar, capital letter H with Hook and modifier letters for extended IPA (total 5 characters) were added to Latin Extended-D. (U+A792-U+A793, U+A7AA and U+A7F8-U+A7F9)
- Some additional ideographs for Korea (total 2 characters) were added to CJK Compatibility Ideographs. (U+FA2E-U+FA2F)
- Symbols for Canadian legal use (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F16A-U+1F16B)
- Typikon symbols (total 4 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F540-U+1F543)
- (total 13 characters) were added to Emoticons. (U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634)
Unicode 6.2
Unicode 6.2 was released in September 26, 2012. It encoded 110,117 characters, adding only 1 new character.
Extended blocks
- A Turkish Lira sign (total 1 character) was added to Currency Symbols. (U+20BA)
Unicode 6.3
Unicode 6.3 was released in September 30, 2013. It encoded 110,122 characters, adding only 5 new characters.
Extended blocks
- A Letter mark (total 1 character) was added to Arabic. (U+061C)
- Isolate directional format characters (total 4 characters) were added to General Punctuation. (U+2066-U+2069)
Unicode 7.0
Unicode 7.0 was released in June 16, 2014. It encoded 112,956 characters, adding 2,834 new characters.
New blocks
- Combining Diacritical Marks Extended (U+1AB0-U+1AFF), containing 15 marks, was added.
- Myanmar Extended-B (U+A9E0-U+A9FF), containing 31 letters, was added.
- Latin Extended-E (U+AB30-U+AB6F), containing 50 letters, was added.
- Coptic Epact Numbers (U+102E0-U+102FF), containing 28 numbers, was added.
- Old Permic (U+10350-U+1037F), containing 43 letters, was added.
- Elbasan (U+10500-U+1052F), containing 50 letters, was added.
- Caucasian Albanian (U+10530-U+1056F), containing 53 letters and marks, was added.
- Linear A (U+10600-U+1077F), containing 341 signs, was added.
- Palmyrene (U+10860-U+1087F), containing 32 letters, was added.
- Nabataean (U+10880-U+108AF), containing 40 letters and numbers, was added.
- Old North Arabian (U+10A80-U+10A9F), containing 32 letters and numbers, was added.
- Manichaean (U+10AC0-U+10AFF), containing 51 characters, was added.
- Psalter Pahlavi (U+10B80-U+10BAF), containing 29 characters, was added.
- Mahajani (U+11150-U+1117F), containing 39 letters and signs, was added.
- Sinhala Archaic Numbers (U+111E0-U+111FF), containing 20 numbers, was added.
- Khojki (U+11200-U+1124F), containing 61 characters, was added.
- Khudawadi (U+112B0-U+112FF), containing 69 characters, was added.
- Grantha (U+11300-U+1137F), containing 83 characters, was added.
- Tirhuta (U+11480-U+114DF), containing 82 characters, was added.
- Siddham (U+11580-U+115FF), containing 72 characters, was added.
- Modi (U+11600-U+1165F), containing 79 characters, was added.
- Warang Citi (U+118A0-U+118FF), containing 84 letters and numbers, was added.
- Pau Cin Hau (U+11AC0-U+11AFF), containing 57 characters, was added.
- Mro (U+16A40-U+16A6F), containing 43 characters, was added.
- Bassa Vah (U+16AD0-U+16AFF), containing 36 characters, was added.
- Pahawh Hmong (U+16B00-U+16B8F), containing 127 letters and signs, was added.
- Duployan (U+1BC00-U+1BC9F), containing 143 characters, was added.
- Shorthand Format Controls (U+1BCA0-U+1BCAF), containing 4 format characters, was added.
- Mende Kikakui (U+1E800-U+1E8DF), containing 213 syllables and numbers, was added.
- Ornamental Dingbats (U+1F650-U+1F67F), containing 48 pictographic characters, was added.
- Geometric Shapes Extended (U+1F780-U+1F7FF), containing 85 pictographic characters, was added.
- Supplemental Arrows-C (U+1F800-U+1F8FF), containing 148 pictographic characters, was added.
Extended blocks
- A capital letter Yot (total 1 character) was added to Greek and Coptic. (U+037F)
- Letters for Orok, Komi and Khanty (total 8 characters) were added to Cyrillic Supplement. (U+0528-U+052F)
- An Eternity sign (total 2 characters) were added to Armenian. (U+058D-U+058E)
- A Number Mark Above (total 1 character) was added to Arabic. (U+0605)
- Letters for African, Philippine, Turkic, Berber, Belarusian, Palula and Shina languages (total 8 characters) were added to Arabic Extended-A. (U+08A1, U+08AD-U+08B2 and U+08FF)
- A letter for Marwari (total 1 character) was added to Devanagari. (U+0978)
- A sign Anji (total 1 character) was added to Bengali. (U+0980)
- Sign Candrabindu and letter Llla (total 2 characters) were added to Telugu. (U+0C00 and U+0C34)
- A Sign Candrabindu (total 1 character) was added to Kannada. (U+0C81)
- A Sign Candrabindu (total 1 character) was added to Malayalam. (U+0D01)
- Lith Numerals (total 10 characters) were added to Sinhala. (U+0DE6-U+0DEF)
- Additional Old English runes (total 8 characters) were added to Runic. (U+16F1-U+16F8)
- Letters Gyan and Tra (total 2 characters) were added to Limbu. (U+191D-U+191E)
- Signs for Jaiminiya Sama Veda (total 2 characters) were added to Vedic Extensions. (U+1CF8-U+1CF9)
- Marks for Germanic and American lexicology (total 15 characters) were added to Combining Diacritical Marks Supplement. (U+1DE7-U+1DF5)
- Nordic Mark, Manat and Ruble sign (total 3 characters) were added to Currency Symbols. (U+20BB-U+20BD)
- Playback symbols from Webdings font (total 7 characters) were added to Miscellaneous Technical. (U+23F4-U+23FA)
- A Scissors symbol from Wingdings 2 font (total 1 character) was added to Dingbats. (U+2700)
- Arrows for Lithuanian dialectology and symbols from Wingdings 3 font (total 115 characters) were added to Miscellaneous Symbols and Arrows. (U+2B4D-U+2B4F, U+2B5A-U+2B5F, U+2B60-U+2B73, U+2B76-U+2B95, U+2B98-U+2BB9, U+2BBD-U+2BC8 and U+2BCA-U+2BD1)
- (total 7 characters) were added to Supplemental Punctuation. (U+2E3C-U+2E42)
- Early Cyrillic letters and letters for Lithuanian dialectology (total 6 characters) were added to Cyrillic Extended-B. (U+A698-U+A69D)
- Letters for European, American and African orthography (total 18 characters) were added to Latin Extended-D. (U+A794-U+A79F, U+A7AB-U+A7AD, U+A7B0-U+A7B1 and U+A7F7)
- Tone marks for Tai Laing and letters for Shwe Palaung (total 4 characters) were added to Myanmar Extended-A. (U+AA7C-U+AA7F)
- Combining phonetic marks (total 7 characters) were added to Combining Half Marks. (U+FE27-U+FE2D)
- Additional mathematical symbols (total 2 characters) were added to Ancient Greek Numbers. (U+1018B-U+1018C)
- A Greek Tau Rho symbol (total 1 character) was added to Ancient Symbols. (U+101A0)
- A letter Ess (total 1 character) was added to Old Italic. (U+1031F)
- A Number Joiner (total 1 character) was added to Brahmi. (U+1107F)
- Sutra mark and sign Ekam (total 2 characters) were added to Sharada. (U+111CD and U+111DA)
- Additional cuneiform signs (total 42 characters) were added to Cuneiform. (U+1236F-U+12398)
- Additional numbers, vulgar fractions and a punctuation mark (total 13 characters) were added to Cuneiform Numbers and Punctuation. (U+12463-U+1246E and U+12474)
- Red Joker, Fool and trumps (total 23 characters) were added to Playing Cards. (U+1F0BF and U+1F0E0-U+1F0F5)
- Dingbat normal and negative sans-serif digit zero (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10B-U+1F10C)
- Symbols from Webdings, Wingdings 1 and 2 font (total 209 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F321-U+1F32C, U+1F336, U+1F37D, U+1F394-U+1F39F, U+1F3C5, U+1F3CB-U+1F3CE, U+1F3D4-U+1F3DF, U+1F3F1-U+1F3F7, U+1F43F, U+1F441, U+1F4F8, U+1F4FD-U+1F4FE, U+1F53E-U+1F53F, U+1F544-U+1F54A, U+1F568-U+1F579, U+1F57B-U+1F5A3 and U+1F5A5-U+1F5FA)
- Slightly frowning and smiling faces emoji (total 2 characters) were added to Emoticons. (U+1F641-U+1F642)
- Symbols from Webdings and Wingdings 2 font (total 27 characters) were added to Transport and Map Symbols. (U+1F6C6-U+1F6CF, U+1F6E0-U+1F6EC and U+1F6F0-U+1F6F3)
Unicode 8.0
Unicode 8.0 was released in June 17, 2015. It encoded 120,672 characters, adding 7,716 new characters.
New blocks
- Cherokee Supplement (U+AB70-U+ABBF), containing 80 lowercase letters, was added.
- Hatran (U+108E0-U+108FF), containing 26 letters, was added.
- Old Hungarian (U+10C80-U+10CFF), containing 108 letters, was added.
- Multani (U+11280-U+112AF), containing 38 letters, was added.
- Ahom (U+11700-U+1173F), containing 57 letters, was added.
- Early Dynastic Cuneiform (U+12480-U+1254F), containing 196 characters, was added.
- Anatolian Hieroglyphs (U+14400-U+1467F), containing 583 characters, was added.
- Sutton SignWriting (U+1D800-U+1DAAF), containing 672 signs, was added.
- Supplemental Symbols and Pictographs (U+1F900-U+1F9FF), containing 15 pictographic characters, was added.
- CJK Unified Ideographs Extension E (U+2B820-U+2CEAF), containing 5762 characters, was added.
Extended blocks
- Letters for Arwi (total 3 characters) were added to Arabic Extended-A. (U+08B3-U+08B4 and U+08E3)
- A letter for Avestan transliteration (total 1 character) was added to Gujarati. (U+0AF9)
- A letter for Andhra Pradesh (total 1 character) was added to Telugu. (U+0C5A)
- An archaic letter II (total 1 character) was added to Malayalam. (U+0D5F)
- A letter Mv and small letters (total 7 characters) were added to Cherokee. (U+13F5 and U+13F8-U+13FD)
- A Georgian Lari sign (total 1 character) was added to Currency Symbols. (U+20BE)
- Turned digits (total 2 characters) were added to Number Forms. (U+218A-U+218B)
- Two headed arrows with triangle arrowheads (total 4 characters) were added to Miscellaneous Symbols and Arrows. (U+2BEC-U+2BEF)
- Some additional ideographs (total 9 characters) were added to CJK Unified Ideographs. (U+9FCD-U+9FD5)
- A combining letter Ef (total 1 character) was added to Cyrillic Extended-B. (U+A69E)
- Sinological dot, phonetic extension for African languages, letters for American and Gabonese orthography (total 7 characters) were added to Latin Extended-D. (U+A78F and U+A7B2-U+A7B7)
- Sign Siddham and letter Jain Om (total 2 characters) were added to Devanagari Extended. (U+A8FC-U+A8FD)
- Letters for Yakut transliteration (total 4 characters) were added to Latin Extended-E. (U+AB60-U+AB63)
- A combining mark for Church Slavonic (total 2 characters) were added to Combining Half Marks. (U+FE2E-U+FE2F)
- Numerals and vulgar fractions (total 64 characters) were added to Meroitic Cursive. (U+109BC-U+109BD, U+109C0-U+109CF and U+109D2-U+109FF)
- Sandhi mark, diacritical marks for Kashmiri, sign Siddham and punctuation marks (total 9 characters) were added to Sharada. (U+111C9-U+111CC and U+111DB-U+111DF)
- Combining Anusvara Above and letter Om (total 2 characters) were added to Grantha. (U+11300 and U+11350)
- Section marks and alternate letters (total 20 characters) were added to Siddham. (U+115CA-U+115DD)
- An additional sign (total 1 character) was added to Cuneiform. (U+12399)
- East-Slavic musical symbols (total 11 characters) were added to Musical Symbols. (U+1D1DE-U+1D1E8)
- (total 24 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F32D-U+1F32F, U+1F37E-U+1F37F, U+1F3CF-U+1F3D3, U+1F3F8-U+1F3FF, U+1F4FF and U+1F54B-U+1F54F)
- Upside Down Face and Face With Rolling Eyes emoji (total 2 characters) were added to Emoticons. (U+1F643-U+1F644)
- A Place of Worship emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6D0)
Unicode 9.0
Unicode 9.0, was released in June 21, 2016. It encoded 128,172 characters, adding 7,500 new characters.
New blocks
- Cyrillic Extended-C (U+1C80-U+1C8F), containing 9 letters, was added.
- Osage (U+104B0-U+104FF), containing 72 letters, was added.
- Newa (U+11400-U+1147F), containing 92 letters, was added.
- Mongolian Supplement (U+11660-U+1167F), containing 13 letters, was added.
- Bhaiksuki (U+11C00-U+11C6F), containing 97 letters, was added.
- Marchen (U+11C70-U+11CBF), containing 68 letters, was added.
- Ideographic Symbols and Punctuation (U+16FE0-U+16FFF), containing 1 letter, was added.
- Tangut (U+17000-U+187FF), containing 6125 letters, was added.
- Tangut Components (U+18800-U+18AFF), containing 755 letters, was added.
- Glagolitic Supplement (U+1E000-U+1E02F), containing 38 letters, was added.
- Adlam (U+1E900-U+1E95F), containing 87 letters, was added.
Extended blocks
- Letters for Bravanese, Warsh and Quranic marks used in Pakistan (total 23 characters) were added to Arabic Extended-A. (U+08B6-U+08BD and U+08D4-U+08E2)
- A sign Spacing Candrabindu (total 1 character) were added to Kannada. (U+0C80)
- Sign Para, Chillu letters and vulgar fractions (total 14 characters) were added to Malayalam. (U+0D4F, U+0D54-U+0D56, U+0D58-U+0D5E and U+0D76-U+0D78)
- A diacritical mark for Newa (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFB)
- Power symbols (total 4 characters) were added to Miscellaneous Technical. (U+23FB-U+23FE)
- Punctuation marks for Church Slavonic (total 2 characters) were added to Supplemental Punctuation. (U+2E43-U+2E44)
- A letter for Unifon (total 1 character) was added to Latin Extended-D. (U+A7AE)
- A sign Candrabindu (total 1 character) was added to Saurashtra. (U+A8C5)
- Indiction sign and a currency symbol (total 2 characters) were added to Ancient Greek Numbers. (U+1018D-U+1018E)
- A sign Sukun (total 1 character) was added to Khojki. (U+1123E)
- Japanese TV symbols (total 18 characters) were added to Enclosed Alphanumeric Supplement. (U+1F19B-U+1F1AC)
- A Japanese TV symbol (total 1 character) was added to Enclosed Ideographic Supplement. (U+1F23B)
- A dancing man and Black Heart emoji (total 2 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F57A and U+1F5A4)
- Octagonal Sign, Shopping Trolley, scooters and a Canoe emoji (total 5 characters) were added to Transport and Map Symbols. (U+1F6D1-U+1F6D2 and U+1F6F4-U+1F6F6)
- (total 67 characters) were added to Supplemental Symbols and Pictographs. (U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991)
Variation Sequences
Here is a table with new standardized variation sequences:
Character Sequence | Context | Description of Variation Appearance |
---|---|---|
0030 FE00 | short diagonal stroke form # DIGIT ZERO | |
1000 FE00 | dotted form # MYANMAR LETTER KA | |
1002 FE00 | dotted form # MYANMAR LETTER GA | |
1004 FE00 | dotted form # MYANMAR LETTER NGA | |
1010 FE00 | dotted form # MYANMAR LETTER TA | |
1011 FE00 | dotted form # MYANMAR LETTER THA | |
1015 FE00 | dotted form # MYANMAR LETTER PA | |
1019 FE00 | dotted form # MYANMAR LETTER MA | |
101A FE00 | dotted form # MYANMAR LETTER YA | |
101C FE00 | dotted form # MYANMAR LETTER LA | |
101D FE00 | dotted form # MYANMAR LETTER WA | |
1022 FE00 | dotted form # MYANMAR LETTER SHAN A | |
1031 FE00 | dotted form # MYANMAR VOWEL SIGN E | |
1075 FE00 | dotted form # MYANMAR LETTER SHAN KA | |
1078 FE00 | dotted form # MYANMAR LETTER SHAN CA | |
107A FE00 | dotted form # MYANMAR LETTER SHAN NYA | |
1080 FE00 | dotted form # MYANMAR LETTER SHAN THA | |
2205 FE00 | zero with long diagonal stroke overlay form # EMPTY SET | |
AA60 FE00 | dotted form # MYANMAR LETTER KHAMTI GA | |
AA61 FE00 | dotted form # MYANMAR LETTER KHAMTI CA | |
AA62 FE00 | dotted form # MYANMAR LETTER KHAMTI CHA | |
AA63 FE00 | dotted form # MYANMAR LETTER KHAMTI JA | |
AA64 FE00 | dotted form # MYANMAR LETTER KHAMTI JHA | |
AA65 FE00 | dotted form # MYANMAR LETTER KHAMTI NYA | |
AA66 FE00 | dotted form # MYANMAR LETTER KHAMTI TTA | |
AA6B FE00 | dotted form # MYANMAR LETTER KHAMTI NA | |
AA6C FE00 | dotted form # MYANMAR LETTER KHAMTI SA | |
AA6F FE00 | dotted form # MYANMAR LETTER KHAMTI FA | |
AA7A FE00 | dotted form # MYANMAR LETTER AITON RA | |
… | 278 additional emoji variation sequences |
Unicode 10.0
Unicode 10.0, was released in June 20, 2017. It encoded 136,690 characters, adding 8,518 new characters.
New blocks
- Syriac Supplement (U+0860-U+086F), containing 11 characters, was added.
- Zanabazar Square (U+11A00-U+11A4F), containing 72 characters, was added.
- Soyombo (U+11A50-U+11AAF), containing 80 characters, was added.
- Masaram Gondi (U+11D00-U+11D5F), containing 75 characters, was added.
- Kana Extended-A (U+1B100-U+1B12F), containing 31 characters, was added.
- Nushu (U+1B170-U+1B2FF), containing 396 characters, was added.
- CJK Unified Ideographs Extension F (U+2CEB0-U+2EBEF), containing 7,473 characters, was added.
Extended blocks
- A Vedic Anusvara and Abbreviation mark (total 2 characters) were added to Bengali. (U+09FC-U+09FD)
- Letters for Arabic transliteration (total 6 characters) were added to Gujarati. (U+0AFA-U+0AFF)
- A combining Anusvara Above and Viramas (total 3 characters) were added to Malayalam. (U+0D00 and U+0D3B-U+0D3C)
- A sign Atikrama (total 1 character) was added to Vedic Extensions. (U+1CF7)
- Combining diacritical marks for Church Slavonic (total 4 characters) were added to Combining Diacritical Marks Supplement. (U+1DF6-U+1DF9)
- A Bitcoin sign (total 1 character) was added to Currency Symbols. (U+20BF)
- An Observe Eye symbol (total 1 character) was added to Miscellaneous Technical. (U+23FF)
- A Group mark (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2BD2)
- Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E45-U+2E49)
- A letter O with Dot Above (total 1 character) was added to Bopomofo. (U+312E)
- Ideographs for Slavonic transliteration (total 21 characters) were added to CJK Unified Ideographs. (U+9FD6-U+9FEA)
- Letters for North Italic (total 3 characters) were added to Old Italic. (U+1032D-U+1032F)
- An Iteration mark for Nushu (total 1 character) was added to Ideographic Symbols and Punctuation. (U+16FE1)
- Letters for Hentaigana (total 254 characters) were added to Kana Supplement. (U+1B002-U+1B0FF)
- Symbols for Chinese Folk religion (total 6 characters) were added to Enclosed Ideographic Supplement. (U+1F260-U+1F265)
- Stupa, Pagoda, Sled and Flying Saucer emoji (total 4 characters) were added to Transport and Map Symbols. (U+1F6D3-U+1F6D4 and U+1F6F7-U+1F6F8)
- (total 66 characters) were added to Supplemental Symbols and Pictographs. (U+1F900-U+1F90B, U+1F91F, U+1F928-U+1F92F, U+1F931-U+1F932, U+1F94C, U+1F95F-U+1F96B, U+1F992-U+1F997 and U+1F9D0-U+1F9E6)
Unicode 11.0
Unicode 11.0, was released in June 5, 2018. It encoded 137,374 characters, adding 684 new characters.
New blocks
- Georgian Extended (U+1C90-U+1CBF), containing 46 characters, was added.
- Hanifi Rohingya (U+10D00-U+10D3F), containing 50 characters, was added.
- Old Sogdian (U+10F00-U+10F2F), containing 40 characters, was added.
- Sogdian (U+10F30-U+10F6F), containing 42 characters, was added.
- Dogra (U+11800-U+1184F), containing 60 characters, was added.
- Gunjala Gondi (U+11D60-U+11DAF), containing 63 characters, was added.
- Makasar (U+11EE0-U+11EFF), containing 25 characters, was added.
- Medefaidrin (U+16E40-U+16E9F), containing 91 characters, was added.
- Mayan Numerals (U+1D2E0-U+1D2FF), containing 20 characters, was added.
- Indic Siyaq Numbers (U+1EC70-U+1ECBF), containing 68 characters, was added.
- Chess Symbols (U+1FA00-U+1FA6F), containing 14 characters, was added.
Extended blocks
- Small letters Turned Ayb and Yi with Stroke (total 2 characters) were added to Armenian. (U+0560 and U+0588)
- A triangle Yod (total 1 character) were added to Hebrew. (U+05EF)
- A Dantayalan and currency symbols (total 3 characters) were added to N'Ko. (U+07FD-U+07FF)
- A Small Low Waw (total 1 character) was added to Arabic Extended-A. (U+08D3)
- A Sandhi mark (total 1 character) was added to Bengali. (U+09FE)
- An Abbreviation mark (total 1 character) was added to Gurmukhi. (U+0A76)
- A combining Anusvara Above (total 1 character) was added to Telugu. (U+0C04)
- A sign Siddham (total 1 character) was added to Kannada. (U+0C84)
- A letter for Buryat (total 1 character) was added to Mongolian. (U+1878)
- Symbols for chess notation, astrological and half star symbols (total 43 characters) were added to Miscellaneous Symbols and Arrows. (U+2BBA-U+2BBC, U+2BD3-U+2BEB and 2BF0-U+2BFE)
- Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E4A-U+2E4E)
- A letter NN (total 1 character) was added to Bopomofo. (U+312F)
- Some ideographs for Kanji (total 5 characters) were added to CJK Unified Ideographs. (U+9FEB-U+9FEF)
- A small capital Q and a letter for Mazahua (total 3 characters) were added to Latin Extended-D. (U+A7AF and U+A7B8-U+A7B9)
- Letter and vowel sign Ay (total 2 characters) were added to Devanagari Extended. (U+A8FE-U+A8FF)
- Letters Ttta, Vha and a vulgar fraction (total 3 characters) were added to Kharoshthi. (U+10A34-U+10A35 and U+10A48)
- A Number Sign Above (total 1 character) was added to Kaithi. (U+110CD)
- Letter Lhaa, vowel sign Aa and Ei (total 3 characters) were added to Chakma. (U+11144-U+11146)
- A combining Bindu Below (total 1 character) was added to Grantha. (U+1133B)
- A Sandhi mark (total 1 character) was added to Newa. (U+1145E)
- An alternate letter Ba (total 1 character) was added to Ahom. (U+1171A)
- A mark Pluta (total 1 character) was added to Soyombo. (U+11A9D)
- Additional ideographs (total 5 characters) were added to Tangut. (U+187ED-U+187F1)
- Tally marks (total 7 characters) were added to Counting Rod Numerals. (U+1D372-U+1D378)
- A Copyleft symbol (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F12F)
- A Skateboard emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6F9)
- Normal and negative circled shapes (total 4 characters) were added to Geometric Shapes Extended. (U+1F7D5-U+1F7D8)
- (total 65 characters) were added to Supplemental Symbols and Pictographs. (U+1F94D-U+1F94F, U+1F96C-U+1F970, U+1F973-U+1F976, U+1F97A, U+1F97C-U+1F97F, U+1F998-U+1F99F, U+1F9A0-U+1F9A2, U+1F9B0-U+1F9B9, U+1F9C1-U+1F9C2 and U+1F9E7-U+1F9FF)
Variation Sequences
Here is a table with new standardized variation sequences:
Character Sequence | Context | Description of Variation Appearance |
---|---|---|
FF10 FE00 | short diagonal stroke form # FULLWIDTH DIGIT ZERO |
Unicode 12.0
Unicode 12.0 was released on March 5, 2019. It encoded 137,928 characters, adding 554 new characters.
New blocks
- Elymaic (U+10FE0-U+10FFF), containing 23 characters, was added.
- Nandinagari (U+119A0-U+119FF), containing 65 characters, was added.
- Tamil Supplement (U+11FC0-U+11FFF), containing 51 characters, was added.
- Egyptian Hieroglyph Format Controls (U+13430-U+1343F), containing 9 characters, was added.
- Small Kana Extension (U+1B130-U+1B16F), containing 7 characters, was added.
- Nyiakeng Puachue Hmong (U+1E100-U+1E14F), containing 71 characters, was added.
- Wancho (U+1E2C0-U+1E2FF), containing 59 characters, was added.
- Ottoman Siyaq Numbers (U+1ED00-U+1ED4F), containing 61 characters, was added.
- Symbols and Pictographs Extended-A (U+1FA70-U+1FAFF), containing 16 characters, was added.
Extended blocks
- A sign Siddham (total 1 character) was added to Telugu. (U+0C77)
- Letters for Pail and Sanskrit (total 15 characters) were added to Lao. (U+0E86, U+0E89, U+0E8C, U+0E8E-U+0E93, U+0E98, U+0EA0, U+0EA8-U+0EA9, U+0EAC and U+0EBA)
- A sign Double Anusvara Antargomukha (total 1 character) was added to Vedic Extensions. (U+1CFA)
- An astrological symbol and Hellschreiber Pause symbol (total 2 characters) were added to Miscellaneous Symbols and Arrows. (U+2BC9 and U+2BFF)
- A Cornish Verse Divider (total 1 character) was added to Supplemental Punctuation. (U+2E4F)
- Egyptological letters, Anglicana W and letters for early Pinyin (total 11 characters) were added to Latin Extended-D. (U+A7BA-U+A7BF and U+A7C2-U+A7C6)
- Sinological phonetic letters (total 2 characters) were added to Latin Extended-E. (U+AB66-U+AB67)
- A Vedic Anusvara (total 1 character) was added to Newa. (U+1145F)
- An archaic letter Kha (total 1 character) was added to Takri. (U+116B8)
- Sign Jihvamuliya and Uphadhmaniya (total 2 characters) were added to Soyombo. (U+11A84-U+11A85)
- Letters for various Yi and Miao languages (total 16 characters) were added to Miao. (U+16F45-U+16F4A, U+16F4F and U+16F7F-U+16F87)
- Marks for Ancient Chinese texts (total 2 characters) were added to Ideographic Symbols and Punctuation. (U+16FE2-U+16FE3)
- Some additional ideographs (total 6 characters) were added to Tangut. (U+187F2-U+187F7)
- A Nasalization mark (total 1 character) was added to Adlam. (U+1E94B)
- A Spanish and Portuguese register mark (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F16C)
- Hindu Temple and Auto Rickshaw emoji (total 2 characters) were added to Transport and Map Symbols. (U+1F6D5 and U+1F6FA)
- Large colored circles and boxes (total 12 characters) were added to Geometric Shapes Extended. (U+1F7E0-U+1F7EB)
- (total 31 characters) were added to Supplemental Symbols and Pictographs. (U+1F90D-U+1F90F, U+1F93F, U+1F971, U+1F97B, U+1F9A5-U+1F9AA, U+1F9AE-U+1F9AF, U+1F9BA-U+1F9BF, U+1F9C3-U+1F9CA and U+1F9CD-U+1F9CF)
- Heterodox chess symbols (total 84 characters) were added to Chess Symbols. (U+1FA00-U+1FA53)
Glyph Changes
Here is a table with glyph changes:
Block Name | Code Points | Count |
---|---|---|
Spacing Modifier Letters | 02EA, 02EB | 2 |
Vedic Extensions | 1CF2..1CF3 | 2 |
Currency Symbols | 20A9 | 1 |
CJK Symbols and Punctuation | 3001, 3002 | 2 |
Bopomofo | 3105..312F | 43 |
Bopomofo Extended | 31A0..31BA | 27 |
CJK Unified Ideographs Extension A | 37C3, 3B9D, 3CFD, 3FE0, 44EC, 4A76 | 6 |
CJK Unified Ideographs | 5344, 55B9, 6ABC, 6FF9, 809E, 80BC, 80E9, 8132, 8159, 841C, 891D, 8C6C, 915E, 9FD4 | 14 |
Phags-pa | A840..A877 | 56 |
Halfwidth and Fullwidth Forms | FF01, FF0C, FF0E, FF1A, FF1B, FF1F | 6 |
CJK Unified Ideographs Extension B | 200DD, 20164, 20BBF, 20C02, 20CED, 21D4C, 2278B, 23AB8, 2459B, 24A7D, 24FB9, 25ED7, 2677C, 26B4C, 26C21, 26CBE, 26E3D, 28834, 289A1, 289C0, 28A0F, 28B46 | 22 |
CJK Unified Ideographs Extension C | 2A8FB, 2A917, 2AA30 | 3 |
CJK Unified Ideographs Extension E | 2BA52, 2BD77, 2C494, 2C72F, 2C734, 2CB38 | 6 |
CJK Unified Ideographs Extension F | 2D23B, 2E83A | 2 |
Total | 192 |
Variation Sequences
Here is a table with new standardized variation sequences:
Character Sequence | Context | Description of Variation Appearance |
---|---|---|
3001 FE00 | corner-justified form # IDEOGRAPHIC COMMA | |
3001 FE01 | centered form # IDEOGRAPHIC COMMA | |
3002 FE00 | corner-justified form # IDEOGRAPHIC FULL STOP | |
3002 FE01 | centered form # IDEOGRAPHIC FULL STOP | |
FF01 FE00 | corner-justified form # FULLWIDTH EXCLAMATION MARK | |
FF01 FE01 | centered form # FULLWIDTH EXCLAMATION MARK | |
FF0C FE00 | corner-justified form # FULLWIDTH COMMA | |
FF0C FE01 | centered form # FULLWIDTH COMMA | |
FF0E FE00 | corner-justified form # FULLWIDTH FULL STOP | |
FF0E FE01 | centered form # FULLWIDTH FULL STOP | |
FF1A FE00 | corner-justified form # FULLWIDTH COLON | |
FF1A FE01 | centered form # FULLWIDTH COLON | |
FF1B FE00 | corner-justified form # FULLWIDTH SEMICOLON | |
FF1B FE01 | centered form # FULLWIDTH SEMICOLON | |
FF1F FE00 | corner-justified form # FULLWIDTH QUESTION MARK | |
FF1F FE01 | centered form # FULLWIDTH QUESTION MARK |
Unicode 12.1
Unicode 12.1 was released on May 7, 2019. It encoded 137,929 characters, adding only 1 new character.
Extended blocks
- A square era name Reiwa (total 1 character) was added to Enclosed CJK Letters and Months. (U+32FF)
Unicode 13.0
Unicode 13.0 was released on March 10, 2020. It encoded 143,859 characters, adding 5,930 new characters.
New blocks
- Yezidi (U+10E80-U+10EBF), containing 47 characters, was added.
- Chorasmian (U+10FB0-U+10FDF), containing 28 characters, was added.
- Dives Akuru (U+11900-U+1195F), containing 72 characters, was added.
- Lisu Supplement (U+11FB0-U+11FBF), containing 1 character, was added.
- Khitan Small Script (U+18B00-U+18CFF), containing 470 characters, was added.
- Tangut Supplement (U+18D00-U+18D08), containing 9 characters, was added.
- Symbols for Legacy Computing (U+1FB00-U+1FBFF), containing 212 characters, was added.
- CJK Unified Ideographs Extension G (U+30000-U+3134F), containing 4939 characters, was added.
Extended blocks
- Letters for African languages and Punjabi (total 10 characters) were added to Arabic Extended-A. (U+08BE-U+08C7)
- A sign Overline (total 1 character) was added to Oriya. (U+0B55)
- A Vedic Anusvara (total 1 character) was added to Malayalam. (U+0D04)
- A sign Candrabindu (total 1 character) was added to Sinhala. (U+0D81)
- Combining diacritical marks for Scottish phonology (total 2 characters) were added to Combining Diacritical Marks Extended. (U+1ABF-U+1AC0)
- A Japanese symbol for Type A Electronics (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2B97)
- Cross patties and a Tironian sign Capita Et (total 3 characters) were added to Supplemental Punctuation. (U+2E50-U+2E52)
- Letters for Taiwan and Cantonese language (total 5 characters) were added to Bopomofo Extended. (U+31BB-U+31BF)
- Some disunified ideographs (total 10 characters) were added to CJK Unified Ideographs Extension A. (U+4DB6-4DBF)
- Some ideographs for China (total 13 characters) were added to CJK Unified Ideographs. (U+9FF0-U+9FFC)
- Letters for Gaulish (total 6 characters) were added to Latin Extended-D. (U+A7C7-U+A7CA and U+A7F5-U+A7F6)
- An alternate sign Nasanta (total 1 character) was added to Syloti Nagri. (U+A82C)
- Letter R With Midle Tilde and modifier letters for Scottish phonology (total 4 characters) were added to Latin Extended-E. (U+AB68-U+AB6B)
- A symbol Ascia (total 1 character) was added to Ancient Symbols. (U+1019C)
- A letter for Pali (total 1 character) was added to Chakma. (U+11147)
- A vowel sign Prishthamatra E and Inverted Candrabindu (total 2 characters) were added to Sharada. (U+111CE and U+111CF)
- Double comma, sign Jihvamuliya and Uphadhmaniya (total 3 characters) were added to Newa. (U+1145A and U+11460-U+11461)
- Khitan Small Script Filler and reading marks for Vietnamese (total 3 characters) were added to Ideographic Symbols and Punctuation. (U+16FE4 and U+16FF0-U+16FF1)
- Some additional components (total 13 characters) were added to Tangut Components. (U+18AF3-U+18AFF)
- Creative Commons license symbols and Mask Work symbol (total 7 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10D-U+1F10F, U+1F16D-1F16F and U+1F1AD)
- Hut, Elevator, Pickup Truck and Roller Skate emoji (total 4 characters) were added to Transportation and Map Symbols. (U+1F6D6-U+1F6D7 and U+1F6FB-U+1F6FC)
- Arrows for legacy computing (total 2 characters) were added to Supplemental Arrows-C. (U+1F8B0-U+1F8B1)
- (total 10 characters) were added to Supplemental Symbols and Pictographs. (U+1F90C, U+1F972, U+1F977-U+1F978, U+1F9A3-U+1F9A4, U+1F9AB-U+1F9AD and U+1F9CB)
- (total 41 characters) were added to Symbols and Pictographs Extended-A. (U+1FA74, U+1FA83-U+1FA86, U+1FA96-U+1FAA8, U+1FAB0-U+1FAB6, U+1FAC0-U+1FAC2 and U+1FAD0-U+1FAD6)
- Gongche charaters for Kunqu Opera (total 7 characters) were added to CJK Unified Ideographs Extension B. (U+2A6D7-U+2A6DD)
Glyph Changes
Here is a table with glyph changes:
Block Name | Code Points | Count |
---|---|---|
Tagalog | 1700..170C, 170E..1714 | 20 |
Mongolian | 1834, 1871, 1878 | 3 |
Sundanese | 1BAB | 1 |
Currency Symbols | 20BF | 1 |
CJK Radicals Supplement | 2E80..2E99, 2E9B..2EF3 | 115 |
Kangxi Radicals | 2F00..2FD5 | 214 |
CJK Unified Ideographs Extension A | 3472, 38C7, 3DB8, 3FE0, 440B, 46E9 | 6 |
CJK Unified Ideographs | 53FD, 6146, 6711, 671C, 6721, 6725, 6BD2, 7B9A, 87CE, 8956, 93BF, 9B97 | 12 |
Latin Extended-D | A764..A765 | 2 |
Phags-pa | A86D | 1 |
Tangut | 175F6, 17F0D, 17F8A, 17FA5, 180D6, 18139, 18147, 184F1, 18736 | 9 |
Tangut Components | 18843, 18856, 1888C, 1890A, 18915, 1893B | 6 |
Adlam | 1E900..1E94A, 1E950..1E959, 1E95E..1E95F | 71 |
Miscellaneous Symbols and Pictographs | 1F3B1 | 1 |
Supplemental Symbols and Pictographs | 1F995..1F998, 1F99B..1F99E, 1F9B0..1F9B3, 1F9E7 | 13 |
CJK Unified Ideographs Extension B | 20219, 21249, 21827, 22C3A, 2327B, 23496, 2355E, 2363B, 236ED, 23839, 23FD5, 24261, 24726, 248F2, 2548E, 26657, 26C9E, 26FE1, 27334, 27C0E, 27CEF, 2A38C | 22 |
CJK Unified Ideographs Extension C | 2AED5, 2AEF3, 2AF76, 2B09F, 2B1C3, 2B1E5 | 6 |
CJK Unified Ideographs Extension E | 2B83C, 2B8D9..2B8DA, 2B96F, 2BBD7, 2BD61, 2BE4A, 2BF1D, 2BF9D, 2C0B8, 2C142, 2C176, 2C316, 2C3FB, 2C402, 2C7AC, 2C82C, 2C83A, 2C9A1, 2CC88, 2CD68 | 21 |
CJK Unified Ideographs Extension F | 2DC09, 2DE4A, 2EB7E, 2EB89 | 4 |
CJK Compatibility Ideographs Supplement | 2F83B, 2F878, 2F8D6..2F8D7, 2F8DA, 2F8F0, 2F984, 2FA02 | 8 |
Total | 536 |
Unicode 14.0
Unicode 14.0 was released on September 14, 2021. It encoded 144,697 characters, adding 838 new characters.
New blocks
- Arabic Extended-B (U+0870-U+089F), containing 41 characters, was added.
- Vithkuqi (U+10570-U+105BF), containing 70 characters, was added.
- Latin Extended-F (U+10780-U+107BF), containing 57 characters, was added.
- Old Uyghur (U+10F70-U+10FAF), containing 26 characters, was added.
- Unified Canadian Aboriginal Syllabics Extended-A (U+11AB0-U+11ABF), containing 16 characters, was added.
- Cypro-Minoan (U+12F90-U+12FFF), containing 99 characters, was added.
- Tangsa (U+16A70-U+16ACF), containing 89 characters, was added.
- Kana Extended-B (U+1AFF0-U+1AFFF), containing 13 characters, was added.
- Znamenny Musical Notation (U+1CF00-U+1CFFF), containing 185 characters, was added.
- Latin Extended-G (U+1DF00-U+1DFFF), containing 31 characters, was added.
- Toto (U+1E290-U+1E2BF), containing 31 characters, was added.
- Ethiopic Extended-B (U+1E7E0-U+1E7FF), containing 28 characters, was added.
Extended blocks
- An End of Text punctuation mark (total 1 character) was added to Arabic. (U+061D)
- Letters for Balti and Quranic orthography (total 12 characters) were added to Arabic Extended-A. (U+08B5 and U+08C8-U+08D2)
- A sign Nukta and letter Nakaara Pollu (total 2 characters) were added to Telugu. (U+0C3C and U+0C5D)
- A letter Nakaara Pollu (total 1 character) was added to Kannada. (U+0CDD)
- A letter Ra, sign Pamudpod and archaic letter Ra (total 3 characters) were added to Tagalog. (U+170D, U+1715 and U+171F)
- A fourth Free variation selector (total 1 character) was added to Mongolian. (U+180F)
- Combining diacritical marks for extended IPA (total 14 characters) were added to Combining Diacritical Marks Extended. (U+1AC1-U+1ACE)
- An archaic ligature Jnya and punctuation marks (total 3 characters) were added to Balinese. (U+1B4C and U+1B7D-U+1B7E)
- A combining Dot Below Left (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFA)
- A Kyrgyz Som sign (total 1 character) was added to Currency Symbols. (U+20C0)
- A letter Caudate Chrivi (total 2 characters) were added to Glagolitic. (U+2C2F and U+2C5F)
- Medieval and phonetic punctuation marks (total 11 characters) were added to Supplemental Punctuation. (U+2E53-U+2E5D)
- Some ideographs for Macao (total 3 characters) were added to CJK Unified Ideographs. (U+9FFD-U+9FFF)
- Archaic European letters, modifier letters for Sokuon and Chatino orthography (total 13 characters) were added to Latin Extended-D. (U+A7C0-U+A7C1, U+A7D0-U+A7D1, U+A7D3, U+A7D5, U+A7D6-U+A7D9 and U+A7F2-U+A7F4)
- A modifier letter Wasla Above and honorifics (total 20 characters) were added to Arabic Presentation Forms-A. (U+FBC2, U+FD40-U+FD4F, U+FDCF and U+FDFE-U+FDFF)
- Letters for Old Tamil (total 6 characters) were added to Brahmi. (U+11070-U+11075)
- A vowel sign Vocalic R (total 1 character) was added to Khaiti. (U+110C2)
- An Abbreviation sign (total 1 character) was added to Takri. (U+116B9)
- Letters for Tai Ahom (total 7 characters) were added to Ahom. (U+11740-U+11746) The block was expanded from (U+11700-U+1173F) to (U+11700-U+1174F)
- Kana archaic letters (total 4 characters) were added to Kana Extended-A. (U+1B11F-U+1B122)
- Accidental symbols for Iranian classical music (total 2 characters) were added to Musical Symbols. (U+1D1E9-U+1D1EA)
- Playground Slide, Wheel and Ring Buoy emoji (total 3 characters) were added to Transportation and Map Symbols. (U+1F6DD-U+1F6DF)
- A Heavy Equals Sign emoji (total 1 character) was added to Geometric Shapes Extended. (U+1F7F0)
- A Troll and Face Holding Back Tears emoji (total 2 characters) were added to Supplemental Symbols and Pictographs. (U+1F979 and U+1F9CC)
- (total 31 characters) were added to Symbols and Pictographs Extended-A. (U+1FA7B-U+1FA7C, U+1FAA9-U+1FAAC, U+1FAB7-U+1FABA, U+1FAC3-U+1FAC5, U+1FAD7-U+1FAD9, U+1FAE0-U+1FAE7 and U+1FAF0-U+1FAF6)
- Some ideographs for Macao (total 2 characters) were added to CJK Unified Ideographs Extension B. (U+2A6DE-U+2A6DF)
- Disunified ideographs and a G source ideograph for China, Hong Kong and Vietnam (total 4 characters) were added to CJK Unified Ideographs Extension C. (U+2B735-U+2B738)
Glyph Changes
Here is a table with glyph changes:
Block Name | Code Points | Count |
---|---|---|
Latin Extended-B | 0184..0185 | 2 |
Arabic | 0674..0678, 06C5, 06C7, 06FE | 8 |
Letterlike Symbols | 210B, 2110, 2112, 211B, 212C, 2130..2131, 2133 | 8 |
Enclosed Alphanumerics | 2460..24FF | 160 |
Dingbats | 2776..2793 | 30 |
CJK Symbols and Punctuation | 3001..3029, 3030..303D, 303F | 56 |
CJK Strokes | 31C0..31E3 | 36 |
Katakana Phonetic Extensions | 31F0..31FF | 16 |
Enclosed CJK Letters and Months | 3200..321E, 3220..32FF | 255 |
CJK Compatibiity | 3300..33FF | 256 |
CJK Unified Ideographs Extension A | 3777, 3B3F | 2 |
CJK Unified Ideographs | 5DD5, 652C, 6AC0 | 3 |
Arabic Presentation Forms-A | FBD7..FBD8, FBDD, FBE0..FBE1 | 5 |
Vertical Forms | FE10..FE19 | 10 |
CJK Compatibiity Forms | FE30..FE4F | 32 |
Small Form Variants | FE50..FE52, FE54..FE66, FE68..FE6B | 26 |
Halfwidth and Fullwidth Forms | FF01..FF9F, FFA1..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6, FFE8..FFEE | 225 |
Egyptian Hieroglyphs | 1300A, 13017, 1302D, 13032, 13034..13035, 13037..13038, 1303A..1303E, 1304E..1304F, 13055, 13057, 13068, 1309A, 130D2, 130D5, 130F6, 130FE, 13192, 1325F, 13267, 1326A, 13281, 13297, 1329E, 132B4, 132C1, 132E6, 13304, 1331F, 13378..1337B, 1337D..1337E, 133F3, 133FA..13403, 1340D, 13417, 1342B | 55 |
Mathematical Alphanumeric Symbols | 1D49C, 1D49E..1D49F, 1D4A2, 1D4A5..1D4A6, 1D4A9..1D4AC, 1D4AE..1D4B5 | 18 |
Enclosed Alphanumeric Supplement | 1F100..1F1AD, 1F1E6..1F1FF | 200 |
Enclosed Ideographic Supplement | 1F200..1F202, 1F210..1F23B, 1F240..1F248, 1F250..1F251, 1F260..1F265 | 64 |
Supplemental Symbols and Pictographs | 1F930 | 1 |
CJK Unified Ideographs Extension B | 22ADC, 230F2, 25B27, 26F28 | 4 |
Total | 1472 |
Variation Sequences
Here is a table with new standardized variation sequences:
Character Sequence | Context | Description of Variation Appearance |
---|---|---|
1D49C FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL A | |
212C FE00 | chancery style # SCRIPT CAPITAL B | |
1D49E FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL C | |
1D49F FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL D | |
2130 FE00 | chancery style # SCRIPT CAPITAL E | |
2131 FE00 | chancery style # SCRIPT CAPITAL F | |
1D4A2 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL G | |
210B FE00 | chancery style # SCRIPT CAPITAL H | |
2110 FE00 | chancery style # SCRIPT CAPITAL I | |
1D4A5 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL J | |
1D4A6 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL K | |
2112 FE00 | chancery style # SCRIPT CAPITAL L | |
2133 FE00 | chancery style # SCRIPT CAPITAL M | |
1D4A9 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL N | |
1D4AA FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL O | |
1D4AB FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL P | |
1D4AC FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL Q | |
211B FE00 | chancery style # SCRIPT CAPITAL R | |
1D4AE FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL S | |
1D4AF FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL T | |
1D4B0 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL U | |
1D4B1 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL V | |
1D4B2 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL W | |
1D4B3 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL X | |
1D4B4 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL Y | |
1D4B5 FE00 | chancery style # MATHEMATICAL SCRIPT CAPITAL Z | |
1D49C FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL A | |
212C FE01 | roundhand style # SCRIPT CAPITAL B | |
1D49E FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL C | |
1D49F FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL D | |
2130 FE01 | roundhand style # SCRIPT CAPITAL E | |
2131 FE01 | roundhand style # SCRIPT CAPITAL F | |
1D4A2 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL G | |
210B FE01 | roundhand style # SCRIPT CAPITAL H | |
2110 FE01 | roundhand style # SCRIPT CAPITAL I | |
1D4A5 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL J | |
1D4A6 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL K | |
2112 FE01 | roundhand style # SCRIPT CAPITAL L | |
2133 FE01 | roundhand style # SCRIPT CAPITAL M | |
1D4A9 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL N | |
1D4AA FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL O | |
1D4AB FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL P | |
1D4AC FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL Q | |
211B FE01 | roundhand style # SCRIPT CAPITAL R | |
1D4AE FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL S | |
1D4AF FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL T | |
1D4B0 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL U | |
1D4B1 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL V | |
1D4B2 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL W | |
1D4B3 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL X | |
1D4B4 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL Y | |
1D4B5 FE01 | roundhand style # MATHEMATICAL SCRIPT CAPITAL Z |
Named Sequences
Here is a table with new named character sequences:
Character Sequence | Name |
---|---|
0915 093C | DEVANAGARI SEQUENCE FOR LETTER QA |
0916 093C | DEVANAGARI SEQUENCE FOR LETTER KHHA |
0917 093C | DEVANAGARI SEQUENCE FOR LETTER GHHA |
091C 093C | DEVANAGARI SEQUENCE FOR LETTER ZA |
0921 093C | DEVANAGARI SEQUENCE FOR LETTER DDDHA |
0922 093C | DEVANAGARI SEQUENCE FOR LETTER RHA |
092B 093C | DEVANAGARI SEQUENCE FOR LETTER FA |
092F 093C | DEVANAGARI SEQUENCE FOR LETTER YYA |
09A1 09BC | BENGALI SEQUENCE FOR LETTER RRA |
09A2 09BC | BENGALI SEQUENCE FOR LETTER RHA |
09AF 09BC | BENGALI SEQUENCE FOR LETTER YYA |
0A32 0A3C | GURMUKHI SEQUENCE FOR LETTER LLA |
0A38 0A3C | GURMUKHI SEQUENCE FOR LETTER SHA |
0A16 0A3C | GURMUKHI SEQUENCE FOR LETTER KHHA |
0A17 0A3C | GURMUKHI SEQUENCE FOR LETTER GHHA |
0A1C 0A3C | GURMUKHI SEQUENCE FOR LETTER ZA |
0A2B 0A3C | GURMUKHI SEQUENCE FOR LETTER FA |
0B21 0B3C | ORIYA SEQUENCE FOR LETTER RRA |
0B22 0B3C | ORIYA SEQUENCE FOR LETTER RHA |
Unicode 15.0
Unicode 15.0 was released on September 13, 2022. It encoded 149,186 characters, adding 4,489 new characters.
New blocks
- Arabic Extended-C (U+10EC0-U+10EFF), containing 3 characters, was added.
- Devanagari Extended-A (U+11B00-U+11B5F), containing 10 characters, was added.
- Kawi (U+11F00-U+11F5F), containing 86 characters, was added.
- Kaktovik Numerals (U+1D2C0-U+1D2DF), containing 20 characters, was added.
- Cyrillic Extended-D (U+1E030-U+1E08F), containing 63 characters, was added.
- Nag Mundari (U+1E4D0-U+1E4FF), containing 42 characters, was added.
- CJK Unified Ideographs Extension H (U+31350-U+323AF), containing 4192 characters, was added.
Extended blocks
- A Yamakkan (total 1 character) was added to Lao. (U+0ECE)
- A combining Anusvara Above Right (total 1 character) was added to Kannada. (U+0CF3)
- Letters Qa, Short I and Vocalic R (total 3 characters) were added to Khojki. (U+1123F-U+11241)
- An additional hieroglyph to Group V (total 1 character) was added to Egyptian Hieroglyphs
- Extended format controls (total 29 characters) were added to Egyptian Hieroglyph Format Controls. (U+13439-U+13455). The block was expanded from (U+13430-U+1343F) to (U+13430-U+1345F)
- Hiragana and Katakana Small Ko (total 2 characters) were added to Small Kana Extension. (U+1B132 and U+1B155)
- Letters for Malayalam transliteration (total 6 characters) were added to Latin Extended-G. (U+1DF25-U+1DF2A)
- A Wireless emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6DC)
- A Nine Pointed White Star (total 1 character) was be added to Geometric Shapes Extended. (U+1F7D9)
- A Lot of Fortune, eclipse symbols and symbols for dwarf planets (total 6 characters) were added to Alchemical symbols. (U+1F774-U+1F776 and U+1F77B-U+1F77F)
- (total 20 characters) were added to Symbols and Pictographs Extended-A. (U+1FA75-U+1FA77, U+1FA87-U+1FA88, U+1FAAD-U+1FAAF, U+1FABB-U+1FABF, U+1FACE-U+1FACF, U+1FADA-U+1FADB, U+1FAE8 and U+1FAF7-U+1FAF8)
- A disunified ideograph for Macao (total 1 character) was added to CJK Unified Ideographs Extension C. (U+2B739)
Glyph Changes
Here is a table with glyph changes:
Block Name | Code Points | Count |
---|---|---|
IPA Extensions | 025E, 029A | 2 |
United Canadian Aboriginal Syllabics | 144B, 14D1, 1506, 15C0..15C3, 15E8..15EE, 1601, 1604..1607, 160A..160D, 1614..162D, 1630..163F, 1646..1647, 165A | 66 |
United Canadian Aboriginal Syllabics Extended | 18DB, 18EC, 18F1..18F2, 18F5 | 5 |
Sundanese | 1BBF | 1 |
Optical Character Recognition | 2447 | 1 |
CJK Unified Ideographs Extension A | 34DC, 3BF6, 3C43, 48B4, 4DBE | 5 |
CJK Unified Ideographs | 585F, 5F50, 6BC0, 7BC9, 833E | 5 |
Cyrillic Extended-B | A66E | 1 |
Old Turkic | 10C47 | 1 |
Egyptian Hieroglyphs | various (new standardized variation sequences) | 94 |
Khitan Small Script | 18CCA | 1 |
Wancho (font update) | 1E2C0..1E2F9, 1E2FF | 59 |
Alchemical Symbols (font update) | 1F700..1F773 | 116 |
CJK Unified Ideographs Extension B | 20048, 20A1C, 2143F, 21A5F, 21C08, 21FBA, 22ACF, 23392, 238A7, 23D8F, 23F4E, 25D20, 26E30, 27B48, 27C4F, 28633, 28B02, 28E9A, 29760, 2A60F | 20 |
CJK Unified Ideographs Extension C | 2B249 | 1 |
CJK Unified Ideographs Extension E | 2BB37, 2BD7D, 2C151, 2C1E0, 2C2D6, 2C5CA, 2C810, 2CD34 | 8 |
CJK Unified Ideographs Extension F | 2CF4E, 2D25D, 2D3EC, 2D6A7, 2D7BA, 2D979, 2DA74, 2DA97, 2DC13, 2DDC0, 2DF10, 2DF78, 2E05A, 2E0AE, 2E516, 2E640, 2E680, 2EA63 | 18 |
CJK Compatibility Ideographs Supplement | 2F804, 2F805, 2F833, 2F835, 2F84C, 2F84F, 2F852, 2F855, 2F887, 2F88B, 2F899, 2F8A0, 2F8A6, 2F8A7, 2F8AD, 2F8B1, 2F8B4, 2F8B7, 2F8BA, 2F8D0, 2F8E0..2F8E2, 2F8E5, 2F8E6, 2F8FE, 2F900, 2F901, 2F907, 2F912, 2F922, 2F926, 2F936, 2F938, 2F94E, 2F959, 2F95F, 2F96C, 2F99F, 2F9B8, 2F9BA, 2F9D3, 2F9DB, 2F9DC, 2F9E8, 2F9EA, 2F9EE, 2FA00, 2FA0D, 2FA1B | 50 |
CJK Unified Ideographs Extension G | 302FC, 30723, 30A6D, 30CF7, 30DBF, 31006, 3105D | 7 |
Total | 461 |
Variation Sequences
Here is a table with new standardized variation sequences:
Character Sequence | Context | Description of Variation Appearance |
---|---|---|
13091 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH D027 | |
13092 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH D027A | |
13093 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH D028 | |
130A9 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH D047 | |
1310F FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH F016 | |
13117 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH F023 | |
1311C FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH F028 | |
13121 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH F032 | |
13127 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH F037A | |
13139 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH F051 | |
13139 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH F051 | |
13183 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH H005 | |
13187 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH H008 | |
131A0 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH K006 | |
131A0 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH K006 | |
131B1 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH M003 | |
131B1 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH M003 | |
131B8 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH M009 | |
131B9 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH M010 | |
131BA FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH M010A | |
131CB FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH M017 | |
131EE FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH M044 | |
131EE FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH M044 | |
131F8 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH N010 | |
131F9 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH N011 | |
131F9 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH N011 | |
131FA FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH N012 | |
131FA FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH N012 | |
13216 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH N035 | |
13257 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH O006 | |
1327B FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH O029 | |
1327F FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH O031 | |
1327F FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH O031 | |
13285 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH O036 | |
1328C FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH O039 | |
132A4 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH P008 | |
132A4 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH P008 | |
132AA FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH Q003 | |
132CB FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH R024 | |
132DC FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH S010 | |
132E7 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH S018 | |
132E7 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S018 | |
132E9 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S020 | |
132F8 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S033 | |
132FD FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S037 | |
13302 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S042 | |
13303 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH S043 | |
13307 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH T001 | |
13308 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T002 | |
13310 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T008 | |
13311 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T008A | |
13312 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T009 | |
13312 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T009 | |
13313 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T009A | |
13313 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T009A | |
13314 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T010 | |
13314 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T010 | |
1331B FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH T016 | |
1331B FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T016 | |
1331C FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T016A | |
13321 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T021 | |
13321 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T021 | |
13322 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH T022 | |
13322 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T022 | |
13331 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH T035 | |
13331 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH T035 | |
1333B FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH U007 | |
1333C FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH U008 | |
1334A FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH U022 | |
13361 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH U042 | |
13373 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH V007A | |
13377 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH V010 | |
13378 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH V011 | |
1337D FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH V012A | |
13385 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH V019 | |
13399 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH V026 | |
1339A FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH V027 | |
133AF FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH W001 | |
133B0 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH W002 | |
133BF FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH W014 | |
133D3 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH X004A | |
133DD FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH Y002 | |
133F2 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH Z007 | |
133F5 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH Z010 | |
133F6 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH Z011 | |
13403 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH Z015I | |
13416 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH AA008 | |
13419 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH AA011 | |
13419 FE01 | rotated 180 degrees # EGYPTIAN HIEROGLYPH AA011 | |
13419 FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH AA011 | |
1341A FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH AA012 | |
13423 FE00 | rotated 90 degrees # EGYPTIAN HIEROGLYPH AA021 | |
1342C FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH AA030 | |
1342E FE02 | rotated 270 degrees # EGYPTIAN HIEROGLYPH AA032 | |
13443 FE00 | expanded # EGYPTIAN HIEROGLYPH LOST SIGN | |
13444 FE00 | expanded # EGYPTIAN HIEROGLYPH HALF LOST SIGN | |
13445 FE00 | expanded # EGYPTIAN HIEROGLYPH TALL LOST SIGN | |
13446 FE00 | expanded # EGYPTIAN HIEROGLYPH WIDE LOST SIGN |
Unicode 15.1
Unicode 15.1 was released on September 12th, 2023. It encoded 149,813 characters, adding 627 new characters.
New Blocks
- CJK Unified Ideographs Extension I (U+2EBF0-U+2EE5F), containing 622 characters, was added.
Extended Blocks
- 4 Ideographic characters was added to Ideographic Description Characters. (U+2FFC-U+2FFF)
- An Ideographic subraction (total 1 character) was added to CJK Strokes. (U+31EF)
Glyph Changes
Here is a table with glyph changes:
Block Name | Code Points | Count |
---|---|---|
CJK Unified Ideographs Extension A | 357E, 358B..358E, 3599..359D, 35AF..35B0, 35B2..35B3, 35DF..35E1, 35EF, 360F, 3612, 3F94, 44D5, 48EE | 5 |
CJK Unified Ideographs | 5098, 512D, 517A, 5391, 54DB, 551C, 551F, 55B8, 55ED, 56AB, 591E, 594A, 5B2E, 5DFC..5DFD, 5EE4, 609E, 65B0, 65B3, 65D5, 65F2, 67B2, 6AB6, 6AEC, 6C69, 6FC2, 6FD3, 7019, 7361, 74BD, 7934, 820B, 826E, 83BB, 8412, 8456, 848A, 896F, 8E34, 8FD7, 9166, 9855, 985E, 9C4D | 5 |
Latin Extended-D | A798 | 1 |
Latin Extended-E | AB5A | 1 |
Tangut | 17105, 172A4, 17BD1..17BD3, 17EF9, 18136 | 59 |
Alchemical Symbols | 1F741, 1F747, 1F74C, 1F74F, 1F756, 1F758, 1F763, 1F768, 1F76D, 1F76E | 116 |
CJK Unified Ideographs Extension B | 20302, 2087A, 20C00, 230B7, 2339E, 236EF, 237C3, 23B87, 23CC0, 23CD9, 23E5E, 2486F, 249D6, 249E8, 24D6A, 2585E, 25D89, 26A5A..26A5B, 26A73, 26A82..26A83, 26A90, 26AA6, 26AA8, 26AD8, 27350, 279F8, 284A3, 28BBA, 29516, 29530 | 20 |
CJK Unified Ideographs Extension C | 2A741, 2AB63, 2ACD8, 2AF6F, 2B173, 2B490 | 1 |
CJK Unified Ideographs Extension E | 2BC2E, 2BF45, 2C04C, 2C13A, 2C43C, 2C43E, 2C816 | 8 |
CJK Unified Ideographs Extension F | 2D1CC..2D1CD, 2D1DD, 2D1E4, 2D1F7, 2D203, 2D256, 2D266, 2D2A2, 2D2AC, 2D2DA | 18 |
CJK Unified Ideographs Extension G | 301D4, 301D9, 301E4, 301E8, 301FF..30200, 30205, 3020C, 30211, 30215..30217, 30220, 30234..30235, 30237 | 7 |
CJK Unified Ideographs Extension H | 314B7, 31542, 31569, 31C7F, 31D5A, 31F68 | 7 |
Total | 164 |
Unicode 16.0
Unicode 16.0 was released on September 10th, 2024. It encoded 154,998 characters, adding 5185 new characters.
New Blocks
- Todhri (U+105C0-U+105FF), containing 52 characters, was added.
- Garay (U+10D40-U+10D8F), containing 69 characters, was added.
- Tulu-Tigalari (U+11380-U+113FF), containing 80 characters, was added.
- Myanmar Extended-C (U+116D0-U+116FF), containing 20 characters, was added.
- Sunuwar (U+11BC0-U+11BFF), containing 44 characters, was added.
- Egyptian Hieroglyphs Extended-A (U+13460-U+143FF), containing 3995 characters, was added.
- Gurung Khema (U+16100-U+1613F), containing 58 characters, was added.
- Kirat Rai (U+16D40-U+16D7F), containing 58 characters, was added.
- Symbols for Legacy Computing Supplement (U+1CC00-U+1CEBF), containing 686 characters, was added.
- Ol Onal (U+1E5D0-U+1E5FF), containing 44 characters, was added.
Extended Blocks
- A combining diacritical mark for Jawi (total 1 character) was added to Arabic Extended-B. (U+0897)
- Inverted letters and a punctuation mark (total 3 characters) was added to Balinese. (U+1B4E-U+1B4F and U+1B7F)
- A letter Tje (total 2 characters) was added to Cyrillic Extended-C. (U+1C89-U+1C8A)
- Legacy computing symbols for Delete (total 3 characters) was added to Control Pictures. (U+2427-U+2429)
- CJK strokes Hxg and Szp (total 2 characters) was added to CJK Strokes. (U+31E4-U+31E5)
- A capital Rams Horn, an S with Diagonal Stroke, Lamda Letters, and letters for Wakashan and Salishan Languages (total 6 characters) was added to Latin Extended-D. (U+A7CB-U+A7CD, U+A7DA-U+A7DC)
- A combining Alef overlay and letters with two dots vertically below (total 4 characters) was added to Arabic Extended-C. (U+10EC2-U+10EC4 and U+10EFC)
- A sign Nukta (total 1 character) was added to Kawi. (U+11F5A)
- A blank character (total 1 character) was added to Khitan Small Script. (U+18CFF)
- A rightwards arrow with hook, and arrows for legacy computing and arrows for Egyptology (total 12 characters) was added to Supplemental Arrows-C. (U+1F8B2-U+1F8BB, U+1F8C0-U+1F8C1)
- A Harp, Shovel, Leafless Tree, Fingerprint, Root Vegetable, Splatter, and Face with Bags Under Eyes (total 7 characters) was added to Symbols and Pictographs Extended-A. (U+1FA89, U+1FA8F, U+1FABE, U+1FAC6, U+1FADC, U+1FADF, and U+1FAE9)
- Graphic shapes for legacy computing (total 37 characters) was added to Symbols for Legacy Computing. (U+1FBCB-U+1FBEF)
Unicode 17.0
Unicode 17.0 will be released ca. September 2026.
New Blocks
- CJK Unified Ideographs Extension J (U+323B0-U+3347F), containing 4300 characters will be added.
Extended Blocks
- An additional ideograph (total 1 character) will be added to CJK Unified Ideographs Extension C. (U+2B73A)
Code Points Provisionally Assigned and Roadmap Blocks
This is a section where you can add any upcoming Unicode characters that have been provisionally assigned for mature proposals (but not yet accepted) for a future update of The Unicode Standard and also a section where present proportional maps of a proposed allocations to Unicode and ISO/IEC 10646. Italic indicates scripts for which detailed proposals have not yet been written.[1]
New Blocks
- Northern Palaeohispanic (U+10200-U+1023F)
- Southern Palaeohispanic (U+10240-U+1027F)
- Shavian Quikscript (U+103E0-U+103FF)
- Proto-Sinaitic (U+108B0-U+108DF)
- Sidetic (U+10940-U+1095F), containing 29 characters will be added.
- Numidian (U+10960-U+1097F)
- Balti-A (U+10AA0-U+10ABF)
- Book Pahlavi (U+10BB0-U+10BDF)
- Baburi (U+10BE0-U+10BFF)
- Arabic Extended-D (U+10D90-U+10E5F)
- Landa (U+11250-U+1127F)
- Tani Lipi (U+114E0-U+114FF)
- Ranjana (U+11500-U+1157F)
- Zou (U+11750-U+117AF)
- Pyu (U+117B0-U+117FF)
- Sirmauri (U+11850-U+1188F)
- Vateluttu (U+11960-U+1199F)
- Sharada Supplement (U+11B60-U+11B7F), containing 8 characters will be added.
- Leke (U+11B80-U+11BBF)
- Balti-B (U+11CC0-U+11CFF)
- Tolong Siki (U+11DB0-U+11DEF), containing 54 characters will be added.
- Tocharian (U+11E00-U+11E6F)
- Khotanese (U+11E70-U+11ECF)
- Pallava (U+11F60-U+11FAF)
- Archaic Cuneiform Numerals (U+12550-U+1268F), containing 311 characters will be added.
- Proto-Cuneiform (U+12690-U+12EFF), containing 1905 characters will be added.
- Egyptian Hieroglyphs Extended-B (U+14680-U+151FF)
- Mayan Hieroglyphs (U+15500-U+15AFF)
- Mandombe (U+15B80-U+15FFF)
- Cirth (U+16000-U+1607F)
- Tengwar (U+16080-U+160FF)
- Kurux Banna (U+16140-U+1618F)
- Moon (U+161A0-U+161FF)
- Blissymbols (U+16200-U+167FF)
- Woleai (U+16B90-U+16BFF)
- Kpelle (U+16C00-U+16C7F)
- Afaka (U+16C80-U+16CCF)
- Khimhun Tangsa (U+16CD0-U+16CFF)
- Tikamuli (U+16D00-U+16D3F)
- Chisoi (U+16D80-U+16DAF), containing 40 characters will be added.
- Kulitan (U+16DD0-U+16DFF)
- Mwangwego (U+16E00-U+16E3F)
- Beria Erfe (U+16EA0-U+16EDF), containing 50 characters will be added.
- Bopomofo Extended-A (U+16FA0-U+16FAF)
- Kanbun Extended-A (U+16FB0-U+16FDF)
- Tangut Components Supplement (U+18D80-U+18DFF), containing 115 characters will be added.
- Jurchen (U+18E00-U+1919F), containing 914 characters will be added.
- Jurchen Radicals (U+191A0-U+191DF), containing 51 characters will be added.
- Khitan Large Script (U+19200-U+199FF)
- Pau Cin Hau Syllabary (U+19E00-U+1A2FF)
- Eskaya (U+1A300-U+1A75F)
- Rejang Supplement (U+1A760-U+1A77F)
- Kaida (U+1A780-U+1A7FF)
- Naxi Dongba (U+1A800-U+1ACFF)
- Naxi Geba (U+1AD00-U+1AFCF)
- Kana Extended-C (U+1AFD0-U+1AFEF)
- Shuishu Logograms (U+1B300-U+1B5FF)
- Lisu Syllabic Script (U+1B600-U+1B9FF)
- Indus (U+1BA00-U+1BB8F)
- Pitman Shorthands (U+1BCB0-U+1BCFF)
- Proto-Elamite (U+1BD00-U+1C37F)
- Linear-Elamite (U+1C380-U+1C4FF)
- Oromo (Sheek Bakrii Saphaloo) (U+1C800-U+1CB2F)
- Miscellaneous Symbols Supplement (U+1CEC0-U+1CEFF), containing 34 characters will be added.
- Musical Symbols Supplement (U+1D250-U+1D28F), containing 11 characters will be added.
- Old Chinese Musical Symbols (Flute and Pipa) (U+1D290-U+1D2BF)
- Mathematical Alphanumeric Symbols Supplement (U+1D380-U+1D3FF)
- Jianzi Format Controls (U+1DAE0-U+1DAFF)
- Jianzi Musical Symbols (U+1DB00-U+1DC8F)
- Eebee Hmong (U+1E150-U+1E1FF)
- Western Cham (U+1E200-U+1E26F)
- Loma (U+1E300-U+1E41F)
- Bagam (U+1E420-U+1E4CF)
- Pungchen (U+1E500-U+1E52F)
- Pungchung (U+1E530-U+1E55F)
- Marchung (U+1E560-U+1E59F)
- Brusha (U+1E5A0-U+1E5CF)
- Chola (U+1E600-U+1E65F)
- Chalukya Box-Headed (U+1E660-U+1E6BF)
- Tai Yo (U+1E6C0-U+1E6FF), containing 55 characters will be added.
- Lampung (U+1E700-U+1E73F)
- Kerinci (U+1E740-U+1E76F)
- Buginese Supplement (U+1E770-U+1E7BF)
- Lontara Bilang-Bilang (U+1E7C0-U+1E7DF)
- Byblos (U+1EB90-U+1EBFF)
- Persian Siyaq Numbers (U+1EC00-U+1EC7F)
- Diwani Siyaq Numbers (U+1ECC0-U+1ECFF)
- Arabic Supplemental Symbols (U+1EF00-U+1EF3F)
- Miscellaneous Symbols and Mathematical (U+1FC00-U+1FFFD), containing 991 characters, was added.
- Seal Script (U+38000-U+3AB9F)
Extended Blocks
- Modifier letters Eh, Ini, and Yi (total 3 characters) will be added to Armenian. (U+0558, U+058B, U+058C)
- A Noon with Ring Above (total 1 character) will be added to Arabic Extended-B. (U+088F)
- Bengali sign combining Anusvara above and an alternate letter Ba (total 2 character) will be added to Bengali. (U+0984, U+09FF)
- Signs for dot above and double dot above (total 2 characters) will be added to Oriya. (U+0B53, U+0B54)
- An archaic ligature Shrii (total 1 character) will be added to Telugu. (U+0C5C)
- An archaic ligature Shrii (total 1 character) will be added to Kannada. (U+0CDC)
- Mongolian Letter Manchu Alternative Ue (total 1 character) will be added to Mongolian. (U+1879)
- Compound tone, Harrington, and alternate positioned IPA diacritics (total 27 characters) will be added to Combining Diacritical Marks Extended. (U+1ACF-U+1AEB)
- Equal Sign with Infinity Above (total 1 character) will be added to Miscellaneous Symbols and Arrows. (U+2B96)
- 2 capital letters for Middle English, Latin pharyngeal voiced fricative, and Modifier Letter Capital S (total 5 characters) will be added to Latin Extended-D. (U+A7CE-U+A7CF, U+A7D2, U+A7D4, U+A7F1)
- Arabic Ligature Rahmatu Allaahi Alayh and Arabic Honorifics (total 25 characters) will be added to Arabic Presentation Forms-A. (U+FBC3-U+FBD2, U+FD90, U+FD91, U+FDC8-U+FDCE)
- Latin modifier letters for clicks (total 5 characters) will be added to Latin Extended-F. (U+107BB-U+107BF)
- A Small Yeh Barree with Two Dots Below, Thin Noon, Biblical End of Verse, Yeh with Four Dots Below, Quranic Characters, Biblical End of Verse, Honorifics, Crown Letters, moew Quranic Characters, Crown, Double Vertical Bar Below, and Small Low Noon (total 54 characters) will be added to Arabic Extended-C. (U+10EC5-U+10EC7, U+10EC9-U+10EEE, U+10EF0-U+10EFB)
- Chinese Simplified and Traditional Er and Yangqin Slow Signs Two, Three, and Four (total 5 characters) will be added to Ideographic Symbols and Punctuation. (U+16FF2-U+16FF6)
- Some additional ideographs (total 8 characters) will be added to Tangut. (U+187F8-U+187FF)
- Additional ideographs (total 20 characters) will be added to Tangut Supplement. (U+18D09-U+18D1C)
- Hiragana Digraph Koto, Katakana Diagraphs Toki and Tote (total 3 characters) will be added to Kana Extended-A. (U+1B123-U+1B125)
- Stein Zimmerman Symbols, Digit Slash Symbols, and other Symbols (total 23 characters) will be added to Musical Symbols. (U+1D127-U+1D128, U+1D1EB-U+1D1F6, U+1D1F7-U+1D1FF)
- Nine symbols (total 9 characters) will be added to Symbols for Legacy Computing Supplement. (U+1CCFA-U+1CCFC, U+1CEBA-1CEBF)
- Affricate ligatures, letters with palatal hook, barred letters, and modifier letters, will be added to Latin Extended-G. (U+1DF1F-U+1DF24, U+1DF2B-U+1DF3F, U+1DFD8-U+1DFFF)
- Historical asteroid symbols (total 4 characters) will be added to Alchemical Symbols. (U+1F777-U+1F77A)
- Chemical symbols (total 9 characters) will be added to Supplemental Arrows-C. (U+1F8D0-U+1F8D8)
- White and Black Chess Ferz and Alfil (total 4 characters) will be added to Chess Symbols. (U+1FA54-U+1FA57)
- An alarm bell symbol (total 1 character) will be added to Symbols for Legacy Computing. (U+1FBFA)