Overview
Quivira 3.2 contains 7,535 characters from various Unicode blocks. Most of them are expected to work correctly everywhere, but there are some issues with characters that have codepoints higher than 65,535 and combining characters.
The meaning, names and usage of characters are defined in the Unicode Standard. Characters you cannot type directly can be inserted using a character map, e.g. the one coming with Microsoft Windows (at “Start” → “Programs” → “Accessories” → “System Programs” → “Character Map”. Also, office programs usually have a dialogue for inserting them. However, these dialogues do not always show all characters, even if the programs can display all of them. If you’re using Microsoft Windows, I recommend the BabelMap which not only shows all characters, but also their names and properties.
Supported Unicode blocks
The following Unicode blocks are supported:
- Basic Latin (00000 – 0007F)
- Complete (95 characters)
- Latin-1 Supplement (00080 – 000FF)
- Complete (96 characters)
- Latin Extended-A (00100 – 0017F)
- Complete (128 characters)
- Latin Extended-B (00180 – 0024F)
- Complete (208 characters)
- IPA Extensions (00250 – 002AF)
- Complete (96 characters)
- Spacing Modifier Letters (002B0 – 002FF)
- Complete (80 characters)
- Combining Diacritical Marks (00300 – 0036F)
- Complete (111 characters)
- All characters are combining and may not be placed correctly.
- Greek and Coptic (00370 – 003FF)
- Complete (134 characters)
- Cyrillic (00400 – 004FF)
- Complete (256 characters)
- Contains 7 combining characters which may not be placed correctly.
- Cyrillic Supplement (00500 – 00520)
- Complete (38 characters)
- Armenian (00530 – 0058F)
- Complete (86 characters)
- Hebrew (00590 – 005FF)
- Complete (87 characters)
- Contains 51 combining characters which may not be placed correctly.
- Thai (00E00 – 00E7F)
- Complete (87 characters)
- Contains 16 combining characters which may not be placed correctly.
- Georgian (010A0 – 010FF)
- Complete (83 characters)
- Cherokee (013A0 – 013FF)
- Complete (85 characters)
- Ogham (01680 – 0169F)
- Complete (29 characters)
- Runic (016A0 – 016FF)
- Complete (81 characters)
- Tagalog (01700 – 0171F)
- Complete (20 characters)
- Contains 3 combining characters which may not be placed correctly.
- Hanunoo (01720 – 0173F)
- Complete (23 characters)
- Contains 3 combining characters which may not be placed correctly.
- Buhid (01740 – 0175F)
- Complete (20 characters)
- Contains 2 combining characters which may not be placed correctly.
- Tagbanwa (01760 – 0177F)
- Complete (18 characters)
- Contains 2 combining characters which may not be placed correctly.
- Phonetic Extensions (01D00 – 01D7F)
- Complete (128 characters)
- Phonetic Extensions Supplement (01D80 – 01DBF)
- Complete (64 characters)
- Latin Extended Additional (01E00 – 01EFF)
- Complete (256 characters)
- Greek Extended (01F00 – 01FFF)
- Complete (233 characters)
- General Punctuation (02000 – 0206F)
- Complete (85 characters)
- Superscripts and Subscripts (02070 – 0209F)
- Complete (34 characters)
- Currency Symbols (020A0 – 020CF)
- Complete (25 characters)
- Letterlike Symbols (02100 – 0214F)
- Complete (80 characters)
- Number Forms (02150 – 0218F)
- Complete (58 characters)
- Arrows (02190 – 021FF)
- Complete (112 characters)
- Mathematical Operators (02200 – 022FF)
- Complete (256 characters)
- Miscellanous Technical (02300 – 023FF)
- Complete (233 characters)
- Control Pictures (02400 – 0243F)
- Complete (39 characters)
- Optical Character Recognition (02440 – 0245F)
- Complete (11 characters)
- Enclosed Alphanumerics (02460 – 024FF)
- Complete (160 characters)
- Box Drawing (02500 – 0257F)
- Complete (128 characters)
- Block Elements (02580 – 0259F)
- Complete (32 characters)
- Geometric Shapes (025A0 – 025FF)
- Complete (96 characters)
- Miscellanous Symbols (02600 – 026FF)
- 147 characters (out of 250)
- Dingbats (02700 – 027BF)
- 117 characters (out of 175)
- Miscellanous Mathematical Symbols-A (027C0 – 027EF)
- Complete (44 characters)
- Supplemental Arrows-A (027F0 – 027FF)
- Complete (16 characters)
- Braille Patterns (02800 – 028FF)
- Complete (256 characters)
- Supplemental Arrows-B (02900 – 0297F)
- Complete (128 characters)
- Miscellanous Mathematical Symbols-B (02980 – 029FF)
- Complete (128 characters)
- Supplemental Mathematical Operators (02A00 – 02AFF)
- Complete (256 characters)
- Miscellanous Symbols and Arrows (02B00 – 02BFF)
- Complete (87 characters )
- Latin Extended-C (02C60 – 02C7F)
- Complete (32 characters)
- Coptic (02C80 – 02CFF)
- Complete (121 characters)
- Contains 3 combining characters which may not be placed correctly.
- Georgian Supplement (02D00 – 02D2F)
- Complete (38 characters)
- Tifinagh (02D30 – 02D7F)
- Complete (55 characters)
- Cyrillic Extended-A (02DE0 – 02DFF)
- Complete (32 characters)
- All characters are combining and may not be placed correctly.
- Ideographic Description Characters (02FF0 – 02FFF)
- Complete (12 characters)
- Enclosed CJK Letters and Months (03200 – 032FF)
- 30 characters (out of 254)
- Contains only the circled numbers from 21 to 50
- Yijing Hexagram Symbols (04DC0 – 04DFF)
- Complete (64 characters)
- Lisu (0A4D0 – 0A4FF)
- Complete (48 characters)
- Cyrillic Extended-B (0A640 – 0A69F)
- Complete (78 characters)
- Contains 6 combining characters which may not be placed correctly.
- Modifier Tone Letters (0A700 – 0A71F)
- Complete (32 characters)
- Latin Extended-D (0A720 – 0A7FF)
- Complete (114 characters)
- Private Use Area (0E000 – 0F8FF)
- 449 characters
- These characters are not defined in the Unicode Standard, see below for their meanings and usage in Quivira.
- Alphabetic Presentation Forms (0FB00 – 0FB4F)
- Complete (58 characters)
- Halfwidth and Fullwidth Forms (0FF00 – 0FFEF)
- 110 characters (out of 225)
- These are fixed width variants of the Basic Latin characters, meant for use with East-Asian ideographs.¹
- Specials (0FFF0 – 0FFFF)
- Complete (2 characters)
- Ancient Greek Numbers (10140 – 1018F)
- Complete (75 characters)
- Ancient Symbols (10190 – 101CF)
- Complete (12 characters)
- Lycian (10280 – 1029F)
- Complete (29 characters)
- Carian (102A0 – 102DF)
- Complete (49 characters)
- Old Italic (10300 – 1032F)
- Complete (35 characters)
- Gothic (10330 – 1034F)
- Complete (27 characters)
- Lydian (10920 – 1093F)
- Complete (27 characters)
- Musical Symbols (1D100 – 1D1FF)
- 83 characters (out of 220)
- Ancient Greek Musical Notation (1D200 – 1D24F)
- Complete (70 characters)
- Tai Xuan Jing Symbols (1D300 – 1D35F)
- Complete (87 characters)
- Mathematical Alphanumeric Symbols (1D400 – 1D7FF)
- 596 characters (out of 996)
- Domino Tiles (1F030 – 1F09F)
- Complete (100 characters)
¹ As Quivira does not support ideographs, the Halfwidth and Fullwidth Forms characters are quite useless. They are only included because they are used internally to compose the Control Characters.
Private Use Area
Quivira 3.2 contains 449 characters in the Unicode block “Private Use Area”. They have generic names like “PRIVATE USE CHARACTER-E000” (the last part being their codepoints in hexadecimal notation) and the General Category “Co [Other, Private Use]”.
In Quivira, the Private Use Area is divided into self-invented blocks, and the characters are given names much like the standardised characters. However, these blocks and names are own inventions and only used in the documentation of Quivira to give hints what these characters are meant for. They are not approved by any standard (like e.g. Unicode) and will not be used or displayed by any program (the Unicode names mentioned above will be used instead).
For the same reason, they do not have any defined properties, so rendering programs do not know whether they are letters, symbols or whatever else. Conversion between upper- and lowercase letters, small caps and so on will not work. Spell checking software will not recognize them and treat them as symbols. If you change the font, they will not be displayed correctly any more, because the other font may use the same codepoints for totally different characters (or may not use them at all).
In the Unicode standard, an adopted character will never be changed nor removed in future versions of Unicode. However, Quivira’s Private Use Area is not as stable: If a Private Use character is adopted into Unicode, it is moved from its private use codepoint to its new standardised codepoint. For this reason, the Private Use Area only contains characters which are unlikely to be approved by the Unicode Consortium.
See the PDF file List of Private Use characters for detailed descriptions.
Future Additions
The development of Quivira is still in progress, but as I do it in my free time, new versions occur very irregularly. I cannot predict when the next version will be ready for release.
Also, I decide about the characters to include very spontaneously, i.e. there is no roadmap for future versions. If you need a special character or script, you can contact me and I will try to include it in the next possible version (this may still take some months, though).
Please note that there are some kinds of characters that will not be included, even if requested. These are:
- Characters that contradict the Unicode standard, e.g. slightly different glyphs for existing characters. If you like variation, use different fonts, but don’t confuse your spell checking software with unnecessary private use characters.
- Characters that are in the Unicode Pipeline Table (Proposed New Characters). I cannot assign them to the Code Point they will probably get in the Unicode Standard, because it may change before final adoption. I also don’t like to assign them to private use codepoints, because then I have to re-assign them in future version, thus either having duplicates or destroying existing documents.
- Characters that are technically impossible. In a TrueType font, glyphs cannot be coloured, animated or protrude outside the line.
- Characters that belong to unsupported scripts. E.g. it wouldn’t make sense to include a rare Chinese ideograph without also having the common ones.
In future, Quivira is meant to support as many scripts and languages as possible, but there are limits. The first limit is the mere number of defined characters which cannot be reached by a single font. For this reason, Quivira does not support the Han ideographs (their number is greater than that of all other scripts together).
The second limit are technical issues: For some characters I could include some nice glyphs, but a text containing them would not be rendered correctly. This issue generally affects combining marks and complex scripts like e.g. most Indian and Southeast Asian scripts. It also affects letters which change their forms depending on surrounding letters, like e.g. in Arabic (this is the reason why Arabic isn’t supported yet, despite its importance).
These issues may be solved by converting Quivira into an OpenType font. But in all my attempts to do so, various problems arose which made some of the existing characters unusable. I’m confident I will find a good solution one day, but at the moment it’s not in sight.

Deutsch
Social Bookmarks