Following is the data you can use for testing. You can copy/paste the characters into a UTF-8 editor to insert into any of your test files.
The following information is provided:
The characters display properly on Windows 7 with either IE 8 or Firefox 3.6. I have the Chinese language pack installed, as well as a number of other fonts, so my system is not representative. If you have problems displaying the supplementary characters, let me know and I will provide additional information about fonts needed, etc.
Top of pageIn the Unicode Character Standard, Supplementary Characters are the characters assigned code points from U+10000 to U+10FFFF. In other words, these are the Unicode characters greater than U+FFFF.
There are a number of characters in the Supplementary Planes that are frequently used in Asian markets. Therefore the Supplementary Planes are required to be supported. To identify useful test characters, I selected a set from IICORE.
In 2005, the IRG (Ideographic Rapporteur Group) identified a set ideographs, called the Ideographic International Core (IICore). The 10,000 ideographs in the IICore are the most frequently used characters that would cover the vast majority of modern texts in all locales where ideographs are used. This collection is intended for use in devices with limited resources, such as mobile phones.
To have characters that are good for testing software support for the Supplementary Plane, I extracted the 62 characters from the IICORE that are in the Supplementary Plane. These characters have the properties that:
It is important to test software with characters from the Supplementary Plane.
Testing with supplementary characters can detect if there is code that does not provide the necessary support.
Top of pageUnicode Scalar Value | UTF-8 | NCR |
---|---|---|
U+2070E | 𠜎 | 𠜎 |
U+20731 | 𠜱 | 𠜱 |
U+20779 | 𠝹 | 𠝹 |
U+20C53 | 𠱓 | 𠱓 |
U+20C78 | 𠱸 | 𠱸 |
U+20C96 | 𠲖 | 𠲖 |
U+20CCF | 𠳏 | 𠳏 |
U+20CD5 | 𠳕 | 𠳕 |
U+20D15 | 𠴕 | 𠴕 |
U+20D7C | 𠵼 | 𠵼 |
U+20D7F | 𠵿 | 𠵿 |
U+20E0E | 𠸎 | 𠸎 |
U+20E0F | 𠸏 | 𠸏 |
U+20E77 | 𠹷 | 𠹷 |
U+20E9D | 𠺝 | 𠺝 |
U+20EA2 | 𠺢 | 𠺢 |
U+20ED7 | 𠻗 | 𠻗 |
U+20EF9 | 𠻹 | 𠻹 |
U+20EFA | 𠻺 | 𠻺 |
U+20F2D | 𠼭 | 𠼭 |
U+20F2E | 𠼮 | 𠼮 |
U+20F4C | 𠽌 | 𠽌 |
U+20FB4 | 𠾴 | 𠾴 |
U+20FBC | 𠾼 | 𠾼 |
U+20FEA | 𠿪 | 𠿪 |
U+2105C | 𡁜 | 𡁜 |
U+2106F | 𡁯 | 𡁯 |
U+21075 | 𡁵 | 𡁵 |
U+21076 | 𡁶 | 𡁶 |
U+2107B | 𡁻 | 𡁻 |
U+210C1 | 𡃁 | 𡃁 |
U+210C9 | 𡃉 | 𡃉 |
U+211D9 | 𡇙 | 𡇙 |
U+220C7 | 𢃇 | 𢃇 |
U+227B5 | 𢞵 | 𢞵 |
U+22AD5 | 𢫕 | 𢫕 |
U+22B43 | 𢭃 | 𢭃 |
U+22BCA | 𢯊 | 𢯊 |
U+22C51 | 𢱑 | 𢱑 |
U+22C55 | 𢱕 | 𢱕 |
U+22CC2 | 𢳂 | 𢳂 |
U+22D08 | 𢴈 | 𢴈 |
U+22D4C | 𢵌 | 𢵌 |
U+22D67 | 𢵧 | 𢵧 |
U+22EB3 | 𢺳 | 𢺳 |
U+23CB7 | 𣲷 | 𣲷 |
U+244D3 | 𤓓 | 𤓓 |
U+24DB8 | 𤶸 | 𤶸 |
U+24DEA | 𤷪 | 𤷪 |
U+2512B | 𥄫 | 𥄫 |
U+26258 | 𦉘 | 𦉘 |
U+267CC | 𦟌 | 𦟌 |
U+269F2 | 𦧲 | 𦧲 |
U+269FA | 𦧺 | 𦧺 |
U+27A3E | 𧨾 | 𧨾 |
U+2815D | 𨅝 | 𨅝 |
U+28207 | 𨈇 | 𨈇 |
U+282E2 | 𨋢 | 𨋢 |
U+28CCA | 𨳊 | 𨳊 |
U+28CCD | 𨳍 | 𨳍 |
U+28CD2 | 𨳒 | 𨳒 |
U+29D98 | 𩶘 | 𩶘 |
Copyright © 2010 Tex Texin. All rights reserved.
Top of page