Unicode Supplementary Characters Test Data

It is important that software and Web applications support Unicode Supplementary characters. This page provides 62 supplementary characters that are useful test cases.

Supplementary Character Test Data

The following data can be used for testing. Copy and paste the characters into a UTF-8 editor to insert them into any of your test files.

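The data is provided in three forms: as 4-byte UTF-8 characters, as numeric character references (NCRs), and in a table that lists each character with its Unicode scalar value.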

Image of Characters on My Machine

(Image: supplementary characters for testing)

4-Byte UTF-8

𠜎 𠜱 𠝹 𠱓 𠱸 𠲖 𠳏 𠳕 𠴕 𠵼 𠵿 𠸎 𠸏 𠹷 𠺝 𠺢 𠻗 𠻹 𠻺 𠼭 𠼮 𠽌 𠾴 𠾼 𠿪 𡁜 𡁯 𡁵 𡁶 𡁻 𡃁 𡃉 𡇙 𢃇 𢞵 𢫕 𢭃 𢯊 𢱑 𢱕 𢳂 𢴈 𢵌 𢵧 𢺳 𣲷 𤓓 𤶸 𤷪 𥄫 𦉘 𦟌 𦧲 𦧺 𧨾 𨅝 𨈇 𨋢 𨳊 𨳍 𨳒 𩶘
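As a quick sanity check that test data survived a copy/paste intact, the characters can be re-encoded and their byte lengths inspected. The following is a minimal sketch, assuming Python 3; the helper name `utf8_hex` and the choice of four sample code points are illustrative and not part of the original page.

```python
# A handful of the code points from the test set (not all 62).
SAMPLE_CODE_POINTS = [0x2070E, 0x20731, 0x20779, 0x29D98]


def utf8_hex(ch):
    """Return the UTF-8 bytes of a single character as spaced upper-case hex."""
    return " ".join(f"{b:02X}" for b in ch.encode("utf-8"))


for cp in SAMPLE_CODE_POINTS:
    ch = chr(cp)
    encoded = ch.encode("utf-8")
    # Every supplementary character occupies exactly four bytes in UTF-8.
    print(f"U+{cp:04X} -> {utf8_hex(ch)} ({len(encoded)} bytes)")
```

If any character reports fewer than four bytes, or the file contains U+FFFD or question marks instead, the editor or clipboard probably did not preserve the data.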

Numeric Character References (NCR)

𠜎 𠜱 𠝹 𠱓 𠱸 𠲖 𠳏 𠳕 𠴕 𠵼 𠵿 𠸎 𠸏 𠹷 𠺝 𠺢 𠻗 𠻹 𠻺 𠼭 𠼮 𠽌 𠾴 𠾼 𠿪 𡁜 𡁯 𡁵 𡁶 𡁻 𡃁 𡃉 𡇙 𢃇 𢞵 𢫕 𢭃 𢯊 𢱑 𢱕 𢳂 𢴈 𢵌 𢵧 𢺳 𣲷 𤓓 𤶸 𤷪 𥄫 𦉘 𦟌 𦧲 𦧺 𧨾 𨅝 𨈇 𨋢 𨳊 𨳍 𨳒 𩶘

The characters display properly on Windows 7 with either IE 8 or Firefox 3.6. I have the Chinese language pack installed, as well as a number of other fonts, so my system is not representative. If you have problems displaying the supplementary characters, let me know and I will provide additional information about fonts needed, etc.
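For test files that must stay pure ASCII, the same characters can be written as numeric character references. The sketch below is one way to generate them, assuming Python 3; `to_ncr` is a hypothetical helper name, and the hexadecimal form is only one of the two forms HTML accepts.

```python
import html


def to_ncr(text, hexadecimal=True):
    """Replace each non-ASCII character with an HTML numeric character reference."""
    parts = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            parts.append(ch)            # ASCII passes through unchanged
        elif hexadecimal:
            parts.append(f"&#x{cp:X};")  # e.g. &#x2070E;
        else:
            parts.append(f"&#{cp};")     # e.g. &#132878;
    return "".join(parts)


sample = chr(0x2070E) + " " + chr(0x20731)
encoded = to_ncr(sample)
print(encoded)
# html.unescape from the standard library reverses the conversion.
print(html.unescape(encoded) == sample)
```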

Background

Supplementary Characters

In the Unicode Standard, supplementary characters are the characters assigned code points from U+10000 to U+10FFFF. In other words, they are the Unicode characters above U+FFFF.
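Because these code points lie above U+FFFF, UTF-16 represents each of them as a pair of 16-bit code units (a surrogate pair). The following sketch shows the range check and the standard surrogate arithmetic, assuming Python 3; the function names are illustrative.

```python
def is_supplementary(cp):
    """True if the code point lies in the range U+10000..U+10FFFF."""
    return 0x10000 <= cp <= 0x10FFFF


def to_surrogate_pair(cp):
    """Return the (high, low) UTF-16 surrogates for a supplementary code point."""
    if not is_supplementary(cp):
        raise ValueError(f"U+{cp:04X} is not a supplementary code point")
    offset = cp - 0x10000              # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits
    return high, low


high, low = to_surrogate_pair(0x2070E)
print(f"U+2070E -> {high:04X} {low:04X}")
```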

Supplementary Character Use Requirements

A number of characters in the Supplementary Planes are in frequent use in Asian markets, so software intended for those markets must support the Supplementary Planes. To identify useful test characters, I selected a set from IICore.

IICore

In 2005, the IRG (Ideographic Rapporteur Group) identified a set of ideographs called the International Ideographs Core (IICore). The roughly 10,000 ideographs in IICore are the most frequently used characters and cover the vast majority of modern texts in all locales where ideographs are used. The collection is intended for use in devices with limited resources, such as mobile phones.

Test Characters

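To obtain characters that are good for testing software support for the Supplementary Planes, I extracted the 62 characters in IICore that lie in the Supplementary Planes. These characters have two useful properties: they are in frequent use, as their inclusion in IICore indicates, and each one requires four bytes in UTF-8.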

Requirement for Testing Supplementary Characters

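It is important to test software with characters from the Supplementary Planes.

Testing with supplementary characters can reveal code that does not provide the necessary support, for example code that assumes every character fits in a single 16-bit UTF-16 code unit, or that a UTF-8 sequence is never longer than three bytes.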
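A simple way to apply the test data is a round-trip check: pass each character through the code under test and confirm it comes back unchanged. The sketch below assumes Python 3; `check_round_trip` is a hypothetical helper, and `bmp_only` is a deliberately broken stand-in for application code, not a real library function.

```python
def utf16_units(text):
    """Number of 16-bit code units the text occupies in UTF-16."""
    return len(text.encode("utf-16-le")) // 2


def check_round_trip(process, samples):
    """Return the samples that `process` does not hand back unchanged."""
    return [s for s in samples if process(s) != s]


def bmp_only(text):
    # Hypothetical buggy code: silently drops every character above U+FFFF.
    return "".join(ch for ch in text if ord(ch) <= 0xFFFF)


samples = [chr(cp) for cp in (0x2070E, 0x20731, 0x29D98)]

failures = check_round_trip(bmp_only, samples)
print(f"{len(failures)} of {len(samples)} samples were damaged")

# One character, but two UTF-16 code units: a common source of length bugs.
print(len(samples[0]), utf16_units(samples[0]))
```

In a real test suite, `bmp_only` would be replaced by the function, database write, or network call being verified.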

Supplementary Characters

Unicode Scalar Value    UTF-8    NCR
U+2070E    𠜎    𠜎
U+20731    𠜱    𠜱
U+20779    𠝹    𠝹
U+20C53    𠱓    𠱓
U+20C78    𠱸    𠱸
U+20C96    𠲖    𠲖
U+20CCF    𠳏    𠳏
U+20CD5    𠳕    𠳕
U+20D15    𠴕    𠴕
U+20D7C    𠵼    𠵼
U+20D7F    𠵿    𠵿
U+20E0E    𠸎    𠸎
U+20E0F    𠸏    𠸏
U+20E77    𠹷    𠹷
U+20E9D    𠺝    𠺝
U+20EA2    𠺢    𠺢
U+20ED7    𠻗    𠻗
U+20EF9    𠻹    𠻹
U+20EFA    𠻺    𠻺
U+20F2D    𠼭    𠼭
U+20F2E    𠼮    𠼮
U+20F4C    𠽌    𠽌
U+20FB4    𠾴    𠾴
U+20FBC    𠾼    𠾼
U+20FEA    𠿪    𠿪
U+2105C    𡁜    𡁜
U+2106F    𡁯    𡁯
U+21075    𡁵    𡁵
U+21076    𡁶    𡁶
U+2107B    𡁻    𡁻
U+210C1    𡃁    𡃁
U+210C9    𡃉    𡃉
U+211D9    𡇙    𡇙
U+220C7    𢃇    𢃇
U+227B5    𢞵    𢞵
U+22AD5    𢫕    𢫕
U+22B43    𢭃    𢭃
U+22BCA    𢯊    𢯊
U+22C51    𢱑    𢱑
U+22C55    𢱕    𢱕
U+22CC2    𢳂    𢳂
U+22D08    𢴈    𢴈
U+22D4C    𢵌    𢵌
U+22D67    𢵧    𢵧
U+22EB3    𢺳    𢺳
U+23CB7    𣲷    𣲷
U+244D3    𤓓    𤓓
U+24DB8    𤶸    𤶸
U+24DEA    𤷪    𤷪
U+2512B    𥄫    𥄫
U+26258    𦉘    𦉘
U+267CC    𦟌    𦟌
U+269F2    𦧲    𦧲
U+269FA    𦧺    𦧺
U+27A3E    𧨾    𧨾
U+2815D    𨅝    𨅝
U+28207    𨈇    𨈇
U+282E2    𨋢    𨋢
U+28CCA    𨳊    𨳊
U+28CCD    𨳍    𨳍
U+28CD2    𨳒    𨳒
U+29D98    𩶘    𩶘