Supplementary Characters as Surrogates
To support supplementary characters without changing the
char primitive data type, which would break compatibility
with existing Java programs,
each supplementary character is represented by a pair of code point
values that are called surrogates.
The first value comes from the high surrogates range,
U+D800 to U+DBFF,
and the second value comes from the low surrogates range,
U+DC00 to U+DFFF.
For example, the Deseret character LONG I, U+10400,
is defined with this pair of surrogate values: U+D801 and
U+DC00.
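
The mapping above can be checked with a short sketch. The class name SurrogateDemo and the helper toSurrogatePair are hypothetical names introduced here for illustration; the arithmetic follows the standard UTF-16 encoding rule (subtract 0x10000, then split the remaining 20 bits into two 10-bit halves), and the result is compared against the Character methods from the standard library, assuming Java 7 or later.

```java
public class SurrogateDemo {

    // Hypothetical helper: splits a supplementary code point into its
    // high and low surrogates by hand, following the UTF-16 rule.
    static char[] toSurrogatePair(int codePoint) {
        if (!Character.isSupplementaryCodePoint(codePoint)) {
            throw new IllegalArgumentException(
                "Not a supplementary code point: " + Integer.toHexString(codePoint));
        }
        int offset = codePoint - 0x10000;               // 20-bit value
        char high = (char) (0xD800 + (offset >> 10));   // top 10 bits
        char low  = (char) (0xDC00 + (offset & 0x3FF)); // bottom 10 bits
        return new char[] { high, low };
    }

    public static void main(String[] args) {
        int longI = 0x10400; // Deseret LONG I

        char[] manual = toSurrogatePair(longI);
        char[] builtIn = Character.toChars(longI); // library equivalent

        // prints "manual:   U+D801 U+DC00"
        System.out.printf("manual:   U+%04X U+%04X%n", (int) manual[0], (int) manual[1]);
        // prints "built-in: U+D801 U+DC00"
        System.out.printf("built-in: U+%04X U+%04X%n", (int) builtIn[0], (int) builtIn[1]);

        // Recombine the pair into the original code point.
        int roundTrip = Character.toCodePoint(manual[0], manual[1]);
        // prints "round trip: U+10400"
        System.out.printf("round trip: U+%04X%n", roundTrip);

        // One character to the reader, but two char values in a String.
        String s = new String(builtIn);
        // prints "length=2, code points=1"
        System.out.println("length=" + s.length()
            + ", code points=" + s.codePointCount(0, s.length()));
    }
}
```

In application code, Character.toChars, Character.highSurrogate, and Character.lowSurrogate are likely the better choice; the manual version is shown only to make the arithmetic visible.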