[Info-vax] Does OpenVMS Use Unicode?
Neil Rieck
n.rieck at sympatico.ca
Tue Jun 14 07:28:39 EDT 2016
Not wanting to engage in a flame war, the following quote from a popular web site says it all:
The original specification covered numbers up to 31 bits (the original limit of the Universal Character Set). In November 2003, UTF-8 was restricted by RFC 3629 to end at U+10FFFF, in order to match the constraints of the UTF-16 character encoding. This removed all five- and six-byte sequences, and almost half the four-byte sequences.
###
This restricts UTF-8 (which is a unicode encoding) to a subset of the entire unicode map. BTW, there are large holes (called planes) in the unicode map which allow for future growth. But new codes will not appear in UTF-8 unless RFC-3629 is superseded.
Neil Rieck
Waterloo, Ontario, Canada.
More information about the Info-vax
mailing list