[Info-vax] Does OpenVMS Use Unicode?

Neil Rieck n.rieck at sympatico.ca
Tue Jun 14 07:28:39 EDT 2016


Not wanting to engage in a flame war, I'll just say that the following quote from a popular web site says it all:

The original specification covered numbers up to 31 bits (the original limit of the Universal Character Set). In November 2003, UTF-8 was restricted by RFC 3629 to end at U+10FFFF, in order to match the constraints of the UTF-16 character encoding. This removed all five- and six-byte sequences, and almost half the four-byte sequences.

###

This restricts UTF-8 (which is a Unicode encoding) to code points U+0000 through U+10FFFF, a subset of the original 31-bit UCS space. BTW, the Unicode map is divided into planes of 65,536 code points each, and many of them are still unassigned, which allows for future growth. But codes above U+10FFFF will not appear in UTF-8 unless RFC 3629 is superseded.
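
To make the U+10FFFF limit from the quote concrete, here is a small Python sketch. The helper names (utf8_length, removed_four_byte_fraction) are mine, made up for illustration, and the "almost half" figure assumes we count only the non-overlong code points that the four-byte pattern could carry (U+10000 through U+1FFFFF):

```python
MAX_CODE_POINT = 0x10FFFF  # upper limit set by RFC 3629


def utf8_length(code_point: int) -> int:
    """Return how many bytes RFC 3629 UTF-8 needs for code_point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError(f"U+{code_point:X} is outside the RFC 3629 range")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError(f"U+{code_point:X} is a UTF-16 surrogate, not encodable")
    if code_point < 0x80:
        return 1
    if code_point < 0x800:
        return 2
    if code_point < 0x10000:
        return 3
    return 4


def removed_four_byte_fraction() -> float:
    """Fraction of the four-byte code points that RFC 3629 removed.

    Assumption: only non-overlong values (U+10000 and up) are counted.
    """
    pattern_max = 0x1FFFFF  # largest value the 21 payload bits can hold
    total = pattern_max - 0x10000 + 1
    removed = pattern_max - MAX_CODE_POINT
    return removed / total


if __name__ == "__main__":
    # The last legal code point still fits in four bytes.
    print(utf8_length(0x10FFFF), chr(0x10FFFF).encode("utf-8").hex())
    # Share of the four-byte range that was cut off.
    print(f"{removed_four_byte_fraction():.1%}")
    # The byte pattern for U+110000 is rejected by Python's decoder.
    try:
        b"\xf4\x90\x80\x80".decode("utf-8")
    except UnicodeDecodeError as exc:
        print("rejected:", exc.reason)
```

Python's own codec follows the same limit: chr(0x110000) raises ValueError, and the old five-byte lead byte 0xF8 is refused on decode.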

Neil Rieck
Waterloo, Ontario, Canada.
 



