[Info-vax] 8-bit characters

Lawrence D’Oliveiro lawrencedo99 at gmail.com
Thu Nov 11 16:57:02 EST 2021


On Friday, November 12, 2021 at 5:21:48 AM UTC+13, Arne Vajhøj wrote:
> On 11/10/2021 11:48 PM, Lawrence D’Oliveiro wrote: 
>> On Thursday, November 11, 2021 at 3:33:33 PM UTC+13, Arne Vajhøj wrote: 
>>> The biggest problem with UTF-8 is that the byte length is not 
>>> necessarily the character length ... 
>> 
>> That would be true of any Unicode encoding, even UCS-4.
>
> No. 

You didn’t know, then, that what Unicode codes define are not characters, but code points?
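
To make the point concrete, here is a minimal Python sketch (my own illustration, not anything from the thread so far). The helper name lengths() is made up for the example. It compares "é" written as one precomposed code point against the same visible character written as "e" plus a combining accent. Even in a fixed-width 4-byte encoding, the byte length tracks code points, not what a reader would call characters.

```python
import unicodedata

def lengths(text):
    """Return (code points, UTF-8 bytes, UTF-32 bytes) for text.

    Hypothetical helper for illustration. 'utf-32-be' is used so that
    no byte-order mark is added to the byte count.
    """
    return (len(text), len(text.encode("utf-8")), len(text.encode("utf-32-be")))

composed = "\u00e9"      # é as a single precomposed code point
decomposed = "e\u0301"   # e followed by COMBINING ACUTE ACCENT

print(lengths(composed))    # (1, 2, 4)
print(lengths(decomposed))  # (2, 3, 8)

# Both strings are canonically equivalent: NFC normalization maps one to the other.
print(unicodedata.normalize("NFC", decomposed) == composed)  # True
```

Both strings display as one character, yet the second takes 8 bytes in UTF-32 rather than 4. Counting what the user actually sees requires grapheme-cluster segmentation, which is a separate step from decoding in any of these encodings.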


