[Info-vax] VMS Basic strings class D vs class S

Stephen Hoffman seaohveh at hoffmanlabs.invalid
Wed Mar 6 18:44:29 EST 2024


On 2024-02-28 14:38:03 +0000, Arne Vajhøj said:

> On 2/26/2024 4:17 PM, Stephen Hoffman wrote:
> 
>> I also wouldn't expect the RTLs to work with encodings other than ASCII 
>> and DEC MCS, either. And UTF-8 will fail in the expected places, and 
>> most searching and sorting tends not to be sensitive to the (written) 
>> language used within the text string.
> 
> I would assume that it works as long as the string is considered a 
> sequence of bytes not a sequence of characters.

The assumption that one byte is one character is embedded deeply in 
OpenVMS system and app code and APIs.

I would assume that such code will break in various ways when presented 
with UTF-8.

Anything assuming a correspondence between string length and displayed 
width is going to fail, for instance.
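
As a rough illustration of that mismatch (a sketch in Python, not anything from the OpenVMS RTLs; the describe() helper and its column heuristic are my own assumptions), here are the three different "lengths" a UTF-8 string can have:

```python
import unicodedata


def describe(text: str) -> dict:
    """Report three different 'lengths' of the same string.

    The column count is a rough heuristic (an assumption for this sketch):
    combining marks take 0 columns, East Asian wide/fullwidth characters
    take 2, everything else takes 1. Real terminals can differ.
    """
    columns = 0
    for ch in text:
        if unicodedata.combining(ch):
            continue  # combining mark: drawn over the previous character
        columns += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return {
        "bytes": len(text.encode("utf-8")),  # what a byte-oriented API sees
        "code_points": len(text),            # Unicode code points
        "columns": columns,                  # approximate displayed width
    }


if __name__ == "__main__":
    for sample in ("Vajh\u00f8j", "\u65e5\u672c", "e\u0301"):
        print(repr(sample), describe(sample))
```

The three counts agree only for plain ASCII. Code that truncates or pads on byte counts can also split a multi-byte character: cutting the 7-byte encoding of "Vajhøj" at 5 bytes leaves half of the "ø" behind.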

That's before discussing sorting and searching and language 
differences, as was mentioned. And normalization.
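
A small sketch of the normalization and sorting problems, again in Python and again with helper names (nfc_equal, crude_key) that are purely illustrative. The accent-stripping key is a crude stand-in, not real language-aware collation, which needs something like ICU:

```python
import unicodedata

# Two encodings of the same visible character "é":
COMPOSED = "\u00e9"      # single precomposed code point
DECOMPOSED = "e\u0301"   # "e" followed by a combining acute accent


def nfc_equal(a: str, b: str) -> bool:
    """Compare two strings after normalizing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


def crude_key(s: str) -> str:
    """Crude sort key (an assumption for this sketch): decompose, drop
    combining marks, fold case. This ignores language-specific rules."""
    decomposed = unicodedata.normalize("NFD", s)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch)).casefold()


if __name__ == "__main__":
    # A byte-wise or code-point-wise comparison sees two different strings.
    print(COMPOSED == DECOMPOSED)           # False
    print(nfc_equal(COMPOSED, DECOMPOSED))  # True

    words = ["zebra", "\u00e9clair", "apple"]
    print(sorted(words))                 # code-point order puts "éclair" last
    print(sorted(words, key=crude_key))  # accent-insensitive order
```

A search or sort that compares raw bytes treats the two forms of "é" as unrelated, and places accented words after "z". Which order is actually correct depends on the language of the text, which is the point made above.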

OpenVMS has (had) support for some of those differences with NCS and with 
ICU, though those APIs aren't (weren't) widely used by apps.


-- 
Pure Personal Opinion | HoffmanLabs LLC 



