[Info-vax] Does OpenVMS Use Unicode?

Phillip Helbig undress to reply helbig at asclothestro.multivax.de
Wed Jun 15 07:02:15 EDT 2016


In article <njr9r9$ldl$1 at news.albasani.net>, Jan-Erik Soderholm
<jan-erik.soderholm at telia.com> writes: 

> That has never work for Swedish with "åäöÅÄÖ" anyway even with
> 7-bit ASCII or DEC-MCS. Not if the sort routines was not specificaly
> written to deal with it. The same with (simple) upper() and lower()
> funcions.
> 
> Hm, just tested and f$edit with "upcase" or "lowercase" *does* handle
> åäöÅÄÖ correctly, didn't thought that before... :-)

EDT will properly change the case of such letters, but searching for A
or any "variant" of A such as Á, À, Â, Ã, Ä, Å will match all (and A).
SEARCH in DCL behaves as it should (matches only the searched-for 
variant, by default in both cases).

I've used an EDT macro to transform what Jan-Erik wrote (on a Mac?) into 
DEC MCS/ISO-8869-{1|15}.  :-)

> But a simple SORT of a textfile gets it wrong. It sorts Ä ->Å ->Ö
> while the correct order is Å ->Ä ->Ö .

Presumably it sorts them in the order of the corresponding bit values.

Your "correct" order is correct in Swedish.  In Norwegian (where there 
is no Ö but there is Ø instead and no Ä but Æ instead) the correct order 
is Æ ->Ø ->Å.

By the way, HELP FORTRAN CHARAC ASCII brings up the ASCII table, and 
HELP FORTRAN CHAR DEC brings up the DEC MCS (similar to ISO-8859-1 and 
ISO-8859-15) table.

The logic in these tables is that the order is, first, the order of 
the "similar" ASCII letters, then variants are listed in roughly the 
order of appearance (though with the same order for all corresponding 
ASCII letters; see above).

I've written this on an LK411-AA in a DECterm on a DS10 monitor running
CDE; non-ASCII characters with the COMPOSE key.

Even though I use NEWSRDR, and generally don't mess with mime for usenet 
posts, so that stuff like this will work, I add the following lines to 
the headers (figure out how I do it with NEWSRDR for extra credit):

   Mime-Version: 1.0
   Content-Type: text/plain; charset=ISO-8859-15
   Content-Transfer-Encoding: 8bit

If I want the EURO sign, I just type COMPOSE-x-o, €, and see an arrow 
flying away from me (as used in circuit diagrams) but someone correctly 
interpreting based on the headers should see the EUR sign.




More information about the Info-vax mailing list