[Info-vax] Does OpenVMS Use Unicode?
Phillip Helbig undress to reply
helbig at asclothestro.multivax.de
Wed Jun 15 07:02:15 EDT 2016
In article <njr9r9$ldl$1 at news.albasani.net>, Jan-Erik Soderholm
<jan-erik.soderholm at telia.com> writes:
> That has never work for Swedish with "åäöÅÄÖ" anyway even with
> 7-bit ASCII or DEC-MCS. Not if the sort routines was not specificaly
> written to deal with it. The same with (simple) upper() and lower()
> funcions.
>
> Hm, just tested and f$edit with "upcase" or "lowercase" *does* handle
> åäöÅÄÖ correctly, didn't thought that before... :-)
EDT will properly change the case of such letters, but searching for A
or any "variant" of A such as Á, À, Â, Ã, Ä, Å will match all (and A).
SEARCH in DCL behaves as it should (matches only the searched-for
variant, by default in both cases).
I've used an EDT macro to transform what Jan-Erik wrote (on a Mac?) into
DEC MCS/ISO-8869-{1|15}. :-)
> But a simple SORT of a textfile gets it wrong. It sorts Ä ->Å ->Ö
> while the correct order is Å ->Ä ->Ö .
Presumably it sorts them in the order of the corresponding bit values.
Your "correct" order is correct in Swedish. In Norwegian (where there
is no Ö but there is Ø instead and no Ä but Æ instead) the correct order
is Æ ->Ø ->Å.
By the way, HELP FORTRAN CHARAC ASCII brings up the ASCII table, and
HELP FORTRAN CHAR DEC brings up the DEC MCS (similar to ISO-8859-1 and
ISO-8859-15) table.
The logic in these tables is that the order is, first, the order of
the "similar" ASCII letters, then variants are listed in roughly the
order of appearance (though with the same order for all corresponding
ASCII letters; see above).
I've written this on an LK411-AA in a DECterm on a DS10 monitor running
CDE; non-ASCII characters with the COMPOSE key.
Even though I use NEWSRDR, and generally don't mess with mime for usenet
posts, so that stuff like this will work, I add the following lines to
the headers (figure out how I do it with NEWSRDR for extra credit):
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-15
Content-Transfer-Encoding: 8bit
If I want the EURO sign, I just type COMPOSE-x-o, €, and see an arrow
flying away from me (as used in circuit diagrams) but someone correctly
interpreting based on the headers should see the EUR sign.
More information about the Info-vax
mailing list