[Info-vax] character set translation for language accents

JF Mezei jfmezei.spamnot at vaxination.ca
Fri Apr 17 15:54:15 EDT 2009


jcwoman1963 at hotmail.com wrote:
> Vous répondez à une personne....

> 56 6f 75 73 20 72 e9 70 6f 6e 64 65 7a 20 e0 20 75 6e 65 20 70
> 65 .....


tape a look at :
http://www.vaxination.ca/vms/index.html

In the "other resources", I have a character set table which you can
print. Very userful stuff.

e9 in DEC and ISO Latin1 is "é"
e0 in DEC and ISO Latin1 is "à"

If you program in C, you need to have all character variables declared
as "unsigned char" because "signed char" (the default) does cut off high
order bits when you print/compare them.

Since you mentioned "macro", I am not sure what you are using to output
your data.

When you cut off é's high order bit it goes from e9 ( 1110 1001 ) into
69 ( 0110 1001 ) and that is an "i".

So if nothing at all is outputted with this character, it is more than
just 8 to 7 bit truncation.

What are you using to display this ? Some terminal emulator ? Has it
been set to accept all modern characters or is it still set to 1960s USA
ASCII ?

The fact that your trace shows that you are getting the é as an e9 shows
that your communications link is fine.



More information about the Info-vax mailing list