Hey mark!
MK>> so-called latin-1 has been misidentified as iso-8859-1 by many (too
many).
MK>> [...]
MK>> BTW latin1 is iso-8859-1.
ml> the above seems contridictory... is it??
Nope. latin1 is not equal to latin-1.
For example;
printf "A M\u00f8\u00f8se once bit my sister" | iconv -f utf8 -t latin1
works but;
printf "A M\u00f8\u00f8se once bit my sister" | iconv -f utf8 -t latin-1
will exit with;
iconv: conversion to `latin-1' is not supported
Try `iconv --help' or `iconv --usage' for more information.
When researching the problem I came across some references to cp1252 but nobody
would go out on a limb over it. If you search IANA they produce output for
LATIN-1 but no insight about conversion to any particular 8-bit encoding. From
observations it seems to me that the Windows people use cp1252 for latin-1.
Björn Felton comes to mind but I've seen others totally get it wrong when they
assume iso-8859-1.
Anyhow the bottomline here is to avoid it like the plague. Besides cp1252
works with iconv so it isn't like there isn't a suitable encoding that will
work when targetting most 8-bit charset users these days.
Life is good,
Maurice
... Don't cry for me I have vi.
--- GNU bash, version 4.4.0(1)-rc1 (x86_64-atom-linux-gnu)
* Origin: Little Mikey's Brain - Ladysmith BC, Canada (1:153/7001.0)
|