Quote:
Originally Posted by thibaulthalpern
I think typos can creep in when, say for example, you're switching from one character set to another...maybe. So maybe in the conversion process, something goes awry and the character set goes from Unicode 8 (UTF-8) to ASCII. Problems can appear there, perhaps.
Talking about character sets, I wish the computing industry would abandon ASCII entirely and move entirely to UTF-8. ASCII is such an Anglo-centric character set leaving the languages of the rest of the world aside.
|
Yes, UTF-8 (or UTF-16 if necessary) is a much better choice. One thing to keep in mind, at least as far as epub: for most non-ASCII characters you have to use the numeric form, as very few of the extended characters are defined in XHTML. This is not a problem with epub, but with the XHTML spec.