Quote:
Originally Posted by llasram
Actually, I take it back. The Mobiperl 'mobi2html' errors with the Mobipocket book I've generated appear to be errors with Mobiperl's handling of UTF-8 encoded books. With UTF-8 encoding, each text record is followed by 0 or more "overlapping" bytes finishing the current multibyte character, plus an 8-bit integer count of the overlapping bytes as an additional byte. These additional bytes are not counted as part of the content length for the purposes of computing the "filepos" of link targets.
|
Well, UTF-8 encoded books does not seem to be so common so I have not noticed this before. Why use UTF-8 and not entities?
Coding UTF-8 or other character set related things is probably the most boring thing I know. When I started using computers there was problem with Swedish characters and ISO 8859-1 support in programs. It is so depressing to have similar kind of problems 25 years later...