Quote:
Originally Posted by kovidgoyal
calibre detects encodings declared in HTML. I have no idea what Sigil does, from your description I'd guess it assumes UTF-8 always.
As for checking encodings, IIRC the editor autoconverts to UTF-8 whenever it processes any HTML, so I dont think checking will be possible or even necessary.
|
Well, the Calibre editor may autoconvert to UTF-8 on opening, but the header retains the old encoding:
<meta content="application/xhtml+xml; charset=iso-8859-1" http-equiv="content-type"/>
and I think the e-book readers that do not autoconvert may display text with different strange characters (â, Ã, Â, ¦, etc. instead of “, ”, etc.), as Sigil does.
I know I can use Modify ePub add-on to encode in UTF-8, just would like to know that I
need to. The way it is, I must use Sigil to open the book and find out if the encoding is off.
I guess my question is: would it be possible for Check Book to report "There is a non-UTF-8 encoding declared in xyz.html"? That would be really helpful.