View Single Post
Old 06-21-2014, 11:08 AM   #3
mikapanja
Perfectionist
mikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentametermikapanja can solve quadratic equations while standing on his or her head reciting poetry in iambic pentameter
 
Posts: 72
Karma: 12802
Join Date: Apr 2014
Device: none
Quote:
Originally Posted by kovidgoyal View Post
calibre detects encodings declared in HTML. I have no idea what Sigil does, from your description I'd guess it assumes UTF-8 always.

As for checking encodings, IIRC the editor autoconverts to UTF-8 whenever it processes any HTML, so I dont think checking will be possible or even necessary.
Well, the Calibre editor may autoconvert to UTF-8 on opening, but the header retains the old encoding:

<meta content="application/xhtml+xml; charset=iso-8859-1" http-equiv="content-type"/>

and I think the e-book readers that do not autoconvert may display text with different strange characters (â, Ã, Â, ¦, etc. instead of “, ”, etc.), as Sigil does.

I know I can use Modify ePub add-on to encode in UTF-8, just would like to know that I need to. The way it is, I must use Sigil to open the book and find out if the encoding is off.

I guess my question is: would it be possible for Check Book to report "There is a non-UTF-8 encoding declared in xyz.html"? That would be really helpful.
mikapanja is offline   Reply With Quote