07-11-2009, 12:08 AM | #16 |
creator of calibre
Posts: 43,866
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
yeah that bit of code tells calibre (or any other software using the HTML) what encoding the file is in, so it is very important. Without that the software has to guess, and depending on the technique it uses to guess, results will vary...
|
07-15-2009, 02:47 AM | #17 |
Too Maner Secrets
Posts: 32
Karma: 10
Join Date: Jul 2009
Device: Hanline V3 (current) Hanlin V5 (future)
|
Ran into this myself today (just bought a Hanlin V5) and it's showing large gaps after smartquotes in books that use them that I converted to ePub (haven't tried the original format).
Kovid, It seems like one solution to this would be to have a (probably defaulted off) conversion option that looks for smart quotes and apostrophes and converts them to vanilla ascii quotes and apostrophes. Do you think that would be hard or trivial thing to do? Just looking through one file I have issues with I see smartquotes sequences as: E2 80 9C and E2 80 9D apostrophes as: E2 80 99 In ascii, quotes are 22, and apostrophes are 27. A search and replace in the displayed text areas might be enough. It could possibly even globally, assuming that neither the HTML or CSS portions would be using any of those unicode sequences. It looks like a pretty good list can be found here: http://www.utf8-chartable.de/unicode...192&number=128 Meh. It's probably a function of the reader software as much as anything. I'll give openinkpot a try tomorrow after I buy a smaller SD card and see if it handles them any better. |
Advert | |
|
07-15-2009, 11:08 AM | #18 |
creator of calibre
Posts: 43,866
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Converting smart quotes is certainly do able, opena feture request ticket so I don't forget.
|
07-15-2009, 12:40 PM | #19 |
Too Maner Secrets
Posts: 32
Karma: 10
Join Date: Jul 2009
Device: Hanline V3 (current) Hanlin V5 (future)
|
Thanks for looking into this. I've created ticket #2846 for tracking purposes. I look forward to seeing the feature implemented.
|
07-15-2009, 03:52 PM | #20 |
hopeless n00b
Posts: 5,111
Karma: 19597086
Join Date: Jan 2009
Location: in the middle of nowhere
Device: PW4, PW3, Libra H2O, iPad 10.5, iPad 11, iPad 12.9
|
Do you mean removing smart quotes/apostrophes and converting them to regular ASCII equivalents? If you do implement it, would it be possible to make a switch to disable conversion? I happen to like smart quotes and I recall reading a thread here where people created complex regular expressions for converting ASCII quotes to smart quotes so I'm assuming I'm not alone in my preference.
|
Advert | |
|
07-15-2009, 04:06 PM | #21 |
Too Maner Secrets
Posts: 32
Karma: 10
Join Date: Jul 2009
Device: Hanline V3 (current) Hanlin V5 (future)
|
I'd think it would be off by default. It's not a benefit to everyone, but it would be a nice workaround/hack for some.
|
08-11-2009, 09:23 AM | #22 |
Zealot
Posts: 121
Karma: 1000021
Join Date: Feb 2008
Location: Hook, UK
Device: Cybook Bookeen
|
I am also missing apostrophes. I converted an ereader file with ereader2html, and then used Calibre to convert that to Mobi. A curious thing is that the file that came out of the ereader2html is actually a zip file and that is what Calibre converted. It seemed to work OK, except for the missing apostrophes--on the cybook they are just missing; in mobireader they are a funny block symbol. I haven't done much conversion(yet) and am a bit at a loss.
Thanks Rene |
08-11-2009, 12:47 PM | #23 |
creator of calibre
Posts: 43,866
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Set the input encoding in the conversion options to cp1252
|
08-11-2009, 04:56 PM | #24 |
Zealot
Posts: 121
Karma: 1000021
Join Date: Feb 2008
Location: Hook, UK
Device: Cybook Bookeen
|
Ummmm. How to put this.......Hunh?? I looked in Calibre and couldn't find that as an option, nor could I find a place where I could input anything. Am I being amazingly obtuse? While I'm not a complete Luddite I'm not a programmer and definitely need the spoonfeeding. Is there maybe another thread that goes into this?
Rene edit--The file that was made by ereader2html is in fact a .html; but for some reason Calibre insists that the input is a zip format. Could that be the problem? Last edited by momghoti; 08-11-2009 at 05:06 PM. |
08-11-2009, 08:15 PM | #25 |
You kids get off my lawn!
Posts: 4,220
Karma: 73492664
Join Date: Aug 2007
Location: Columbus, Ohio
Device: Oasis 2 and Libra H2O and half a dozen older models I can't let go of
|
Hey, momghoti!
You might want to check out this post on another thread. This'll put the encoding in your converted html file when you run the ereader2html script, and you won't have to worry about where to put it in Calibre any more! |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PRS-600 Am I just missing something here?! | linzylou63 | Sony Reader | 0 | 09-03-2010 02:44 PM |
Am I missing something here? | SavalBork | Alternative Devices | 3 | 08-27-2010 09:17 PM |
Um am I missing something? | hpjrt | Kobo Reader | 5 | 08-12-2010 12:27 AM |
Hello, this is what I've been missing. | Elimad | Introduce Yourself | 10 | 06-24-2010 08:06 PM |
Missing covers, missing content. Getting worse with each sync. | Mememememe | Kobo Reader | 7 | 06-16-2010 09:02 AM |