|10-30-2012, 06:57 PM||#1|
Join Date: Oct 2012
Device: Sony PRS-T1
Problem with character encoding
I recently got a book in mobi format. When converted with Calibre to epub and sent to my sony prs-t1, a lot of ? symbols appear. When I looked into the mobi html file using the mobi extractor plugin, I saw that there were many Â characters before quotes,question marks, etc. I tried changing into all available char encoding, but it didn't work. The book is in English, but the characters that appear after the Â are different to the ones I normally see in texts. For example, the " are tilted to the left or to the right. Is there any way to solve this?
|11-01-2012, 07:53 AM||#2|
Join Date: Sep 2010
Device: Kobo aura HD, Kobo Arc, Sony T2, Kindle Fire HDX 8.9 , Kindle for PC
you can usually manually fix those things in the epub, in sigil, before sending it to sony, though it is often less hassle to find a better, retail quality, source,
A lot of books have curly quotes for speech, no need to change those, unless you really want to.
how does the mobi version look if you open it in kindle for PC , not in calibre viewer.
TIP: to see how the book will long on Sony, before actually sending it, open the epub in ADE on your PC.
|11-09-2012, 10:11 PM||#3|
Join Date: Jul 2012
Device: Nook Simple Touch Glowlight; Sony PRS-T1; Kindle
It has to do with the specified character set. If you use mobi-unpack tool to disassemble it to its component parts, you can open the main html file with notepad or wordpad and see what character encoding has been used. Then when you convert in calibre, you can supply that character set name in the "Look & Feel" section of the conversion dialog.
My experience has been that nearly every time, it is iso8859-1. So now I just try that first, before bothering with taking anything apart. On the rare occasion it doesn't work, then you can still dig into the mobi file to see what it really is.
Also, you're best off NOT trying to make the changes in Sigil, which assumes by default that everything is UTF-8. Once you've opened the file with Sigil and then saved any changes. You're pretty much stuck with the bad characters, until you do as cybmole suggests, and manually edit them. Ugh.
Last edited by Barb-B; 11-09-2012 at 10:15 PM.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|What character encoding am I seeing?||Claghorn||Conversion||1||08-22-2012 10:02 AM|
|Problem with font or character encoding||no harmony||Calibre||2||11-25-2011 09:50 AM|
|Pdf to epub Turkish character encoding problem||blueresistance||Conversion||1||02-25-2011 05:31 PM|
|how to tell the character encoding???||rheostaticsfan||Calibre||23||06-21-2010 03:26 PM|
|FBReader fixes character encoding problem||jbenny||News||1||10-18-2007 10:50 PM|