View Single Post
Old 07-27-2009, 01:05 PM   #26
Mordak
Junior Member
Mordak began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Jul 2009
Device: PRS-505
Quote:
Originally Posted by tfarrell View Post
My experience with ereader books is that some of them apparently "just work" because they don't use any of the offending characters: they use three periods instead of an ellipsis character, a hyphen instead of an em-dash, etc. I think it has more to do with the way the individual book than the fact that it comes from ereader.
I've had exactly the same experience - some books just work, and some books have messed up characters. It depends on the publisher, and a single bookstore sometimes has books that use the odd characters and sometimes does not.

For me, I work around the issue with just a text editor. I use SubEthaEdit to open the document, use the select menu at the bottom of the window to Reinterpret as Windows Latin 1, and then use the same menu to Convert to UTF-8. It only takes a few seconds, and afterwards they import into Calibre and convert to ePub without difficulty.

Sometimes I have found books that use weird &#XXX codes to specify quotes and dashes that display fine in a web browser, but do not display correctly on my PRS-505, in which case I just use the SubEthaEdit to find/replace the offending characters, then re-import and convert. A little laborious, but it gets the job done.
Mordak is offline   Reply With Quote