View Single Post
Old 08-03-2011, 02:28 AM   #4
delphin
Evangelist
delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.delphin ought to be getting tired of karma fortunes by now.
 
Posts: 434
Karma: 346901
Join Date: Dec 2010
Device: SONY PRS-650
Quote:
Originally Posted by Jashn View Post
You hit the nail on the head, Delphin. I seem to have made my error whilst trying to use Calibre. When I avoid that and just dump it on the unit, all's well.

So it's the embedded fonts, which allow me to see the PDF, that also prevent me from converting it to an ePub? Oh the irony! Would you know any other software I could use to convert it to an ePub or is there no hope? It's not really necessary but it would make reading easier.

Thanks for all your help, Delphin!!!

Take care,

Jashn
It's possible that there is a better conversion program for PDFs, but I haven't found one.

The PDF conversion apps that I have found, don't work much better than the built in PDF re-flow feature on the PRS-650. They they can extract only simple text, but no fonts or formatting.

I tried to load your doc into the app I use in Linux, and the Hindi text gets messed up and converted to roman characters, just as you saw in Calibre.

This is was the case even when I tried to set the converter to use a proper unicode font that should support Hindi text.

My guess is that the font encoding inside your PDF is not standard uni-code, but is instead based on an earlier Adobe font encoding that works only inside PDFs and that doesn't easily map to unicode fonts.

Calibre can convert documents with non-roman language character sets only if they can be mapped to a proper unicode font (and even then, this would require you to manually embed the font and create a custom CSS inside the EPUB to point to the proper unicode font.)

In any case, even when these apps do work, they only give you simple unformatted text (which usually doesn't work any better than the re-flow on your PRS-650).
delphin is offline   Reply With Quote