Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 01-02-2013, 12:36 PM   #1
claytoncarney
Junior Member
claytoncarney began at the beginning.
 
claytoncarney's Avatar
 
Posts: 4
Karma: 10
Join Date: Jan 2013
Device: none
EPUB -> PDF: Image Rather Than Text

I've only been using calibre a few hours, so please bear with me if my questions seem obvious...

Background: I have my technical library in PDF files. I use Foxit Reader to read, highlight, and annotate my books. In order to highlight, the PDF files must contain embedded text, as opposed to an image of text. Sometimes desired books are not available in PDF, only EPUB. I'm seeking to use calibre to make the necessary conversion.

As a start, I converted the calibre Quick Start Guide EPUB. After tweaking the conversion preferences, I got a functional PDF. I obviously have much to learn, but so far, so good.

However, when I convert a technical book EPUB with the same preferences, I encounter an odd problem. The resulting PDF appears to be mostly an image, rather than embedded text. Only certain types of headings or mono-spaced fonts are converted to text. The remainder of the content appears to convert to an image of text, rather than actual text.

I've looked at the Debug dump. The OEBPS HTML files in the input, parsed, structure, and processed directories all clearly have the content text intact. Yet, the converted PDF appears to be almost completely an image.

The only possible clue I see is that the technical book EPUB includes several OTF font files. The calibre Quick Start Guide EPUB does not. Outside of that, I'm lost as to why I get such drastically different results from the same conversion preferences.

Any suggestions are greatly appreciated...
claytoncarney is offline   Reply With Quote
Old 01-02-2013, 12:56 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 26,124
Karma: 5101571
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
THe problem is caused by the fonts, and it will be fixed in the next release.
kovidgoyal is offline   Reply With Quote
Old 01-03-2013, 04:32 AM   #3
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 26,124
Karma: 5101571
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Actually, it turns out that it cannot be fixed as yet, on windows. It will not happen if you convert on linux or OS X, but Qt on windows has no support for Postscript fonts (it uses GDI instead of DirectWrite). The only way to fix it would be to drop support for Windows versions earlier than Vista SP2, which is not something that is worth the tradeoff at this time.

So you will have to either replace the otf fonts with truetype version of run the conversion ona linux or os x system.
kovidgoyal is offline   Reply With Quote
Old 01-03-2013, 12:15 PM   #4
claytoncarney
Junior Member
claytoncarney began at the beginning.
 
claytoncarney's Avatar
 
Posts: 4
Karma: 10
Join Date: Jan 2013
Device: none
Thank you for the excellent support!

Ran the conversion using calibre 0.9.12 on Fedora 17 with no problem.

Also used FreeFontConverter to convert otf to ttf, edited the various otf references, rezipped the epub, and ran the conversion using calibre 0.9.12 on Windows 7 with no problem.

calibre rocks! Many thanks for this great product...
claytoncarney is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
PDF Batch Text/Image Identifier owlman112 PDF 2 01-22-2012 09:12 PM
HTML to EPUB Inline Text/Image Issue HoushaSen Conversion 2 07-02-2011 08:03 PM
PDF Text AND Page Image.. wierd.. mathewb Sony Reader 0 07-08-2010 02:46 PM
PDF virtual printer as text not image mowbray Amazon Kindle 7 02-05-2010 12:32 PM
PDF Image -> OCR -> text frikk Workshop 9 07-08-2009 07:21 PM


All times are GMT -4. The time now is 04:53 AM.


MobileRead.com is a privately owned, operated and funded community.