Thread: PDF to EPUB
View Single Post
Old 02-25-2014, 04:38 PM   #3
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
Quote:
Originally Posted by sumguy View Post
Calibre's PDF conversion is awful, in particular the "Heuristics" for unwrapping text just based on line length is basically unusable. Try Mobipocket Creator instead, it does a much better job, and can be used with Sigil to make EPUBs. It's really worthwhile to learn Sigil rather than struggling with Calibre to author EPUBs.

My workflow is import PDF into Mobipocket Creator, and then just quit without doing anything else. Grab the resulting HTML file & images and import into Sigil. Clean it up by hand and/or regular expressions, add table of contents and cover, etc. Much better results than Calibre, though still a lot of hand editing to do often, Mobipocket does make some irritating mistakes, particularly with links and footnotes.
That's really doing it by hand vs doing automated processes to save time. Not calibre's fault... It can fix a lot of mistakes, but it will never be perfect. There's a lot of options to control how to make the attempt to derive meaning, and different PDFs will yield different results.

And you can use calibre's Edit Book to the same effect as Sigil once you have you EPUB. Saves having to install two programs, and gets a lot more attention to bugfixes nowadays, although it doesn't yet have spellcheck.
eschwartz is offline   Reply With Quote