|06-17-2013, 09:56 AM||#1|
Join Date: Feb 2012
Location: São Paulo / Brazil
Device: Kindle Touch, Kindle PaperWhite, ASUS Nexus 7, LG Nexus 4
PDF to Kindle, but not only contents!
This weekend I decided to face the whole procedure to convert a PDF to read as an e-Book on my Kindle PW.
I followed the formula:
PDF -> MobiPocket -> HTML -> Sigil -> EPUB -> Calibre -> AZW3 -> Kindle PW.
Very easy if you're only interested in read the book, if you're only interested in the contents of the book. BUT, if you want the formatation, the design, the presentation, the table of contents, parts and chapters' division, footnotes with proper links and so on, it's another story...
So, my question: is there any [easy] way beyond the HARD WORK that you need to put on the Sigil phase in order to have an e-Book from a PDF properly formatted? By "properly formatted" I mean the way that the book is presented in the PDF version.
|06-17-2013, 10:33 AM||#2|
Join Date: Nov 2010
Location: Goettingen, Germany
Device: Kindle Paperwhite, Kobo Mini
I guess that would be to use a sophisticated OCR engine to process the PDF. They do PDFs nowadays, too, besides scans. And the CR rate is excellent, is mostly their layout recognition that is going to help here. You can go to document formats from there, e.g. MS Office.
|06-17-2013, 11:06 AM||#3|
Join Date: Nov 2006
Device: PW2, iPad Retina Mini, iPhone 4, MS Surface Pro, Kobo H2O, N7
Agreed - a decent OCR program like Abbyy FineReader is the only way to do it.
|06-17-2013, 11:18 AM||#4|
Join Date: Apr 2013
I don't know how to retain the formatting but retaining the links (table of contents, footnotes, all internal and external links) is easy:
PDF > Mobipocket Creator > Build Mobi > Copy to KPW > Enjoy!
|06-17-2013, 02:23 PM||#5|
Join Date: Feb 2010
Location: Monroe Wisconsin
Device: K3, Kindle Paperwhite, Calibre, and Mobipocket for Pc (netbook)
PDF to epub works well and then it's just a matter of running it through sigil (for fixing the TOC) and calibre (to convert to mobi format).
|06-17-2013, 03:00 PM||#6|
Join Date: Mar 2011
Device: Kindle 5.3.7
Instead of Sigil you may use MsWord (with macros) . On outline view you can easily determine chapters. And you can also use the other ordinary tools of MsWord to format the text. (pagebreak, etc also possible)
PDF -> MobiPocket -> HTML -> MsWord (with macros)-> filtered HTML-> zipped filtered HTML -> Calibre (Toc according to h1,h2,h3) -> AZW3 -> Kindle PW.
Alternativley this way also possible though not used myself :
PDF -> MobiPocket -> HTML -> MsWord-> DOCX -> Calibre (Toc according to h1,h2,h3) -> AZW3 -> Kindle PW.
|kindle pw, pdf|
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|PRS-T2 PDF Table of Contents in PRS-T2?||Zistmo||Sony Reader||1||10-28-2012 09:25 AM|
|Kindle DX and PDF Table of Contents||Gideon||Amazon Kindle||12||03-19-2012 06:14 PM|
|Questions about table of contents with PDF||invinceable||Amazon Kindle||0||12-25-2010 09:05 PM|
|Blank pdf Table of Contents||eruthven||Sony Reader||0||01-07-2009 01:08 PM|
|PDF Table of Contents||Aerlock||Sony Reader||0||01-12-2007 02:26 PM|