Quote:
Originally Posted by vastav
I hope you tried the solution at http://www.pdf2epub.com and not the similar sounding offering by dnaml. The ePub output will have same paragraph breaks as those you find in the RTF or HTML export from Acrobat.
|
vastav, I originally did use that website. I tried it again last evening and it still resulted in a file with the paragraphs all run together. OK, so then I downloaded and installed the plugin into Acrobat 9 and the issue with the run-in paragraphs is still there.
The paragraphs in this pdf may not be formatted in a standard way, but when I do an intermediate conversion to RTF or HTML in Acrobat, they are all picked up correctly!
You can see for yourself as I am going to send you the ebook file by email so you can investigate what is going on with your methods.
Quote:
For the TOC, if you use the plugin I supply, all bookmarks in PDF automatically get converted to TOC in ePub. If you have a PDF which is tagged by the authoring application, you can simply create the bookmarks in Acrobat by choosing "New bookmarks from Structure" from the top drop-down available in the bookmarks tab in Acrobat. If you have a PDF which is not tagged (you can check by opening View > Navigation Panels > Tags), you should create the bookmarks manually in Acrobat before running the conversion filter for HTML/ RTF/ ePub to ensure that bookmarks get exported in a valid manner in the exported file.
|
Thanks for this very useful information. I did not know about these Acrobat features and I used them on another book. It made a difference in being able to generate a metadata TOC. I don't think Calibre can read bookmarks in a pdf this way and generate the metadata TOC.
pdf2epub seems to do a very nice job indeed on most conversions where other tools fail, but it's still not there yet if it can't correctly break the paragraphs on all files. I will still keep on using/testing it for other pdf ebooks, however. In the meantime, let me know when you get the ebook and find out what was the hitch.
Also, does anyone know of a good automated way to insert a blank line between paragraphs in the body text of a pdf ebook? I couldn't figure out how to do it in Acrobat, except manually of course. There is no global search/replace feature in it.
I was figuring if I could first insert a space between paragraphs in the problematic document, then when converting via pdf2epub it wouldn't run the paragraphs all together.
--Pat