08-10-2010, 01:35 PM   #30
PatNY
Zennist
Posts: 1,022
Karma: 47809468
Join Date: Jul 2010
Device: iPod Touch, Sony PRS-350, Nook HD+ & HD
Quote:
Originally Posted by vastav
I hope you tried the solution at http://www.pdf2epub.com and not the similar sounding offering by dnaml. The ePub output will have same paragraph breaks as those you find in the RTF or HTML export from Acrobat.
vastav, I originally did use that website. I tried it again last evening and it still resulted in a file with the paragraphs all run together. OK, so then I downloaded and installed the plugin into Acrobat 9 and the issue with the run-in paragraphs is still there.

The paragraphs in this pdf may not be formatted in a standard way, but when I do an intermediate conversion to RTF or HTML in Acrobat, they are all picked up correctly!

You can see for yourself as I am going to send you the ebook file by email so you can investigate what is going on with your methods.


Quote:
For the TOC, if you use the plugin I supply, all bookmarks in PDF automatically get converted to TOC in ePub. If you have a PDF which is tagged by the authoring application, you can simply create the bookmarks in Acrobat by choosing "New bookmarks from Structure" from the top drop-down available in the bookmarks tab in Acrobat. If you have a PDF which is not tagged (you can check by opening View > Navigation Panels > Tags), you should create the bookmarks manually in Acrobat before running the conversion filter for HTML/ RTF/ ePub to ensure that bookmarks get exported in a valid manner in the exported file.
Thanks for this very useful information. I did not know about these Acrobat features and I used them on another book. It made a difference in being able to generate a metadata TOC. I don't think Calibre can read bookmarks in a pdf this way and generate the metadata TOC.

pdf2epub seems to do a very nice job indeed on most conversions where other tools fail, but it's still not there yet if it can't correctly break the paragraphs on all files. I will keep on using/testing it for other pdf ebooks, however. In the meantime, let me know when you get the ebook and find out what the hitch was.

Also, does anyone know of a good automated way to insert a blank line between paragraphs in the body text of a pdf ebook? I couldn't figure out how to do it in Acrobat, except manually of course. There is no global search/replace feature in it.

My thinking was that if I could first insert a space between paragraphs in the problematic document, then pdf2epub wouldn't run the paragraphs all together when converting.
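
To make the idea concrete, here is a rough Python sketch of the same trick applied to the HTML export from Acrobat instead of the pdf itself, since that export already picks up the paragraphs correctly. It assumes each paragraph in the export ends with a closing </p> tag, which I have not verified for every file, and the function name and the spacer markup are just placeholders I made up. It only shows the blank-line insertion; whether it helps any converter downstream is untested.

```python
import re

# Matches a closing paragraph tag, case-insensitively, tolerating
# whitespace before the ">" (e.g. "</P >").
_CLOSE_P = re.compile(r"</p\s*>", re.IGNORECASE)


def add_paragraph_spacers(html: str, spacer: str = "<p>&nbsp;</p>") -> str:
    """Return `html` with `spacer` inserted after every closing </p> tag.

    Assumption: paragraphs in the exported HTML are delimited by </p>.
    The spacer markup is a hypothetical choice, not something any
    particular converter is known to require.
    """
    return _CLOSE_P.sub(lambda m: m.group(0) + "\n" + spacer, html)


if __name__ == "__main__":
    sample = "<p>First paragraph.</p>\n<p>Second paragraph.</p>"
    print(add_paragraph_spacers(sample))
```

You would read the exported file into a string, pass it through the function, and save the result under a new name so the original export stays untouched.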

--Pat

Last edited by PatNY; 08-10-2010 at 01:42 PM.