05-04-2009, 12:59 PM   #19
chrisophus
Device: Kindle
Complex PDF to HTML

I wrote a Python script that converts the output of pdf2xml to HTML and tries to preserve the formatting of complex PDFs. I then use calibre to generate the ebook format (MOBI in my case). It seems to work pretty well. You can read more about it on my blog at http://talkings.org/2009/05/03/complex-pdf-html/.
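
To give a rough idea of the approach, here is a minimal sketch of the XML-to-HTML step. It is not the script from the blog post. It assumes the input looks like the XML that pdftohtml -xml emits (a pdf2xml root, page elements, and text elements carrying top and left attributes), and the function name pdf2xml_to_html and the sample data are made up for illustration. It only restores reading order and line grouping; fonts, columns, and images are ignored.

```python
import html
import xml.etree.ElementTree as ET


def pdf2xml_to_html(xml_text):
    """Turn pdf2xml-style XML into simple HTML (hypothetical helper).

    Assumes each <page> holds <text top="..." left="..."> elements.
    """
    root = ET.fromstring(xml_text)
    parts = ["<html><body>"]
    for page in root.iter("page"):
        parts.append('<div class="page">')
        # Reading order: top to bottom, then left to right.
        texts = sorted(
            page.iter("text"),
            key=lambda t: (float(t.get("top", "0")), float(t.get("left", "0"))),
        )
        current_top = None
        line = []
        for t in texts:
            top = float(t.get("top", "0"))
            # A change in vertical position starts a new output line.
            if current_top is not None and top != current_top:
                parts.append("<p>" + " ".join(line) + "</p>")
                line = []
            current_top = top
            # itertext() also collects text nested in <b> or <i> children.
            line.append(html.escape("".join(t.itertext())))
        if line:
            parts.append("<p>" + " ".join(line) + "</p>")
        parts.append("</div>")
    parts.append("</body></html>")
    return "\n".join(parts)


# Made-up sample input in the assumed layout.
SAMPLE = """<pdf2xml>
<page number="1" width="600" height="800">
<text top="100" left="300" width="50" height="12" font="0">world</text>
<text top="100" left="50" width="50" height="12" font="0">Hello</text>
<text top="120" left="50" width="90" height="12" font="0">a &lt; b</text>
</page>
</pdf2xml>"""

if __name__ == "__main__":
    print(pdf2xml_to_html(SAMPLE))
```

If the result is saved as book.html, calibre's command-line converter can produce the MOBI file with something like ebook-convert book.html book.mobi (the file names here are placeholders).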