View Single Post
Old 07-23-2011, 08:36 AM   #5
user_none
Sigil & calibre developer
user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.user_none ought to be getting tired of karma fortunes by now.
 
user_none's Avatar
 
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Here are a couple of ways you can go about doing it. Each method really depends on how comfortable you are with the file type. It also depends on how much work you want need to do with formatting. #2 for example will need the most formatting added by you but this might be easier than trying to fix formatting errors due to conversion from PDF.

1) Use pdfmanipulate (part of calibre but it's command line only) or a similar tool to merge all of the individual PDFs into one file. Convert this single PDF into a any format you're comfortable working with. HTMLZ, EPUB, TXT, ect. Then use Sigil to make your changes.

2) Convert each individual PDF to TXT either using calibre or Acrobate. Combine the files by copy and paste. In this case I would recommend using Textile to do the bulk of the formatting. Convert to EPUB and make any minor formatting changes (if necessary) using Sigil.

3) Convert each individual PDF to HTML. Then create an index.html file that has a link to each individual HTML file. A toc essentially and import the index.html into calibre. calibre will read each link (<a> tags) in the index file and gather all of the individual HTML files putting it all into a ZIP archive. You can then convert to EPUB or HTMLZ to get a combined file suitable for Sigil.
user_none is offline   Reply With Quote