Old 08-23-2011, 06:51 AM   #3
mrmikel
Color me gone
You can add duplicated caption text to the list of possible errors.
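
If you want a rough way to spot those, here is a minimal sketch, assuming the duplicates show up as the same line repeated within a few lines of itself in the converted text. The function name `find_repeated_lines` and the `window` and `min_len` values are made up for illustration, not part of Calibre or Sigil.

```python
def find_repeated_lines(text, window=3, min_len=15):
    """Return (line_number, line) for lines that repeat a recent earlier line.

    Assumption: a duplicated caption appears as an identical line (ignoring
    case and spacing) within `window` lines of its first occurrence.
    """
    lines = text.splitlines()
    # Normalize whitespace and case so trivial differences do not hide a repeat.
    normalized = [" ".join(line.split()).lower() for line in lines]
    hits = []
    for i, norm in enumerate(normalized):
        if len(norm) < min_len:
            continue  # skip short lines such as blank lines or page numbers
        if norm in normalized[max(0, i - window):i]:
            hits.append((i + 1, lines[i].strip()))
    return hits


sample = (
    "Figure 1: Sales by region\n"
    "Some body text here.\n"
    "Figure 1: Sales by region\n"
    "More text."
)
hits = find_repeated_lines(sample)
for number, line in hits:
    print(number, line)
```

It only flags candidates; you still have to look at each one and decide which copy to delete.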

Some people use Mobipocket Creator and feed its HTML output into Calibre or Sigil.

Whatever you do, plan on some work.

BTW, these tools only work if the PDFs contain actual text and are not just containers for scans of the pages. PDFs containing only images will have to be OCRed, and the result, often a mess, cleaned up. Then you get to appreciate that even a 2% error rate means errors on every page, multiplied by the number of pages to correct.
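
To put rough numbers on that, here is a back-of-the-envelope sketch. It assumes the 2% is a per-character error rate, and the page size (1,800 characters) and book length (300 pages) are made-up figures for illustration only.

```python
def expected_errors(chars_per_page, pages, error_rate):
    """Return (errors per page, errors in the whole book) for a given error rate."""
    per_page = chars_per_page * error_rate
    return per_page, per_page * pages


# Hypothetical book: 300 pages of about 1,800 characters each, 2% OCR error rate.
per_page, total = expected_errors(chars_per_page=1800, pages=300, error_rate=0.02)
print(f"about {per_page:.0f} errors per page, about {total:.0f} in the whole book")
```

Plug in your own page count and a character count from a typical page to see what you are signing up for.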