Quote:
Originally Posted by Hitch
That is one of 3 kinds of uncleaned output: 1) Word, direct to HTML, filtered, without any subsequent cleaning, but, more likely, is: 2) AbbyyFineReader output, direct to Word-->HTML or HTML, w/o any subsequent cleaning, or, god help us, 3) InDesign, with character style overrides, w/o any subsequent HTML and CSS cleaning.
|
You forgot one.
"I have an old version of Word and can't export to PDF, so I first print the file, then rescan it to jpg's at 100 DPI using a cheap-ass scanner from the 90's (can't do more because it's still USB 1.1), create a PDF from those JPG's using whatever program, and then I'll OCR it, output it to HTML, and have Smashwords create an EPUB automatically."
All done