View Single Post
Old 10-25-2011, 10:00 AM   #2
DSpider
Evangelist
DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.
 
DSpider's Avatar
 
Posts: 450
Karma: 343115
Join Date: Nov 2009
Location: Romania
Device: PW2 2014
AFAIK, FineReader opens DjVu files as JPG images (well, perhaps not JPG, but something close, maybe a compressed TIFF, I'm not really sure).


The quick and dirty method would be to have the OCR text under the image. In ABBYY FineReader 10 (haven't tried 11 yet), you can export PDFs with Text under the page image:




However, this wouldn't be any better than your average DjVu file with text underneath. The quality method, would be to properly proof-read after OCR-ing which takes time and patience - it basically means you read the whole book using ABBYY FineReader once, and once more the final version using Foxit Reader, Adobe Reader, Apple's iBooks, etc.

I think retouching should be done either using Word 2010 SP1 (for .docx), or LibreOffice 3.4.3 (for .rtf), the latest right now. Not some shoddy/half-assed text processing program.


If you want to preserve "the EXACT same look of the printed book", font matching can be a pain sometimes, especially since most publishers use commercial fonts. But it's totally worth it if you do it right.

I mean, sure, you could use a close match using one of the websites bellow. Or, if you're willing to go the extra mile track down the commercial variants (which I think is probably called piracy - but hey, you're not making any money off of it... are you?).

http://www.identifont.com
http://www.whatthefont.com
http://www.whatfontis.com

As a last resort, ask for someone's help: http://typophile.com/typeid

Hope this helps! Some Word 2010 training videos wouldn't hurt either. Learn to make use of macros instead of editing character spacing by hand. It would save a lot (and I mean A LOT) of time. Hook them up to hotkeys instead of right clicking - Font - Advanced... etc. etc.
DSpider is offline   Reply With Quote