View Single Post
Old 12-12-2012, 03:27 AM   #5
Jimbo724
Connoisseur
Jimbo724 began at the beginning.
 
Posts: 60
Karma: 10
Join Date: Jun 2012
Device: Kindle Touch
Quote:
Originally Posted by Joykins View Post
search and replace (with nothing) the html elements you don't need.
That appears to be the best approach.

My guess is that someone scanned the paper book, did an OCR, did little to clean it up, and then created an epub file. Somehow a lot of html dross was inserted along the way. I almost have to go back to the OCR output to clean it up. To make matters really complicated, the text has a lot of intentional misspellings to imitate an Eastern European mangling of the English language. I can sympathize with the original processor. Alas, there is no retail ebook available.

I'm ordering the paper book through my local library in order to proofread the text. Maybe I will start over with my own scan and OCR. I haven't decided yet.

Last edited by Jimbo724; 12-12-2012 at 03:41 AM.
Jimbo724 is offline   Reply With Quote