Quote:
Originally Posted by Joykins
search and replace (with nothing) the html elements you don't need.
|
That appears to be the best approach.
My guess is that someone scanned the paper book, did an OCR, did little to clean it up, and then created an epub file. Somehow a lot of html dross was inserted along the way. I almost have to go back to the OCR output to clean it up. To make matters really complicated, the text has a lot of intentional misspellings to imitate an Eastern European mangling of the English language. I can sympathize with the original processor. Alas, there is no retail ebook available.
I'm ordering the paper book through my local library in order to proofread the text. Maybe I will start over with my own scan and OCR. I haven't decided yet.