View Single Post
Old 02-03-2024, 07:24 PM   #13
Joseph The Grave
Member
Joseph The Grave began at the beginning.
 
Posts: 21
Karma: 10
Join Date: Jul 2022
Device: PC
Quote:
Originally Posted by DNSB View Post
The books from IA tend to be scanned images of a page with a very poorly OCRred text page to allow for searching. Unless you are heavily into masochism, I would avoid trying to do anything with them.

And again, since there is an epub version of the book on Gutenberg, why not simply download that instead of saving the web page version and converting to epub? This link should point to Before the Dawn (epub) which I find much easier than messing with conversions.
Yeah, I'm giving up on the OCR PDF to HTML thing. Too much tedium and pain. I would like to find a way to substantially reduce the size of those PDFs though without too much loss of quality but when I attempt it I notice the new file size is barely smaller than the original so it hardly seems worth it.

I have problems finding stuff on PG with their search engine. If I type in "John Taine" his name won't even come up though I know he has books on there. I also have trouble finding the epub versions there but maybe I didn't look hard enough. It's otherwise been a valuable learning experience doing it the hard way with creating my own epubs so I don't regret it too much. Thanks for the help, everyone. Peace out.
Joseph The Grave is offline   Reply With Quote