Quote:
Originally Posted by Tex2002ans
Sounds great. Welcome to the forum.
I linked to a ton of my best summaries/resources/tutorials back in:
That should cover pretty much any/all best practices + digitization questions.
What are the images? Are they photographs? Can you post some samples?
Sounds to me like you may accidentally just be plopping in images of "scanned pages" into your EPUBs.
If your images are just scans of pages out of books, you'd need to OCR and change those into actual text.
If the images are photographs—like of people, trees, etc.—you can probably use JPGs instead of PNGs. That will save lots of space too.
|
Oh, sorry about that, it looks like I could have taken more care to clarify the above: I'm actually intentionally scanning pages in "photo mode" and adding the resultant PNGs (or JPGs, or whatever I end up going with) to the ePub as they are
and manually transcribing the same pages into text. The goal being to provide both versions for every document in the collection.
RE OCR, I'd be (very pleasantly) surprised if that were a viable option given that the majority of the original documents are handwritten.