Quote:
Originally Posted by anonlivros
I didn't know this software ... I'll check it out, thanks!
Do you recommend making these edits with it, to then generate PNG pages that would be further processed by FineReader?
|
Yes. Use Scan Tailor Advanced as an in-between step.
Instead of:
1. Take your pictures.
2. Use Finereader to crop, dewarp, change to B&W, [...].
You:
1. Take your pictures.
2. Use Scan Tailor Advanced to crop, dewarp, turn images B&W, [...].
3. Feed those into Finereader.
* * *
You can see some example images I posted in Post #15 in the "OCRing + EPUBing my first book: Tips?" thread:
So pictures of pages like this:
or this:
could turn into this:
* * *
Instead, when you only use Finereader's built-in stuff, you get pages like this:
vs.
Of course, those images are easy examples.
But when you have pages that are:
- crooked or very curved because of the spine
- uneven lighting
- very speckled
you'll see how much better Scan Tailor is at those steps.
Plus, with Scan Tailor, you can adjust all the sliders along every step, or even different settings on a per-page basis.
So let's say one page had lots of speckles (tiny dots):
https://www.mobileread.com/forums/at...3&d=1567734681
You could set the despeckling strength to very high, so you might get something like this:
https://www.mobileread.com/forums/at...4&d=1567734681
You can adjust the strength so it catches the speckles, while still leaving the actual "." (periods).
Quote:
Originally Posted by anonlivros
good to know ... I wrote with a focus on the public "poor students in universities with few books". In Brazil, there is a massive public in these conditions.
|
Quote:
Originally Posted by anonlivros
And thanks for reading recommendations. I really enjoyed them.
|
Side Note: You may also like his podcast episode from a few weeks ago:
"Kinsella on Liberty 328 | Heterodorx Ep. 10 with Nina Paley: I.P. Everywhere!"