View Single Post
Old 04-16-2021, 02:01 PM   #6
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by anonlivros View Post
I didn't know this software ... I'll check it out, thanks!

Do you recommend making these edits with it, to then generate PNG pages that would be further processed by FineReader?
Yes. Use Scan Tailor Advanced as an in-between step.

Instead of:

1. Take your pictures.
2. Use Finereader to crop, dewarp, change to B&W, [...].

You:

1. Take your pictures.
2. Use Scan Tailor Advanced to crop, dewarp, turn images B&W, [...].
3. Feed those into Finereader.

* * *

You can see some example images I posted in Post #15 in the "OCRing + EPUBing my first book: Tips?" thread:

So pictures of pages like this:

or this:

could turn into this:

* * *

Instead, when you only use Finereader's built-in stuff, you get pages like this:

vs.

Of course, those images are easy examples.

But when you have pages that are:
  • crooked or very curved because of the spine
  • uneven lighting
  • very speckled

you'll see how much better Scan Tailor is at those steps.

Plus, with Scan Tailor, you can adjust all the sliders along every step, or even different settings on a per-page basis.

So let's say one page had lots of speckles (tiny dots):

https://www.mobileread.com/forums/at...3&d=1567734681

You could set the despeckling strength to very high, so you might get something like this:

https://www.mobileread.com/forums/at...4&d=1567734681

You can adjust the strength so it catches the speckles, while still leaving the actual "." (periods).

Quote:
Originally Posted by anonlivros View Post
good to know ... I wrote with a focus on the public "poor students in universities with few books". In Brazil, there is a massive public in these conditions.


Quote:
Originally Posted by anonlivros View Post
And thanks for reading recommendations. I really enjoyed them.
Side Note: You may also like his podcast episode from a few weeks ago:

"Kinsella on Liberty 328 | Heterodorx Ep. 10 with Nina Paley: I.P. Everywhere!"

Last edited by Tex2002ans; 04-16-2021 at 02:09 PM.
Tex2002ans is offline   Reply With Quote