MobileRead Forums - View Single Post

tomsem · 01-25-2023, 04:27 PM

The ABBYY OCR that comes with CZUR is much better than Acrobat's (or else I cannot figure out how to tell Acrobat to stop detecting Chinese text inline with the specified English). But at least with default options, the PDF it created had pages of varying sizes even though the images all had the same dimensions. It doesn't have that issue when creating a PDF w/o OCR. Hopefully there's some tweak I can discover, or it's something they can fix with an update.

Removing page curl seems to require some way of determining the 3D shape of the page. The Fujitsu scanner has stereo 'vision', and CZUR has lasers that draw lines across the material, which it uses to determine the curl.

This is apparently beyond the scope of DIY, which is why the effort to flatten pages is necessary.

I can't find any open source code that can take raw images of books with page curl and flatten them. LIDAR (e.g. iPhone) probably isn't of sufficient resolution to help much with this. You could probably get somewhere on pages that are mostly horizontal text by using some CV library to extract the curves of the text lines and applying an appropriate transform to straighten them.
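To make the text-line idea a bit more concrete, here is a rough sketch in Python with NumPy. It is only an illustration under heavy assumptions, not a working dewarper: it assumes you already have a binarized image containing a single text line, that a low-degree polynomial describes the curl well enough, and that shifting columns up and down is an acceptable transform (it ignores the horizontal squeeze near the gutter). The function names `fit_text_curve` and `straighten` are made up for this example.

```python
import numpy as np

def fit_text_curve(ink, degree=2):
    """Fit y = f(x) through the mean ink row of every column that has ink.

    `ink` is a 2D boolean mask (True = dark pixel). Assumes one text line.
    """
    xs, ys = [], []
    for x in range(ink.shape[1]):
        rows = np.flatnonzero(ink[:, x])
        if rows.size:
            xs.append(x)
            ys.append(rows.mean())
    return np.polyfit(xs, ys, degree)

def straighten(ink, coeffs):
    """Shift each column vertically so the fitted curve lands on one row."""
    height, width = ink.shape
    curve = np.polyval(coeffs, np.arange(width))
    target = int(round(curve.mean()))  # row the line should end up on
    flat = np.zeros_like(ink)
    for x in range(width):
        shift = int(round(target - curve[x]))
        rows = np.flatnonzero(ink[:, x]) + shift
        rows = rows[(rows >= 0) & (rows < height)]  # drop pixels pushed off the page
        flat[rows, x] = True
    return flat

# Synthetic "curled" text line: a one-pixel-thick parabola sagging 10 rows.
x = np.arange(100)
page = np.zeros((40, 100), dtype=bool)
page[np.round(10 + 0.004 * (x - 50) ** 2).astype(int), x] = True

flat = straighten(page, fit_text_curve(page))
```

A real page would need binarization, detection of each text line separately, and interpolation of the per-line curves into a full-page warp (something like OpenCV's `remap` would be the usual tool for that last step), so treat this as a starting point for experiments only.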
