View Single Post
Old 01-16-2015, 12:17 AM   #971
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by Psyny View Post
Hi Willus, sorry for the delay.

Indeed the idea to use OCR layer to detect columns/regions.
That could help in cases where graphics could mess up region detection.
[...]
Dont know if possible... =P
FYI for now you can already get k2pdfopt to crop around the OCR regions:

k2pdfopt -cbox 0x,0x,1x,1x myfile.pdf


Will trim each page exactly around the OCR text. See the -h usage in the command-line options. If you want a little pad around the OCR text:

k2pdfopt -cbox -.05x,-.05x,1.1x,1.1x myfile.pdf
willus is offline   Reply With Quote