Old 01-18-2025, 09:23 AM   #2099
willus
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by tetraxi33 View Post
I mean I couldn't make the software detect both the image and the text next to it as separate blocks. So the text next to the image is always in small font size. I beg your help. Thanks beforehand.
Okay--I made a PDF of your screen shot (at 96 dpi) and was able to get k2pdfopt to parse it the way you want with this command:

Code:
k2pdfopt -sm -crgh .01 -gtr .02 -f2p -1 sample.pdf
The -sm option just creates a PDF showing how k2pdfopt parsed the page--this is a useful debugging tool.

The -crgh option default is 1/72 inch (about .014). I lowered it to .01. This makes it easier to detect a 2-column sub-section of the document. See this page for how it is used.

The -gtr option makes it easier for k2pdfopt to detect the row breaks since your text lines are kind of crowded together. The default value is .006. The higher value means there can be more "black" in the gaps between rows--they don't have to have pristinely all-white gaps between them.
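
To illustrate the idea behind the threshold, here is a toy Python sketch. It is not k2pdfopt's actual code: the function name find_row_gaps, the sample numbers, and the assumption that the threshold is compared against the fraction of black pixels in each pixel row are all mine, for illustration only.

```python
def find_row_gaps(row_black_fraction, gap_threshold):
    """Return (start, end) index pairs of consecutive pixel rows whose
    black fraction is at or below gap_threshold (hypothetical model)."""
    gaps = []
    start = None
    for i, frac in enumerate(row_black_fraction):
        if frac <= gap_threshold:
            # This pixel row is "white enough" to count as part of a gap.
            if start is None:
                start = i
        elif start is not None:
            # A text row ends the current gap.
            gaps.append((start, i - 1))
            start = None
    if start is not None:
        gaps.append((start, len(row_black_fraction) - 1))
    return gaps


# Made-up black fractions per pixel row: two crowded text lines whose
# gap (indices 2-3) is not perfectly white, then a clean gap (indices 6-7).
rows = [0.30, 0.25, 0.010, 0.015, 0.28, 0.31, 0.0, 0.0]

print(find_row_gaps(rows, 0.006))  # stricter threshold
print(find_row_gaps(rows, 0.02))   # more lenient threshold
```

With the stricter value only the clean gap is found, so the two crowded lines merge into one row. With the more lenient value the slightly "dirty" gap between them is also detected.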

Finally, -f2p -1 shrinks the figures onto a single page if they are extra tall.
Attached Files
File Type: pdf sample.pdf (129.3 KB, 290 views)
File Type: pdf sample_k2opt.pdf (646.4 KB, 313 views)
File Type: pdf sample_marked.pdf (373.3 KB, 299 views)