Old 01-07-2015, 09:24 AM   #966
Psyny
Member
Psyny can solve quadratic equations while standing on his or her head reciting poetry in iambic pentameter
 
Posts: 10
Karma: 12848
Join Date: Dec 2014
Device: Kindle Paperwhite
Quote:
Originally Posted by willus View Post
You've done a great job experimenting with the options. I'm impressed. The cropbox option (-cbox) is the most powerful option because you can specify custom individual regions, and they can be page specific. E.g., try this in the "Additional options" box (keeping your other settings--e.g. margins--how they are):

-cbox1 0s,0s,0.5s,1s -cbox1 .5s,0s -cbox2- 0,0

The above options select two crop boxes on page 1 (-cbox1)--the left and right halves of the page. The -cbox2- 0,0 selects the full page for all of the rest of the pages (2 and up).

I realize this is painful because you have to manually do the regions yourself. There's no automatic parsing that works for all of the pages of your particular PDF because the graphic at the bottom of page 1 screws up the column detection, as you've shown.

I may eventually give the -grid option the ability to be page specific.
Thanks. I will try out cbox.
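
For anyone else trying this, here is a minimal sketch of how the option string from the quote could be put together for a page with several equal-width columns. The helper names (`column_cboxes`, `build_options`) are made up for illustration and are not part of k2pdfopt. It also assumes that spelling out the width and height of every box (e.g. `0.5s,0s,0.5s,1s`) is equivalent to the shorter `.5s,0s` form used in the quote.

```python
def fmt(x):
    # Compact number formatting: 0.0 -> "0", 0.5 -> "0.5", 1.0 -> "1"
    return f"{x:g}"


def column_cboxes(page, n_columns):
    """Return one -cbox option per equal-width vertical strip of `page`.

    Hypothetical helper: it only formats strings in the
    left,top,width,height form shown in the quoted post, using the
    's' suffix from that post for every value.
    """
    width = 1.0 / n_columns
    opts = []
    for i in range(n_columns):
        left = i * width
        opts.append(f"-cbox{page} {fmt(left)}s,0s,{fmt(width)}s,1s")
    return opts


def build_options(first_page_columns=2):
    # Page 1 is split into columns; "-cbox2- 0,0" (taken verbatim from
    # the quote) selects the full page for page 2 and up.
    opts = column_cboxes(1, first_page_columns)
    opts.append("-cbox2- 0,0")
    return " ".join(opts)


if __name__ == "__main__":
    print(build_options())
```

The printed string would go into the "Additional options" box, with the other settings left as they are.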

In the case above, I found that adjusting -wt helped a little:
[Image violates guidelines for size - MODERATOR]


One day, an option that lets k2pdfopt ignore areas without OCR/text markings when defining columns would be great.

Something like:
-corc[+] [i|t] <inches>

Where:
<inches> : maximum distance k2pdfopt will search from one OCR/text marking to the next when defining a column
+ : also process areas that have no OCR/text markings
i : include in the column the area within <inches> around the markings
t : exclude from the column the area within <inches> around the markings
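
To make the idea a bit more concrete, here is a rough sketch of the grouping I have in mind. It is not k2pdfopt code and -corc does not exist; `group_markings` and its parameters are invented for illustration. It assumes the text markings are already available as horizontal (start, end) spans in inches, and it does not model the + flag.

```python
def group_markings(spans, max_gap, include_margin=False):
    """Merge horizontal text-marking spans into column extents.

    spans          : list of (start, end) positions in inches
    max_gap        : plays the role of <inches>; spans separated by
                     at most this distance fall in the same column
    include_margin : True mimics the proposed 'i' flag (pad each column
                     by max_gap); False mimics 't' (no padding)
    """
    columns = []
    for start, end in sorted(spans):
        if columns and start - columns[-1][1] <= max_gap:
            # Close enough to the previous span: extend that column.
            columns[-1][1] = max(columns[-1][1], end)
        else:
            # Gap too large (or first span): start a new column.
            columns.append([start, end])
    if include_margin:
        # Pad each column, clamping the left edge at the page edge (0).
        columns = [[max(0.0, s - max_gap), e + max_gap] for s, e in columns]
    return [tuple(c) for c in columns]


if __name__ == "__main__":
    marks = [(0.5, 1.0), (1.1, 3.0), (4.0, 6.5)]
    print(group_markings(marks, 0.25))
    print(group_markings(marks, 0.25, include_margin=True))
```

Anything on the page that carries no markings, such as a graphic, never enters `spans`, so it cannot affect where the columns are split.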

I know it's a lot to ask; it's just an idea.

Last edited by Dr. Drib; 06-22-2015 at 02:11 PM.