Thanks for your answer!
The cheating way isn't practical for me (it would be incredibly tedious) because I want to convert more than 100 PDFs :P
And since the non-cheating way doesn't put the text in the right order, I'll have to stick with the original PDFs.
Quote:
Originally Posted by willus
I'll try to think about ways I could get k2pdfopt to handle cases like this better. I may even have some options in there to help with this, but I'm not remembering...
What I can think of is this:
Would it work better if there were no picture, just a blank spot? I mean, maybe it would be easier for the software to "see" the columns if it first extracted the image and only checked for columns afterwards. I see that with your command line you manage to define the image box pretty well (I tried it on a bunch of other PDFs; sometimes the box also picks up a few lines of text, but many times it's just the image). Maybe if that were polished so that no text ends up in the image box, you could first pull the image out of the page, then convert the text, and finally put the image back somewhere (or not).
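To show what I mean, here is a toy sketch in Python. It is only my guess at the idea and has nothing to do with how k2pdfopt actually works: the page is a made-up grid of 0 (white) and 1 (ink), and `blank_region`, `column_gaps`, `demo_page` and `IMAGE_BOX` are names I invented for the example. An image that straddles the gutter hides the gap between the two columns; once the image box is painted white, the gap shows up.

```python
def blank_region(page, box):
    """Return a copy of page with box painted white (0).

    box is (top, left, bottom, right), with bottom and right exclusive.
    """
    top, left, bottom, right = box
    out = [row[:] for row in page]
    for y in range(top, bottom):
        for x in range(left, right):
            out[y][x] = 0
    return out


def column_gaps(page, min_width=2):
    """Find runs of completely white pixel columns at least min_width wide.

    Returns a list of (start, end) pairs, end exclusive.
    """
    width = len(page[0])
    # ink[x] is True if any row has ink in pixel column x
    ink = [any(row[x] for row in page) for x in range(width)]
    gaps = []
    start = None
    for x in range(width + 1):
        white = x < width and not ink[x]
        if white and start is None:
            start = x
        elif not white and start is not None:
            if x - start >= min_width:
                gaps.append((start, x))
            start = None
    return gaps


def demo_page():
    """Hypothetical 12x6 page: two text columns plus an image across the gutter."""
    page = [[0] * 12 for _ in range(6)]
    for y in range(2, 6):
        for x in list(range(1, 5)) + list(range(7, 11)):
            page[y][x] = 1  # text in columns 1-4 and 7-10
    for y in range(0, 2):
        for x in range(3, 9):
            page[y][x] = 1  # image covering the gutter (columns 5-6)
    return page


IMAGE_BOX = (0, 3, 2, 9)  # assumed to come from an earlier image-detection step

if __name__ == "__main__":
    page = demo_page()
    print(column_gaps(page))                           # [] (image hides the gutter)
    print(column_gaps(blank_region(page, IMAGE_BOX)))  # [(5, 7)]
```

Of course this only works if the image box is tight; if it swallows some text lines too, those would be blanked along with the image.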
I don't really know how your software works and I'm not a programmer, so I doubt this will help you much, but I had to try :P
Thank you again,
m