View Single Post
Old 02-28-2020, 09:48 PM   #18
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by ctop View Post
Wow, this looks really great, exactly what I had in mind! Awesome! One question though, the file you created has the page breaks at different places than the original, which is astonishing. What is the reason for this?

And one more question, since I like to highlight things in my PDFs, is the text layer the same as before, or does k2pdfopt do its own OCR?

All the best,

Ctop
The default behavior of k2pdfopt in "fitwidth" mode is to concatenate pages as it fits them into the converted PDF, and it disregards page breaks in the source document. You can add the -bp option to force a page break in the converted document wherever there is a page break in the source. There are other options that are better if you prefer to have a 1-to-1 source page to converted page correlation. The k2pdfopt options are documented here.

By default, k2pdfopt keeps the OCR layer from the source PDF, but it can also do its own OCR.

Last edited by willus; 02-28-2020 at 09:52 PM.
willus is offline   Reply With Quote