MobileRead Forums - View Single Post - k2pdfopt: optimizes PDFs for viewing on e-readers

willus · 05-16-2025, 08:07 PM

Quote:

Originally Posted by dhdurgee

I am now wondering if it would be possible to extract the ocr text from the k2pdfopt output file and use it as the starting point to create a text version of the file to create an azw3 file.

Is this possible with k2pdfopt or another pdf tool?

You can extract the OCR text in UTF-8 format using the -ocrout <file> option. See the command-line usage. You might take a look at my PDF conversion tips page, though it's a bit stale.