Quote:
Originally Posted by dhdurgee
I am now wondering if it would be possible to extract the ocr text from the k2pdfopt output file and use it as the starting point to create a text version of the file to create an azw3 file.
Is this possible with k2pdfopt or another pdf tool?
Yes. k2pdfopt can write the OCR text to a UTF-8 text file with the -ocrout <file> option; see the
command-line usage for details. You might also take a look at my
PDF conversion tips page, though it's a bit stale.
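As a rough sketch of the whole workflow, the Python below builds the two command lines involved: a k2pdfopt run that adds -ocrout, and a conversion of the resulting text file to azw3. Several things here are assumptions rather than anything from the k2pdfopt docs: the conversion step uses Calibre's ebook-convert tool (not part of k2pdfopt), the function names and file names are made up for illustration, options are placed before the input file name, and extra_args stands in for whatever OCR and device options you already pass to k2pdfopt. Check the command-line usage before relying on it.

```python
import subprocess


def k2pdfopt_ocr_command(src_pdf, text_out, extra_args=()):
    """Build a k2pdfopt command that also writes the OCR text to text_out.

    extra_args is a placeholder for the options you already use
    (OCR settings, device profile, etc.); none are assumed here.
    """
    return [
        "k2pdfopt",
        *extra_args,
        "-ocrout", str(text_out),  # the option named in the reply above
        str(src_pdf),              # assumption: input file given last
    ]


def ebook_convert_command(text_in, azw3_out):
    """Build a Calibre ebook-convert command (assumes Calibre is installed).

    ebook-convert picks the input and output formats from the file extensions.
    """
    return ["ebook-convert", str(text_in), str(azw3_out)]


def run_pipeline(src_pdf, text_out, azw3_out, extra_args=()):
    """Run both steps in order; raises CalledProcessError if either fails."""
    subprocess.run(k2pdfopt_ocr_command(src_pdf, text_out, extra_args), check=True)
    subprocess.run(ebook_convert_command(text_out, azw3_out), check=True)
```

Calling run_pipeline("book.pdf", "book.txt", "book.azw3") would run k2pdfopt first and then the conversion. The raw OCR text will probably need cleanup (page headers, hyphenation, stray line breaks) before the azw3 reads well, so in practice you may want to edit book.txt between the two steps.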