View Single Post
Old 05-16-2025, 08:07 PM   #2119
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,305
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by dhdurgee View Post
I am now wondering if it would be possible to extract the ocr text from the k2pdfopt output file and use it as the starting point to create a text version of the file to create an azw3 file.

Is this possible with k2pdfopt or another pdf tool?
You can extract the OCR text in UTF-8 format using the -ocrout <file> option. See the command-line usage. You might take a look at my PDF conversion tips page, though it's a bit stale.
willus is offline   Reply With Quote