After I got things sorted out with your tool I had yet another thought occur to me. I passed the original scanned image document along with some guidance to the Grok AI engine and asked it to do the OCR for me detecting the images and inserting placeholders for them in the output. This was after trying to get it to do a full conversion to an epub format, which it turned out was beyond its capabilities. It was, however, able to give me the files comprising the epub piecemeal so that I could assemble the epub manually.
I hit some issues with limitations of grok.com and had to do the processing in pieces, but I now have a final epub of that document with images that is about 1/3 the size of your optimized output.
I understand that the Grok API is available to developers and can tell you that its OCR capabilities were great. It might be worth your time to investigate if it could be useful to you in a later release of your tool.
Dave
|