Hello,
Many thanks for this terrific plugin. Incredibly handy.
Would it be possible to add some options such that anything the plugin does, such as fix common OCR, PDF export, or HTML errors, can be selectively enabled, that is one has precise control over the enabling of all things? In the case of vector-quality commercial PDF exports, OCR as well, I prefer to use my own regexes and find errors as they occur. Some maybe OCR errors, some may be a common PDF export error, some may be a typo in the original source, etc. ; in each case, I'd prefer to find them myself on the off chance the common fix isn't correct.
In the meantime, I removed the lines of code for my use and I think I got them all as the log didn't report any changes except the ones I wanted.
I recently found and used this plugin solely for hyphenation. On that note, calibre uses the eBook itself, scanning for words and compiling a dictionary. Would you someday consider such a feature? Many works – academic, scientific, and so forth –, may have unique terms, either from the field itself, transliterated from another language, Latin terms, etc. that not in any dictionary. Would be nice to have.

I had first tried calibre but prefer not to convert. In the meantime, I converted the EPUB to text, created a word list, and used that.