A different language plus specialized scientific terms for which perhaps I couldn't find a dictionary.
Exported PDF to EPUB with one of the numerous cheapo apps. Results not bad though took a few days to get into decent shape and could take months to fix. A work that is worth it to me as I'll study the subject as I fix.
I was able to take the EPUB, convert to txt in calibre, make a calibre custom dictionary, and use the calibre function from above. That takes care of most terms. There's still word breaks between paragraphs e.g. "some-" next paragraph "day". I haven't been able to figure out a regex for that.
Possibly I'll do more PDF conversions and there are professional apps that publishers use to import to a desktop publishing app, e.g. perhaps if they are left with only some print, old proof, and need to reprint to revise. I haven't tried such yet I imagine that they'd significantly reduce effort and perhaps if publishers rely on them, they might be not bad.
|