Quote:
Originally Posted by kovidgoyal
If you care about speed use the extract_text() function from calibre.db.fts.text
Thank you. That looks like it does what I want. It does not produce exactly the same results as ebook-convert, but ignoring formatting and whitespace issues it is extremely close. I tried it on about 1000 books: the worst case was still 99% similar, and the vast majority were over 99.9% similar.
It also takes only about 1/20 of the time that calling ebook-convert does.
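For anyone wanting to run a similar check, here is a minimal sketch of one way to score two plain-text outputs while ignoring whitespace differences. It is only an illustration, not necessarily how the figures above were measured: the function name similarity() is made up for this example, and it assumes both outputs have already been read into Python strings.

```python
import difflib


def similarity(text_a, text_b):
    """Return a ratio in [0, 1] comparing two texts word by word.

    Hypothetical helper for illustration. Splitting on whitespace means
    differences in spacing, indentation and line breaks are ignored.
    """
    words_a = text_a.split()
    words_b = text_b.split()
    # autojunk=False disables the "popular element" heuristic, which can
    # otherwise distort the score on long texts with many repeated words.
    matcher = difflib.SequenceMatcher(None, words_a, words_b, autojunk=False)
    return matcher.ratio()
```

Comparing word lists rather than raw characters keeps the score insensitive to layout. SequenceMatcher can be slow on very long inputs, so for whole books it may be more practical to compare chapter by chapter.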
Thanks again.