Old 02-16-2023, 09:00 AM   #10
sgmoore
Quote:
Originally Posted by kovidgoyal
If you care about speed use the extract_text() function from calibre.db.fts.text
Thank you. That looks like it does what I want. It does not produce exactly the same results as ebook-convert, but ignoring formatting and white-space issues it is extremely close. I tried it on about 1000 books; the worst case was still 99% similar, and the vast majority were over 99.9% similar.
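
For anyone who wants to repeat that kind of comparison, here is a minimal sketch of one way to score two extracted texts while ignoring white-space differences. It is only an illustration: the helper names normalize() and similarity() are made up for this example, and difflib's ratio is an assumed stand-in for the similarity measure, not necessarily the one behind the figures above. The two input strings would be whatever extract_text() and ebook-convert produced for the same book.

```python
import difflib
import re


def normalize(text: str) -> str:
    # Collapse every run of white-space to a single space and trim the ends,
    # so line wrapping and indentation differences do not count.
    return re.sub(r"\s+", " ", text).strip()


def similarity(a: str, b: str) -> float:
    # Hypothetical helper: returns a ratio between 0.0 and 1.0,
    # where 1.0 means the texts are identical after normalization.
    matcher = difflib.SequenceMatcher(None, normalize(a), normalize(b), autojunk=False)
    return matcher.ratio()


if __name__ == "__main__":
    # Same words, different white-space
    print(similarity("Chapter 1\n\n  It was a dark night.", "Chapter 1 It was a dark night."))
```

SequenceMatcher can be slow on whole books, so for a large library it may be worth comparing word lists or samples instead of full character strings.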

It also takes only about 1/20 of the time that a call to ebook-convert does.

Thanks again.