View Single Post
Old 10-24-2022, 12:03 AM   #4
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,489
Karma: 28005164
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
If you want to extract text from the pdf calibre includes the pdftohtml and pdfttotext binaries form the poppler project, use those. See pdftohtml.py in the calibre source for examples of how to use.
kovidgoyal is offline   Reply With Quote