View Single Post
Old 04-07-2009, 12:10 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,407
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
calibre already runs regexps on the output of pdftohtml to detect line endings, but it's been a while since I optimized them, they're in the file html.py in the calibre source code, so you're welcome to suggest enhancements to them
kovidgoyal is offline   Reply With Quote