Quote:
Originally Posted by kovidgoyal
I just meant to give you an idea how to do it. Basically pdftohtml preserves line breaks using <br> elements. These need to be removed intelligently (based on line length) and two consecutive <br> elements become a new paragraph.
|
Actually the double br should be replaced with \p p and the rest removed for the ebookwise. then maybe clean up the first and last paragraph manually.
Dale