Quote:
Originally Posted by llasram
- There should be no extraneous spaces. ConvertLIT attempts to pretty-print HTML as it extracts it, but frequently inserts whitespace where it doesn't belong.
|
Nice! I had a LIT file where ConvertLIT had this problem.
However, the downside of your change is that the resulting HTML file often has very long lines and is hard to read. Two suggestions:
1. Automatically replace "> <" with ">\n<". Notice the space between > and <. (\n = line break)
I suggested this for mobi2oeb too and it has been accepted.
2. Make line breaks where it's safe to do them, e.g. after "</p>" and "</h1>" ...
This is true for the resulting OPF as well, by the way.
Nice work so far, I'll use your script to hunt down bugs.