11-30-2008, 03:54 AM   #9
AprilHare
Wizard
Posts: 2,981
Karma: 11862367
Join Date: Apr 2008
Device: Sony Reader PRS-T2
Quote:
Originally Posted by kovidgoyal
It isn't the size then. Try running htmltidy on the HTML file and then converting it.
Tried running it through HTML Tidy, got the following:
Quote:
line 1 column 1 - Warning: missing <!DOCTYPE> declaration
line 976 column 1 - Warning: missing <li>
line 1087 column 1 - Warning: missing </ul> before </li>
line 4625 column 1 - Warning: discarding unexpected </li>
line 4866 column 1 - Warning: inserting implicit <p>
line 5286 column 1 - Error: <l> is not recognized!
line 5286 column 1 - Warning: discarding unexpected <l>
line 5286 column 4 - Warning: missing <li>
line 5287 column 1 - Warning: discarding unexpected </l>
line 6259 column 1 - Error: <h> is not recognized!
line 6259 column 1 - Warning: discarding unexpected <h>
line 6260 column 1 - Warning: discarding unexpected </h>
line 6295 column 1 - Error: <l> is not recognized!
line 6295 column 1 - Warning: discarding unexpected <l>
line 6295 column 4 - Warning: missing <li>
line 6296 column 1 - Warning: discarding unexpected </l>
line 6909 column 1 - Error: <l> is not recognized!
line 6909 column 1 - Warning: discarding unexpected <l>
line 6909 column 4 - Warning: missing <li>
line 6910 column 1 - Warning: discarding unexpected </l>
line 11400 column 1 - Error: <l> is not recognized!
line 11400 column 1 - Warning: discarding unexpected <l>
line 11400 column 4 - Warning: missing <li>
line 11401 column 1 - Warning: discarding unexpected </l>
line 11730 column 1 - Warning: discarding unexpected </li>
line 14141 column 45 - Warning: discarding unexpected </ul>
line 2 column 1 - Warning: inserting missing 'title' element
line 29 column 1 - Warning: trimming empty <p>
line 36 column 1 - Warning: trimming empty <p>
line 2552 column 1 - Warning: trimming empty <p>
line 2631 column 1 - Warning: trimming empty <p>
line 3449 column 1 - Warning: trimming empty <p>
line 3973 column 1 - Warning: trimming empty <p>
line 4866 column 1 - Warning: trimming empty <p>
line 5049 column 1 - Warning: trimming empty <p>
line 6186 column 1 - Warning: trimming empty <p>
line 6190 column 1 - Warning: trimming empty <p>
line 6234 column 1 - Warning: trimming empty <p>
line 7177 column 1 - Warning: trimming empty <p>
line 7989 column 1 - Warning: trimming empty <p>
line 7995 column 1 - Warning: trimming empty <p>
line 9335 column 1 - Warning: trimming empty <p>
line 9413 column 1 - Warning: trimming empty <p>
line 9612 column 1 - Warning: trimming empty <p>
line 12646 column 1 - Warning: trimming empty <p>
line 12650 column 1 - Warning: trimming empty <p>
line 12655 column 1 - Warning: trimming empty <p>
line 13582 column 1 - Warning: trimming empty <p>
line 13940 column 1 - Warning: trimming empty <p>
Info: Document content looks like HTML 4.01 Transitional
Info: No system identifier in emitted doctype
44 warnings, 5 errors were found!

This document has errors that must be fixed before
using HTML Tidy to generate a tidied up version.

To learn more about HTML Tidy see http://tidy.sourceforge.net
Please fill bug reports and queries using the "tracker" on the Tidy web site.
Additionally, questions can be sent to html-tidy@w3.org
HTML and CSS specifications are available from http://www.w3.org/
Lobby your company to join W3C, see http://www.w3.org/Consortium
I took the HTML file, resaved it with OpenOffice.org 3, ran the resulting compliant HTML document through HTML Tidy, then converted it. The LRF file came out significantly larger, and there was no obvious improvement in viewability.
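For anyone who would rather fix the original file than round-trip it through OpenOffice.org: all five errors in the Tidy report come from two unrecognized tags, <l> (four times) and <h> (once). Below is a minimal sketch, not something tried in this thread, that assumes those tags are stray typos which can simply be dropped while keeping whatever is inside them. The function name and the sample string are made up for illustration, and whether removing the tags changes how the LRF renders is unknown.

```python
import re

# Tag names taken from the Tidy report ("<l> is not recognized!",
# "<h> is not recognized!"). Treating them as safe to drop is an assumption.
UNKNOWN_TAGS = ("l", "h")


def strip_unknown_tags(html, tags=UNKNOWN_TAGS):
    """Remove the opening and closing forms of the given tags, keeping their contents."""
    names = "|".join(re.escape(t) for t in tags)
    # Matches <l>, </l> and <l attr="x">, but not <li>, <link>, <h1>, <hr> or <html>,
    # because the tag name must be followed by whitespace or the closing '>'.
    pattern = re.compile(r"</?(?:%s)(?:\s[^>]*)?>" % names, re.IGNORECASE)
    return pattern.sub("", html)


if __name__ == "__main__":
    # Hypothetical fragment imitating the kind of markup Tidy complained about.
    sample = "<ul><l><li>one</li></l></ul><h>Title</h><h1>Keep</h1>"
    print(strip_unknown_tags(sample))
```

After running the real file through something like this, HTML Tidy should no longer stop on those five errors, though the remaining warnings (missing <li>, unexpected </li>, and so on) would still need a look.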