03-21-2015, 05:28 PM   #5
KevinH
Sigil Developer
Hi,
I think I could throw together a Python plugin that walks the complete set of XHTML files and builds a report of any that are not well-formed, with a description of at least the first error in each such file. Would this do the trick?
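
To make the idea concrete, here is a rough sketch of the checking logic only. It is not the actual plugin and does not use the Sigil plugin interface: it assumes the XHTML files have been unpacked into an ordinary folder, and the names `first_error` and `report` are made up for illustration. It relies on the standard library XML parser, which stops at the first well-formedness error, so that is the one that gets reported. Since no DTD is loaded, named entities such as `&nbsp;` may be flagged as errors too.

```python
from pathlib import Path
import xml.etree.ElementTree as ET


def first_error(data):
    """Return (line, column, message) for the first well-formedness
    error in the given XHTML bytes, or None if the data parses cleanly."""
    try:
        ET.fromstring(data)
    except ET.ParseError as err:
        # ParseError.position is a (line, column) tuple; the parser
        # stops at the first error it encounters.
        line, column = err.position
        return line, column, str(err)
    return None


def report(root_dir):
    """Walk root_dir recursively and map each .xhtml file that is not
    well-formed to its first error. (Hypothetical helper, not Sigil API.)"""
    problems = {}
    for path in sorted(Path(root_dir).rglob("*.xhtml")):
        err = first_error(path.read_bytes())
        if err is not None:
            problems[str(path)] = err
    return problems
```

Calling `report()` on the folder of unpacked files returns a dictionary keyed by file path, so printing one line per entry gives the kind of report described above. Files that parse cleanly are simply left out.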

BTW: We have already removed Tidy and will use Google's gumbo-parser to automatically clean up any files that are not well-formed in the future. Gumbo implements the actual HTML5 parsing spec and will handle the HTML exactly as browsers do. Gumbo is basically like Beautiful Soup, but written in C and really fast.

Last edited by KevinH; 03-21-2015 at 05:34 PM.