If it helps, Sigil's gumbo based repair parser library sigilgumbo has a python plugin interface that could be modified to create a xhtml repair parser for a single file that could be extended to create a command line tool if you have any python experience.
|