Actually, regarding named entities, I think I am going to have the checker simply ignore them when checking for well-formed XML. Any named entities will be reported separately as an error (once per HTML file, with an option to auto-replace them with unicode characters with a single click).
|