You know, at some point, it might be worth writing an EPUB checker script that scans EPUB books for any content that is likely to tickle any of the known reader bugs, and provides helpful tips for working around those bugs.
The only real question is whether to write it in C using libxml2 or in something more obscene and unmaintainable, like node.js with jsdom, or Perl with a handwritten XML parser.