Quote:
Originally Posted by richasta
|
Plucker deals with clean, proper HTML, not the horribly-broken and invalid constructs that are represented on that site.
This is especially important when dealing with XML, because the spec itself says that ANY error in XML should immediately throw a fatal error in the parser... as it does with Plucker.
The result is that you'll either have to tell them to clean up their HTML, or clean it up yourself in an inline filter or parse the pages locally with something like tidy or similar tools.