Quote:
Originally Posted by DNSB
The problem is that the same issues with lxml mentioned in Kovid's bug report are still present in your modification, which simply adds the direct RECOVER_PARSER definition (and a second instance of the "from lxml import etree" line). I looked at the parse_html documentation and examples, and it's a lot more work than I am able or willing to do, not to mention the need to test those changes.
parse_html is fine for HTML; RECOVER_PARSER is for XML. Kovid has now added to his message which function you can use for XML.