Now that I found the
problem, it would be nice to find a
solution also, right? ;-)
The one I'm using (downloading the index.file and manually removing the offending DOCTYPE declaration), is a bit.... boring.
So I've tried this...
Quote:
preprocess_regexps = [
(re.compile(r'<!DOCTYPE html .*strict.dtd">', re.DOTALL|re.IGNORECASE),
lambda match: '<!DOCTYPE html>'),
]
|
...but nothing happens.