If you want to check the HTML the script sees, simplpy save it inside preprocess_raw_html something like
Code:
def reprocess_raw_html(self, html, *a):
open('/path/to/somewhere/on/yuour/computer/file.html', 'wb').write(html.encode('utf-8'))
return html