Quote:
Originally Posted by kovidgoyal
Code:
keep_only_tags = dict(attrs={'class':'asset story clearfix'})

def preprocess_raw_html(self, html, url):
    if '<article class="asset clearfix">' in html:
        self.abort_article()
    return html
Thank you, I will try this. If I also add remove_tags with additional attributes, does it matter where it is placed in the recipe? Does the same apply to auto_cleanup?