Quote:
Originally Posted by Dereks
Somehow remove_tags_before doesn't do its job: the feed remains polluted with the stuff.
Usually that just means the tag you think comes first/last isn't where you think it is in the page. Print out the soup and look at it with:
Code:
def preprocess_html(self, soup):
    # Dump the parsed page so you can see where the tags really are
    print('The soup is: %s' % soup)
    return soup
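If the full dump is too long to read comfortably, a rough outline of the tags in document order can make it easier to spot where the tag you are targeting actually sits. The sketch below is only an illustration using Python's standard html.parser; the names TagOutline and outline_tags and the sample markup are made up for this example and are not part of calibre.

```python
from html.parser import HTMLParser


class TagOutline(HTMLParser):
    """Collect opening tags in document order, with their id/class attributes."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        label = tag
        if attrs.get('id'):
            label += '#' + attrs['id']
        if attrs.get('class'):
            label += '.' + attrs['class'].replace(' ', '.')
        self.tags.append(label)


def outline_tags(html):
    """Hypothetical helper: return labels such as 'div#header' for each opening tag."""
    parser = TagOutline()
    parser.feed(html)
    parser.close()
    return parser.tags


# Made-up sample page; in practice you would pass in the markup of the real page.
sample = ('<html><body><div id="header">ads</div>'
          '<div class="story main"><p>text</p></div></body></html>')
for position, label in enumerate(outline_tags(sample)):
    print(position, label)
```

Inside a recipe you could probably call outline_tags(str(soup)) from preprocess_html, assuming the soup converts to markup with str(). If the tag named in remove_tags_before does not show up in the outline, or shows up later than expected, that would explain why the unwanted material survives.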