Quote:
Originally Posted by evanmaastrigt
You can not expect the recipe for The Washington Post to work on this website (or any other website for that matter). You have to examine the HTML and adjust accordingly.
For a start: remove the extra_css and remove_tags properties. Then remove the get_article_url(), print_version() and postprocess_html() methods. Then add the following line
Code:
keep_only_tags = [dict(name = 'div', attrs = {'id': 'storycontentleft'})]
Refine by removing unwanted tags and adding css to your liking.
|
Thanks for the quick advice! I will try that and see what I get.