Quote:
Originally Posted by oneillpt
I'll post a new version if I can make these feeds work with the same recipe which now works for the main news feed.
|
Change the keep_only_tags to:
Code:
keep_only_tags = [dict(name='div', attrs={'id':'main-content'}),
dict(name='div', attrs={'class':'contentNewsArticle'})]
and remove the commenting from the remaining feeds.
All sections except politics (Politiikka) extract. As there is no content at present in the Politiikka feed, I hope it too will extract when there is content.