10-14-2011, 11:16 AM   #4
oneillpt
Quote:
Originally Posted by oneillpt
I'll post a new version if I can make these feeds work with the same recipe which now works for the main news feed.
Change the keep_only_tags to:

Code:
keep_only_tags = [dict(name='div', attrs={'id':'main-content'}),
    dict(name='div', attrs={'class':'contentNewsArticle'})]
and uncomment the remaining feeds in the recipe.

All sections except politics (Politiikka) now extract. The Politiikka feed has no content at present, so I could not test it, but I hope it too will extract once it has content.
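
For anyone wondering what the two entries in keep_only_tags do, here is a rough standalone sketch using only the Python standard library. It is not calibre's own code (calibre does the matching internally on the parsed page), and the helper names matches, KeepOnlyText and kept_text, as well as the sample HTML, are made up for illustration. It assumes a simplified rule: an element is kept if its tag name and every listed attribute match exactly, and anything outside a matching element is dropped.

```python
from html.parser import HTMLParser

# Selectors copied from the post above.
keep_only_tags = [dict(name='div', attrs={'id': 'main-content'}),
                  dict(name='div', attrs={'class': 'contentNewsArticle'})]

# Tags with no closing tag; skipped so the open-element stack stays balanced.
VOID_TAGS = {'br', 'img', 'hr', 'input', 'meta', 'link'}


def matches(spec, tag, attrs):
    """Return True if a start tag satisfies one name/attrs selector.

    Simplification (an assumption of this sketch): attribute values
    must be equal to the selector value exactly.
    """
    if spec['name'] != tag:
        return False
    return all(attrs.get(k) == v for k, v in spec['attrs'].items())


class KeepOnlyText(HTMLParser):
    """Collect text found inside any element matching a selector."""

    def __init__(self, specs):
        super().__init__()
        self.specs = specs
        self.stack = []   # one bool per open element: did it match?
        self.kept = []    # text fragments that survive the filter

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attr_map = dict(attrs)
        self.stack.append(any(matches(s, tag, attr_map) for s in self.specs))

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.stack:
            self.stack.pop()

    def handle_data(self, data):
        # Keep text only while at least one enclosing element matched.
        if any(self.stack) and data.strip():
            self.kept.append(data.strip())


def kept_text(html, specs=keep_only_tags):
    parser = KeepOnlyText(specs)
    parser.feed(html)
    return parser.kept


# Hypothetical page: one sidebar div and one article div.
SAMPLE = ('<html><body><div id="sidebar">Ads</div>'
          '<div class="contentNewsArticle"><p>Story text</p></div>'
          '</body></html>')
print(kept_text(SAMPLE))
```

The point of listing both selectors is that a page only needs to match one of them, which is presumably why the same recipe can serve feeds whose articles use either the main-content id or the contentNewsArticle class.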