12-14-2015, 02:39 PM   #20
paddyrm
Connoisseur
Posts: 69
Karma: 10
Join Date: Oct 2012
Device: Kindle 3

Kieran, if you add a custom news source and customise the built-in recipe for The Guardian and Observer, you'll find Kovid has done most of the work for you (see message #12 in this thread). Scroll down to the bottom of the recipe and you will see he has added the Sport section. It looks like this:

def parse_index(self):
    feeds = self.parse_section(self.base_url)
    feeds += self.parse_section('http://www.theguardian.com/uk/sport', 'Sport - ')
    return feeds

I don't want the Sport section, so I took that line out and replaced it with the sections I do want, e.g. Travel. But I had to add the dates, otherwise the file is enormous (it swells from 7 MB to 72 MB!), I assume because it scrapes everything it finds. The date refers to a specific issue, e.g. 2015/dec/12 for last Saturday.
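
If you'd rather not type the date by hand each week, here is a rough sketch of how that date string could be built in Python. The helper names issue_date_path and last_saturday are made up for this example and are not part of the recipe, and the zero-padded day is my assumption about the format. Where the string goes in each section URL depends on the section, so I've left that part out.

```python
import datetime

# Lower-case month abbreviations, spelled out so the result does not
# depend on the system locale.
MONTHS = ['jan', 'feb', 'mar', 'apr', 'may', 'jun',
          'jul', 'aug', 'sep', 'oct', 'nov', 'dec']


def issue_date_path(day):
    """Format a date like '2015/dec/12' (hypothetical helper).

    Assumption: single-digit days are zero-padded, e.g. '2015/dec/05'.
    """
    return '%d/%s/%02d' % (day.year, MONTHS[day.month - 1], day.day)


def last_saturday(today):
    """Return the most recent Saturday on or before 'today' (hypothetical helper)."""
    # date.weekday(): Monday is 0, so Saturday is 5.
    days_back = (today.weekday() - 5) % 7
    return today - datetime.timedelta(days=days_back)


if __name__ == '__main__':
    print(issue_date_path(last_saturday(datetime.date.today())))
```

Run on the day of this post (Monday 14 December 2015), last_saturday gives 12 December, which issue_date_path turns into 2015/dec/12.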

Does that help, or make it worse?

Paddy