Kieran, if you add a custom news source, customise built in recipe for The Guardian and Observer, Kovid has done most of the work for you (see message #12 in this thread). Scan down to the bottom of the recipe and you will see he has added the Sports section, it looks like this:
def parse_index(self):
feeds = self.parse_section(self.base_url)
feeds += self.parse_section('http://www.theguardian.com/uk/sport', 'Sport - ')
return feeds
I don't want the Sports section, so I took that out and replaced it with the sections I do want, eg Travel. But I had to add the dates, or the file is enormous (it swells from 7Mb to 72Mb!) because I assume it scrapes everything it finds. The date relates to a specific issue, eg 2015/dec/12 for last Saturday.
Does that help, or make it worse?
Paddy
|