Quote:
Originally Posted by Starson17
Yes.
Do something like:
Code:
INDEX2 = 'http://sportsillustrated.cnn.com/vault/cover/home/index.htm'
followed by changing
Code:
soup = self.index_to_soup(self.INDEX)
to
Code:
soup = self.index_to_soup(self.INDEX2)
in parse_index
Than change
Code:
cover = soup.find('div', attrs = {'alt' : 'Read All Articles', 'style' : 'vertical-align:bottom;'})
if cover:
currentIssue = cover.parent['href']
to whatever is needed to produce the currentIssue.
|
I think the old problem may have cropped up again? Or a new one?
I use the recipe created by kwetal. It stopped working a month or more ago- it downloads successfully except that it is stuck on the September 9, 2013 issue.