MobileRead Forums - View Single Post - Discover Magazine recipe no longer working

truth1ness · 04-17-2015, 09:56 PM

I'm working on this script and making another variation for the monthly magazine version but I think the login portion of the script is not working and I'm not sure how to proceed because it seems to have the correct field names.

I put in a working feed and stripped down the default DiscoverMagazine recipe so it would just [try to] log in and do no filtering. The raw output of the ebook it produces shows Sign In/Register so I know the log in portion is not working (there is probably an easier way to check this but I'm not sure how). Below is the stripped down version I used.

Registration on the site is free so you can make one if you don't already have a login.

Spoiler:

04-17-2015, 09:56 PM	#6
truth1ness Zealot Posts: 126 Karma: 50000 Join Date: Mar 2015 Device: none	I'm working on this script and making another variation for the monthly magazine version but I think the login portion of the script is not working and I'm not sure how to proceed because it seems to have the correct field names. I put in a working feed and stripped down the default DiscoverMagazine recipe so it would just [try to] log in and do no filtering. The raw output of the ebook it produces shows Sign In/Register so I know the log in portion is not working (there is probably an easier way to check this but I'm not sure how). Below is the stripped down version I used. Registration on the site is free so you can make one if you don't already have a login. Spoiler: #!/usr/bin/env python2 __license__ = 'GPL v3' __copyright__ = '2008, Kovid Goyal kovid@kovidgoyal.net' __docformat__ = 'restructuredtext en' ''' discovermagazine.com ''' import re from calibre.web.feeds.news import BasicNewsRecipe class DiscoverMagazine(BasicNewsRecipe): title = u'Discover Magazine' description = u'Science, Technology and the Future' __author__ = 'Starson17' language = 'en' oldest_article = 33 max_articles_per_feed = 20 no_stylesheets = True remove_javascript = True use_embedded_content = False encoding = 'utf-8' extra_css = '.headline {font-size: x-large;} \n .fact {padding-top: 10pt}' # Login stuff needs_subscription = True use_javascript_to_login = True requires_version = (0, 9, 20) def javascript_login(self, br, username, password): br.visit('http://discovermagazine.com', timeout=120) f = br.select_form('div.login.section div.form') f['username'] = username f['password'] = password br.submit('input[id="signInButton"]', timeout=120) br.run_for_a_time(20) # End login stuff def append_page(self, soup, appendtag, position): pager = soup.find('span',attrs={'class':'next'}) if pager: nexturl = pager.a['href'] soup2 = self.index_to_soup(nexturl) texttag = soup2.find('div', attrs={'class':'articlebody'}) newpos = len(texttag.contents) self.append_page(soup2,texttag,newpos) texttag.extract() appendtag.insert(position,texttag) feeds = [ (u'Top Stories', u'http://feeds.feedburner.com/DiscoverTopStories'), ]