Old 03-12-2015, 03:15 PM   #7
Divingduck
Wizard
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
Please find attached a new update for this recipe.

Spoiler:
Code:
##
## Written:      October 2012 (new coding)
## Version:      9.0
## Last update:  2015-03-12
##

__license__   = 'GPL v3'
__copyright__ = ''
'''
Fetch RSS feeds from spektrum.de
'''
from calibre.web.feeds.recipes import BasicNewsRecipe


class AdvancedUserRecipe1303841067(BasicNewsRecipe):
    title                 = u'Spektrum der Wissenschaft'
    __author__            = 'Armin Geller, Bratzzo, Rainer Zenz' # Update AGE 2014-02-25
    description           = u'German online portal of Spektrum der Wissenschaft'
    publisher             = 'Spektrum der Wissenschaft Verlagsgesellschaft mbH'
    category              = 'science news, Germany'
    oldest_article        = 7
    max_articles_per_feed = 100
    no_stylesheets        = True
    remove_javascript     = True
    remove_empty_feeds    = True
    language              = 'de'
    encoding              = 'utf8' # AGe 2015-03-13 new, old encoding was 'cp1252'
    #recursions           = 0 # AGe 2014-08-21 new
    #conversion_options    = {'base_font_size': 20}


    # Cover from Rainer Zenz
    cover_url             = 'http://www58.zippyshare.com/scaled/VISHzqJj/file.html' # AGE 2015-03-12 new cover location
    masthead_url          = 'http://www.spektrum.de/fm/861/spektrum.de.png'
    
    # AGE 2014-02-25 add .textbox-wide
    extra_css      =  '''
                      h1 {font-size: 1.6em; text-align: left}
                      h2 {font-size: 1em; font-style: italic; font-weight: normal}
                      h3 {font-size: 1.3em;text-align: left}
                      h4, h5, h6, .heading, .hgroup {font-size: 1em;text-align: left}
                      .textbox-wide {font-size: 1.3em; font-style: italic}
                      '''

    feeds = [
              (u'Spektrum.de', u'http://www.spektrum.de/alias/rss/spektrum-de-rss-feed/996406'),
              (u'Spektrum der Wissenschaft', u'http://www.spektrum.de/alias/rss/spektrum-der-wissenschaft-rss-feed/982623'),
              (u'Gehirn & Geist', u'http://www.spektrum.de/alias/rss/gehirn-geist-rss-feed/982626'),
              (u'Sterne und Weltraum', u'http://www.spektrum.de/alias/rss/sterne-und-weltraum-rss-feed/865248'),
              (u'Pressemitteilungen', u'http://www.spektrum.de/alias/rss/pressemitteilungen/995265'), # AGe 2014-08-21 available but empty
              (u'Meistgelesene Artikel', u'http://www.spektrum.de/alias/rss/spektrum-de-meistgelesene-artikel/1224665'), # AGe 2014-08-21 new
             ]

    keep_only_tags = [
                        dict(name='div', attrs={'class':'border-h clearfix article-top relcontainer'}),
                        dict(name='article', attrs={'class':'clearfix'}),
                      ]

    remove_tags = [
                    dict(name='aside', attrs={'class':['bilderrahmenlinks','bilderrahmenrechts']}), # AGE 2014-08-29
                    dict(name='div', attrs={'class':'textbox-wide clearfix'}), # AGE 2014-08-29
                    dict(name='span', attrs={'class':['no-display','sprite-a sprite-lupe']}), # AGe 2015-03-13
                  ]
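
If you edit the feeds list yourself, a quick sanity check can catch a broken entry before calibre tries to download anything. The following is only a minimal sketch and not part of the attached recipe: the helper name check_feeds and the constant FEEDS are made up for illustration, and the list is simply copied from the recipe above. It uses only the Python standard library and checks the shape of each entry, the URL scheme, and duplicate URLs. It does not fetch the feeds.

```python
from urllib.parse import urlparse

# Feed list copied from the recipe above (titles and URLs unchanged).
FEEDS = [
    ('Spektrum.de', 'http://www.spektrum.de/alias/rss/spektrum-de-rss-feed/996406'),
    ('Spektrum der Wissenschaft', 'http://www.spektrum.de/alias/rss/spektrum-der-wissenschaft-rss-feed/982623'),
    ('Gehirn & Geist', 'http://www.spektrum.de/alias/rss/gehirn-geist-rss-feed/982626'),
    ('Sterne und Weltraum', 'http://www.spektrum.de/alias/rss/sterne-und-weltraum-rss-feed/865248'),
    ('Pressemitteilungen', 'http://www.spektrum.de/alias/rss/pressemitteilungen/995265'),
    ('Meistgelesene Artikel', 'http://www.spektrum.de/alias/rss/spektrum-de-meistgelesene-artikel/1224665'),
]


def check_feeds(feeds):
    """Return a list of human-readable problems found in a feeds list.

    Hypothetical helper for illustration; calibre does not provide it.
    """
    problems = []
    seen_urls = set()
    for entry in feeds:
        # Each entry is expected to be a (title, url) pair.
        if not (isinstance(entry, tuple) and len(entry) == 2):
            problems.append('not a (title, url) pair: %r' % (entry,))
            continue
        title, url = entry
        parsed = urlparse(url)
        # Only http/https URLs with a host name are accepted here.
        if parsed.scheme not in ('http', 'https') or not parsed.netloc:
            problems.append('bad URL for %r: %r' % (title, url))
        if url in seen_urls:
            problems.append('duplicate URL: %r' % url)
        seen_urls.add(url)
    return problems


if __name__ == '__main__':
    # Prints one line per problem; prints nothing if the list looks fine.
    for problem in check_feeds(FEEDS):
        print(problem)
```

Whether a feed actually delivers articles (see the note on 'Pressemitteilungen' above) can only be seen by running the recipe itself in calibre.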
Attached Files
File Type: zip Spektrum_AGe_V9.zip (1.3 KB, 191 views)