View Single Post
Old 07-26-2013, 04:11 AM   #3
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
I made an update for this recipe. Old cover source wasn't available since long time.

Spoiler:
Code:
##
## Written:      October 2012 (new coding)
## Version:      6.0
## Last update:  2013-07-25
##

__license__   = 'GPL v3'
__copyright__ = ''
'''
Fetch RSS-Feeds spektrum.de
'''
from calibre.web.feeds.recipes import BasicNewsRecipe
class AdvancedUserRecipe1303841067(BasicNewsRecipe):
    title                 = u'Spektrum der Wissenschaft'
    __author__            = 'Armin Geller, Bratzzo, Rainer Zenz' # Update AGE 2013-07-26
    description           = u'German  online portal of Spektrum der Wissenschaft'
    publisher             = 'Spektrum der Wissenschaft Verlagsgesellschaft mbH'
    category              = 'science news, Germany'

    oldest_article        = 7
    max_articles_per_feed = 100
    no_stylesheets        = True
    remove_javascript     = True
    remove_empty_feeds    = True
    language              = 'de'

    #conversion_options    = {'base_font_size': 20}


    #cover_url             = 'http://upload.wikimedia.org/wikipedia/de/5/59/Spektrum-cover.jpg' # from Rainer Zenz
    cover_url             = 'http://www16.zippyshare.com/scaled/52219516/file.html' # AGE 2013-07-26 new cover location

    masthead_url          = 'http://www.spektrum.de/fm/861/spektrum.de.png'

    extra_css      =  '''
                      h1 {font-size: 1.6em; text-align: left}
                      h2 {font-size: 1em; font-style: italic; font-weight: normal}
                      h3 {font-size: 1.3em;text-align: left}
                      h4, h5, h6, .heading, .hgroup {font-size: 1em;text-align: left}
                      '''

    feeds = [
              (u'Spektrum.de', u'http://www.spektrum.de/alias/rss/spektrum-de-rss-feed/996406'),
              (u'Spektrum der Wissenschaft', u'http://www.spektrum.de/alias/rss/spektrum-der-wissenschaft-rss-feed/982623'),
              (u'Gehirn & Geist', u'http://www.spektrum.de/alias/rss/gehirn-geist-rss-feed/982626'),
              (u'Epoc', u'http://www.spektrum.de/alias/rss/epoc-rss-feed/982625'),
              (u'Sterne und Weltraum', u'http://www.spektrum.de/alias/rss/sterne-und-weltraum-rss-feed/865248'),
              (u'Editional', u'http://www.spektrum.de/alias/rss/spektrum-de-editorial/996044'),
              (u'Pressemitteilungen', u'http://www.spektrum.de/alias/rss/pressemitteilungen/995265'),
             ]

    keep_only_tags = [
                        dict(name='div', attrs={'class':'border-h clearfix article-top'}),
                        dict(name='div', attrs={'class':'clearfix'}),
                        dict(name='div', attrs={'class':'bilderrahmenlinks'}),
                        dict(name='div', attrs={'class':'relcontainer'}),
                      ]

    remove_tags_after=dict(name='div', attrs={'class':['sidebar-box-head']})

    remove_tags = [
                    dict(attrs={'id':['recommend-article', 'dossierbox', 'cover', 'toc']}),
                    dict(attrs={'class':['sidebar-box-full clearfix', 'linktotop' ]}),
                  ]


Let me know, if something isn't working.
Attached Files
File Type: zip Spektrum_AGe_V6.zip (1.2 KB, 263 views)
Divingduck is offline   Reply With Quote