View Single Post
Old 02-25-2014, 02:54 AM   #4
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,161
Karma: 1404241
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
A new update due to some minor changes in article design.

Spoiler:
Code:
##
## Written:      October 2012 (new coding)
## Version:      7.0
## Last update:  2014-02-25
##

__license__   = 'GPL v3'
__copyright__ = ''
'''
Fetch RSS-Feeds spektrum.de
'''
from calibre.web.feeds.recipes import BasicNewsRecipe
class AdvancedUserRecipe1303841067(BasicNewsRecipe):
    title                 = u'Spektrum der Wissenschaft'
    __author__            = 'Armin Geller, Bratzzo, Rainer Zenz' # Update AGE 2014-02-25
    description           = u'German  online portal of Spektrum der Wissenschaft'
    publisher             = 'Spektrum der Wissenschaft Verlagsgesellschaft mbH'
    category              = 'science news, Germany'
    oldest_article        = 7
    max_articles_per_feed = 100
    no_stylesheets        = True
    remove_javascript     = True
    remove_empty_feeds    = True
    language              = 'de'

    #conversion_options    = {'base_font_size': 20}


    # Cover from Rainer Zenz
    cover_url             = 'http://www16.zippyshare.com/scaled/52219516/file.html' # AGE 2013-07-26 new cover location

    masthead_url          = 'http://www.spektrum.de/fm/861/spektrum.de.png'
    
    # AGE 2014-02-25 add .textbox-wide
    extra_css      =  '''
                      h1 {font-size: 1.6em; text-align: left}
                      h2 {font-size: 1em; font-style: italic; font-weight: normal}
                      h3 {font-size: 1.3em;text-align: left}
                      h4, h5, h6, .heading, .hgroup {font-size: 1em;text-align: left}
                      .textbox-wide {font-size: 1.3em; font-style: italic}
                      '''

    feeds = [
              (u'Spektrum.de', u'http://www.spektrum.de/alias/rss/spektrum-de-rss-feed/996406'),
              (u'Spektrum der Wissenschaft', u'http://www.spektrum.de/alias/rss/spektrum-der-wissenschaft-rss-feed/982623'),
              (u'Gehirn & Geist', u'http://www.spektrum.de/alias/rss/gehirn-geist-rss-feed/982626'),
              (u'Epoc', u'http://www.spektrum.de/alias/rss/epoc-rss-feed/982625'),
              (u'Sterne und Weltraum', u'http://www.spektrum.de/alias/rss/sterne-und-weltraum-rss-feed/865248'),
              (u'Editional', u'http://www.spektrum.de/alias/rss/spektrum-de-editorial/996044'),
              (u'Pressemitteilungen', u'http://www.spektrum.de/alias/rss/pressemitteilungen/995265'),
             ]

    keep_only_tags = [
                        dict(name='div', attrs={'class':'border-h clearfix article-top'}),
                        dict(name='div', attrs={'class':'clearfix'}),
                        dict(name='div', attrs={'class':'bilderrahmenlinks'}),
                        dict(name='div', attrs={'class':'relcontainer'}),
                      ]

    remove_tags_after=dict(name='div', attrs={'class':['sidebar-box-head']})

    remove_tags = [
                    dict(attrs={'id':['recommend-article', 'dossierbox', 'cover', 'toc']}),
                    dict(attrs={'class':['sidebar-box-full clearfix', 'linktotop' ]}),
                    dict(name='a', attrs={'class':'button-petrol'}), # AGE 2014-02-25
                    dict(name='hr', attrs={'class':['box-border left','box-border right']}), # AGE 2014-02-25
                  ]
Attached Files
File Type: rar Spektrum_AGe_V7.rar (1.2 KB, 157 views)
Divingduck is offline   Reply With Quote