View Single Post
Old 08-23-2013, 02:35 AM   #2
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
Feel free to use this update

Spoiler:
Code:
## Last Edit:  2013-08-23
## From:  Armin Geller
__license__   = 'GPL v3'
__copyright__ = '2010, NA'
'''
consumerist.com
'''

from calibre.web.feeds.news import BasicNewsRecipe

class Consumerist(BasicNewsRecipe):
    title                 = 'Consumerist'
    __author__            = 'NA'
    description           = "Consumerist, Shoppers Bite Back."
    publisher             = 'consumerist.com'
    category              = 'news, consumer news, consumer rights'
    oldest_article        = 2
    max_articles_per_feed = 100
    no_stylesheets        = True
    encoding              = 'utf-8'
    use_embedded_content  = False
    language              = 'en'
    masthead_url          = 'http://consumermediallc.files.wordpress.com/2013/02/consumerist.png'# AGe 2013-08-23
    
    extra_css             = '''
                              body{font-family: "Lucida Grande",Helvetica,Arial,sans-serif}
                              img{margin-bottom: 1em}
                              h1{font-family :Arial,Helvetica,sans-serif; font-size:x-large}
                              h2{font-family :Arial,Helvetica,sans-serif; font-size:large}
                            '''
    conversion_options = {
                          'comment'   : description,
                          'tags'      : category,
                          'publisher' : publisher,
                          'language'  : language,
                          }

    remove_attributes  = ['width','height']

    keep_only_tags     = dict(name='div', attrs={'class':['hfeed',]}) # AGe 2013-08-23

    remove_tags        = [dict(name='div', attrs={'class':['navigation', # AGe 2013-08-23
                                         'wpcom-related-posts widget widget_related_posts', # AGe 2013-08-23
                                         'sharedaddy sd-like-enabled sd-sharing-enabled',]}), # AGe 2013-08-23
                          dict(name='div', attrs={'id':['comments',]}), # AGe 2013-08-23
                          ]

    feeds = [(u'Articles', u'http://consumerist.com/index.xml')]

    def preprocess_html(self, soup):
        return self.adeify_images(soup)
Attached Files
File Type: zip Consumerist_AGe.zip (971 Bytes, 172 views)
Divingduck is offline   Reply With Quote