View Single Post
Old 06-14-2011, 09:55 PM   #1
Bortolotto
Member
Bortolotto began at the beginning.
 
Bortolotto's Avatar
 
Posts: 15
Karma: 14
Join Date: Jun 2011
Location: Brazil
Device: Kindle
Fetching taking too much time

Hi buddies!

I've made a recipe that is taking too much time to process (about 30 min.)
May you take a look at it?

Below is the recipe and attached the execution log.

Code:
class PortalR7(BasicNewsRecipe):
    title                  = 'Noticias R7'
    __author__             = 'Diniz Bortolotto'
    description            = 'Noticias Portal R7'
    oldest_article         = 2
    max_articles_per_feed  = 20
    encoding               = 'utf8'
    publisher              = 'Rede Record'
    category               = 'news, Brazil'
    language               = 'pt_BR'
    publication_type       = 'newsportal'
    feeds                  = [
                              (u'Brasil', u'http://www.r7.com/data/rss/brasil.xml'), 
                              (u'Economia', u'http://www.r7.com/data/rss/economia.xml'), 
                              (u'Internacional', u'http://www.r7.com/data/rss/internacional.xml'), 
                              (u'Tecnologia e Ci\xeancia', u'http://www.r7.com/data/rss/tecnologiaCiencia.xml')
                             ]

    reverse_article_order  = True
    remove_tags            = [
                              dict(name='ul', attrs={'class':'controles'}),
                              dict(name='div', attrs={'class':'materia_banner'}),
                              dict(name='ul', attrs={'class':'relacionados'})
                             ]
    keep_only_tags         = [
                              dict(name='div', attrs={'class':'materia'})
                             ]
Attached Files
File Type: txt calibre_fetch.txt (495.1 KB, 207 views)
Bortolotto is offline   Reply With Quote