03-28-2013, 07:41 AM | #1 |
Connoisseur
Posts: 50
Karma: 10
Join Date: Apr 2005
Device: Nokia 5320
|
Público.pt
Hi,
The Público.pt recipe does not work. I only get the titles. Can someone take a look? Thanks in advance. José Pinto |
03-28-2013, 11:06 AM | #2 | |
Connoisseur
Posts: 62
Karma: 46
Join Date: Feb 2011
Device: Kindle 3 (cracked screen!); PW1; Oasis
|
Quote:
Code:
keep_only_tags = [dict(attrs={'class':['hentry article single']})]
remove_tags = [dict(attrs={'class':['entry-options entry-options-above group','entry-options entry-options-below group', 'module tag-list']})]
Code:
keep_only_tags = [dict(attrs={'class':['entry-header single-header','entry-body']})] |
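For context, here is a minimal sketch of where settings like these might sit in a calibre recipe. The class name `PublicoSketch` and the import fallback are assumptions for illustration only; the two tag filters are copied from the first code block above, and nothing else about the real recipe is implied.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Assumption: fallback so the sketch can still be parsed outside calibre.
    BasicNewsRecipe = object


class PublicoSketch(BasicNewsRecipe):  # hypothetical class name
    title = 'Público.pt'

    # Keep only elements whose class attribute matches this value.
    keep_only_tags = [dict(attrs={'class': ['hentry article single']})]

    # Within the kept content, drop the option bars and the tag list.
    remove_tags = [dict(attrs={'class': [
        'entry-options entry-options-above group',
        'entry-options entry-options-below group',
        'module tag-list',
    ]})]
```

If the second `keep_only_tags` variant were used, it would take the place of the first assignment, since a later assignment to the same class attribute overrides the earlier one.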
|
03-28-2013, 12:32 PM | #3 | |
Connoisseur
Posts: 50
Karma: 10
Join Date: Apr 2005
Device: Nokia 5320
|
Quote:
Thanks. The text is extracted now, but the sections "Desporto", "Sociedade", "Ciências" and "Ecosfera" are not downloaded. I don't know whether the feeds are the same or not, so I will search for the relevant feeds. José Pinto |
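One possible cause is that the recipe's feed list has no entry for those sections. A rough sketch of what such entries could look like, assuming calibre's `feeds` attribute as a list of `(title, url)` pairs. The URLs below are placeholders, not the site's real feed addresses, and would need to be replaced once the relevant feeds are found.

```python
# Placeholder URLs (assumption): substitute the real feed addresses.
feeds = [
    ('Desporto', 'https://example.invalid/desporto.rss'),
    ('Sociedade', 'https://example.invalid/sociedade.rss'),
    ('Ciências', 'https://example.invalid/ciencias.rss'),
    ('Ecosfera', 'https://example.invalid/ecosfera.rss'),
]

# Quick sanity check: list the section titles the recipe would request.
section_titles = [title for title, url in feeds]
```

In a recipe, this list would be assigned as the `feeds` attribute of the recipe class.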
|
03-28-2013, 06:23 PM | #4 | |||
Connoisseur
Posts: 62
Karma: 46
Join Date: Feb 2011
Device: Kindle 3 (cracked screen!); PW1; Oasis
|
Quote:
Quote:
Quote:
|
|||
|