|
|
#1 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 74
Karma: 19640
Join Date: Apr 2012
Device: Kindle Touch
|
Help with Wordpress feed (El Mundo Today)
http://www.elmundotoday.com/feed/ I've tweaked with several recipes for Wordpress found in this very forum to no avail, the index is always empty (no articles) although manually downloading the feed shows articles there. The feed is compressed with gzip, but I guess this should not be a problem for Calibre? Below is my last attempt: Code:
lass AdvancedUserRecipe1335711936(BasicNewsRecipe):
title = u'El Mundo Today'
description = 'La actualidad del mañana'
cover_url = 'http://www.elmundotoday.com/wp-content/themes/EarthlyTouch/images/logo.png'
oldest_article = 365
max_articles_per_feed = 100
auto_cleanup = False
no_stylesheets = True
language = 'es_ES'
use_embedded_content = True
feeds = [(u'El Mundo Today', u'http://www.elmundotoday.com/feed/')]
|
|
|
|
|
|
#2 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 74
Karma: 19640
Join Date: Apr 2012
Device: Kindle Touch
|
I should have tried this before posting:
Uncompressed the RSS file, copied it to the data directory of a very simple web server that runs in my computer, then pointed the feed in the recipe to localhost. Articles now show up. So it seems gzip compression was indeed the problem. |
|
|
|
|
Enthusiast
|
|
|
|
#3 |
|
Creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 22,481
Karma: 2944574
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
If you want calibre to handle gzip transparently, use
Code:
def get_broser(self):
br = BasicNewsRecipe.get_browser(self)
br.set_handle_gzip(True)
return br
__________________
Get calibre Notice to all: I can not provide assistance with DRM removal, for legal reasons, so please do not contact me about it. |
|
|
|
|
|
#4 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 74
Karma: 19640
Join Date: Apr 2012
Device: Kindle Touch
|
Thanks Kovid, that did it. Below is a working version of the recipe in case someone else is interested in the site:
Code:
from calibre.web.feeds.news import BasicNewsRecipe
class ElMundoTodayRecipe(BasicNewsRecipe):
title = 'El Mundo Today'
description = u'La actualidad del mañana'
category = 'Noticias, humor'
cover_url = 'http://www.elmundotoday.com/wp-content/themes/EarthlyTouch/images/logo.png'
oldest_article = 30
max_articles_per_feed = 30
auto_cleanup = True
no_stylesheets = True
language = 'es_ES'
use_embedded_content = True
feeds = [('El Mundo Today', 'http://www.elmundotoday.com/feed/')]
def get_broser(self):
br = BasicNewsRecipe.get_browser(self)
br.set_handle_gzip(True)
return br
|
|
|
|
|
|
#5 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 74
Karma: 19640
Join Date: Apr 2012
Device: Kindle Touch
|
Updated version with better page parsing and some CSS for eye candy.
Spoiler:
|
|
|
|
|
|
#6 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,488
Karma: 1035940
Join Date: Jun 2009
Location: Madrid, Spain
Device: Pink Sony Prs-350, Kobo Mini, iPhone, iPad
|
Thank you, I'm interested
(and always surprised by their news )
__________________
Ciao Terisa de Morgan
|
|
|
|
|
|
#7 |
|
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 74
Karma: 19640
Join Date: Apr 2012
Device: Kindle Touch
|
Glad to know it's of use to someone else
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Psychology today news feed failing to download | Shuichiro | Recipes | 1 | 05-14-2011 05:11 AM |
| BBC Mundo (v1.0) - Spanish | tolyluis | Recipes | 0 | 01-29-2011 07:12 PM |
| Hola mundo | johansolo | Introduce Yourself | 6 | 08-22-2009 09:53 PM |
| Wordpress Vs Textpattern | Moejoe | Lounge | 4 | 03-06-2009 11:46 AM |
| iLiad review in El Mundo (Spanish newspaper) | ElaHuguet | iRex | 1 | 08-17-2007 10:15 AM |