Old 01-28-2017, 09:15 AM   #1
Serpico
Device: PocketBook Touch Lux 3
Fetching images from AFP (France24.com)

Hi,

Does anybody have an idea how to fetch images from http://www.france24.com/en/wires/rss into an ebook through a custom recipe? My recipe fetches the text only:

Code:
from calibre import strftime
# Explicit import so the base class name resolves
from calibre.web.feeds.news import AutomaticNewsRecipe


class BasicUserRecipe1483714006(AutomaticNewsRecipe):
    title = u'France24'
    oldest_article = 14  # days
    max_articles_per_feed = 100
    compress_news_images = True
    compress_news_images_max_size = 30  # KB per image
    auto_cleanup = True
    timefmt = ' [%Y-%m-%d@%H:%M]'
    conversion_options = {'title': title + ' - ' + strftime('%Y-%m-%d@%H:%M')}

    feeds = [(u'France24', u'http://www.france24.com/en/wires/rss')]

    def preprocess_html(self, soup):
        # Replace every link that contains only text with that plain text
        for alink in soup.findAll('a'):
            if alink.string is not None:
                tstr = alink.string
                alink.replaceWith(tstr)
        return soup
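
To rule out the preprocess_html step as the cause, here is a small stand-alone sketch of what it does. It is not calibre code: it uses the standard library's ElementTree on a made-up XHTML fragment instead of BeautifulSoup, and the name flatten_text_links is hypothetical. As far as I understand BeautifulSoup, .string is None for a link that wraps an image, so such a link (and the image inside it) should be left alone by this step.

```python
import xml.etree.ElementTree as ET


def flatten_text_links(xhtml):
    """Replace <a> elements that contain only text with that text.

    Rough stand-in for the recipe's preprocess_html; links that have
    child elements (for example a wrapped <img>) are left untouched.
    """
    root = ET.fromstring(xhtml)
    # Work on a snapshot of all elements, since children are removed below.
    for parent in list(root.iter()):
        prev = None  # last sibling that was kept
        for child in list(parent):
            if child.tag == 'a' and len(child) == 0 and child.text:
                # Splice the link text and whatever followed the link
                # into the preceding text slot, then drop the link.
                spliced = child.text + (child.tail or '')
                if prev is None:
                    parent.text = (parent.text or '') + spliced
                else:
                    prev.tail = (prev.tail or '') + spliced
                parent.remove(child)
            else:
                prev = child
    return ET.tostring(root, encoding='unicode')


if __name__ == '__main__':
    # Made-up fragment, not taken from france24.com
    sample = (
        '<div><p>Read <a href="/more">more</a> here.</p>'
        '<a href="/big.jpg"><img src="/small.jpg" /></a></div>'
    )
    print(flatten_text_links(sample))
```

If that reading is right, the text link is flattened while the image link survives, so the missing images would have to come from somewhere else in the recipe or from the feed itself.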