View Single Post
Old 12-16-2010, 05:27 PM   #6
Onecanuck
Member
Onecanuck began at the beginning.
 
Onecanuck's Avatar
 
Posts: 12
Karma: 10
Join Date: Jan 2010
Device: Sony PRS600S
Still no Joy...

Quote:
Originally Posted by Starson17 View Post
That's not a feed. That page has multiple links to multiple feeds.


Your problem is that your actual feeds have both the full content and images. That's unusual. 95% or more of feeds have just brief summaries with links to pages (the articles) that have the full content. Calibre tries to detect if the feed has full content or links to content. It does that by analyzing the length of the text. When the text is long, (500 plus words IIRC) it will assume all the content is on the feed, and it won't go to the article pages with your images.

You can force Calibre to follow to the articles with:

Code:
use_embedded_content = False
Try that and it should go to your article pages and find your images.
Hi and thank you for the quick reply,

So this is what I am up to. I downloaded Python and did as you said. I put the changes the embedded condition to FALSE. Now when I run the script, the recipe downloads all images from the website. So in fact it is not really just taking the feed from the RSS files. I have never used Python and the so I am learning as I go.

The following is how my script looks:

class AdvancedUserRecipe1292537099(BasicNewsRecipe):

title = 'Leduc - Wetaskiwin Pipestone Flyer'
__author__ = 'Brian Hahn'
description = 'News from Alberta, Canada'


oldest_article = 14
max_articles_per_feed = 100
no_stylesheets = False
#delay = 1
use_embedded_content = False
encoding = 'utf8'
publisher = 'Pipestone Publishing'
category = 'News, Alberta, Canada'
language = 'en_CA'

feeds = [
('Features Stories', 'http://pipestone.inmca.com/Feature.rss'),
('Editorial Comment', 'http://pipestone.inmca.com/Comments.rss'),
('Letters to the Editor', 'http://pipestone.inmca.com/Opinion.rss'),
('Opinion', 'http://pipestone.inmca.com/Columnists.rss'),
('Sports', 'http://pipestone.inmca.com/Sports.rss'),
('On the Beat', 'http://pipestone.inmca.com/Police%20Blotter.rss'),
('Business News', 'http://pipestone.inmca.com/Chambers.rss'),
('Local Government', 'http://pipestone.inmca.com/Government.rss'),
('Faith', 'http://pipestone.inmca.com/Faith.rss'),

]

I appreciate all your feedback and patience.

Thanks again.

Brian Hahn
Onecanuck is offline   Reply With Quote