My local paper is published weekly and the RSS is updated once a week. On the feed page the paper includes articles from the past two issues. No matter what I set oldest_article to calibre will always download all the articles. I looked at the source for the feed page (
http://www.mahopacnews.com/rssheadlines.xml) and the only indication of the article date is in the article URL. Examples:
Code:
'http://www.mahopacnews.com/Articles-c-2011-03-01-207511.112113-Residents-question-timing-of-agencys-hearing.html'
'http://www.mahopacnews.com/Articles-c-2011-02-22-207434.112113-You-can-choose-to-be-happy.html'
How does calibre determine the article date when downloading articles?
I can extract the date using url.split. Is there a way to tell calibre to use this date to determine article age?
Must the date be in a particular format?
Thanks