![]() |
#1 |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 12946
Join Date: Nov 2010
Device: Kindle 3
|
Only fetches 10 posts from blogs
I've got a new Kindle and am hoping to use Calibre to fetch blogs. I've tried creating custom news sources to go back 365 days and grab up to 100 posts. In all 3 that I've tried it only goes back to the last 10 posts. Essentially all 3 recipes look like this:
class AdvancedUserRecipe1290235764(BasicNewsRecipe): title = u'Punk Rock OR' oldest_article = 365 max_articles_per_feed = 100 feeds = [(u'Punk Rock OR', u'http://punkrockor.wordpress.com/feed/')] The other two site had these URLs: http://mat.tepper.cmu.edu/blog/?feed=rss2 http://greenor.wordpress.com/feed/ I'm guessing that I'm missing something obvious, but I don't see it from the other recipes or the part of the manual that I can understand. Any pointers (or solutions) will be appreciated. Also, after the initial use, will Calibre grab only new posts, or will it grab the most recent X posts even if some were grabbed earlier. This has me a bit confused, too. Thanks in advance-- |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,351
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The RSS feed you are using only has 10 items in it.
|
![]() |
![]() |
![]() |
#3 |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 12946
Join Date: Nov 2010
Device: Kindle 3
|
![]() Finally, I don't know whether Calibre keeps a record of the most recent items it's grabbed. if the RSS feed is limited to the 10 most recent posts will I always get the entire feed, or just the ones since Calibre last looked? Thanks for the help. |
![]() |
![]() |
![]() |
#4 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,351
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
No it doesn't it will refetch any articles newer then the setting for oldest_article.
You can get older blog posts by directly parsing the html usng the parse_index function, see the user manual for how to do that. |
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 12946
Join Date: Nov 2010
Device: Kindle 3
|
Thanks, Kovid. Looks like I have a little project for the upcoming long weekend!
|
![]() |
![]() |
![]() |
#6 | |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Dec 2010
Device: Kindle 3
|
Quote:
I've got the same problem, since i would like to download an entire blog instead of just the latest items using the feed; i'm quite good in python so i guess i should be able to figure out by myself how to write the code, but i was wondering if someone has a generic recipe for blogs hosted by google (blogspot) and/or wordpress so i can save some time ![]() Thanks in advance to anyone willing to help... |
|
![]() |
![]() |
![]() |
#7 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Jan 2012
Device: kindle touch
|
Found code for downloading blogspot archives
There's a blogspot blogger that I like to read. It goes back for 5 years, and so I wanted a way to download past years.
At first I thought that Calibre had a limit on the number of articles it could download from a blog, because it wasn't getting all of the articles available. But obviously it was nowhere near this limit. Luckily someone wrote code to do this download. If you search the Custom Recipes thread for the "Universal Blogspot Downloader" it'll get you here: https://www.mobileread.com/forums/sho...postcount=1785 which gives you recipe code, and some tweaking instructions. You can tweak it to specify the range of years and months that you want. As written, it downloads comments, too. I decided to remove the comments, and so only show articles. My slightly revised version is attached. All I did was add a few new entries in the "remove_tags" line, based on reading the orginal HTML carefully. Big kudos to EnergyLens for the original. Paul BTW if you want to have articles sorted from oldest to newest, add this line to your recipe: reverse_article_order = True Last edited by paulkon; 01-10-2012 at 02:44 PM. Reason: adding detail on sort order |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
What do blogs look like on the Kindle? | Nate the great | Amazon Kindle | 12 | 07-28-2010 12:11 AM |
Classic Blogs for the Nook | tyncam | Barnes & Noble NOOK | 4 | 02-12-2010 07:25 AM |
Classic Blogs on nook? | geneaber | Barnes & Noble NOOK | 2 | 11-02-2009 04:48 PM |
blogs? | fishcube | Amazon Kindle | 0 | 09-08-2009 07:50 PM |
best way to save blogs ? | bugsbunny14 | Sony Reader | 0 | 10-05-2006 07:02 AM |