Old 12-02-2011, 10:07 PM   #5
ppriede
Junior Member
Posts: 6
Karma: 10
Join Date: May 2010
Location: Chile
Device: Kindle 3G, Kindle Fire (soon)
Nice idea, NotTaken.

Well, I've now found a simple way to get the URLs of the updated comics from GoComics.com.

With Dapper:
http://open.dapper.net/transform.php...com%2Ffeatures

Or with Feed43.com (it only loads the first 100 KB of the HTML, but it's something):
http://www.feed43.com/random_gocomics.xml

The Dapper feed only shows the updated comics. In the Feed43 feed, the content of each item has a "Status: " field that shows whether the comic is updated or not (the value is either "updated" or empty).


BUT!
Now I'm really lost...

I know I have to download the RSS first, then shuffle it, then chop it (to 20 elements, for example), and then pass it to the GoComics recipe to clean it up.
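
To make the shuffle-and-chop part concrete, here is a rough sketch in plain Python, outside of any recipe. It assumes the feed is ordinary RSS 2.0 with <item>, <title> and <link> elements. The function name pick_random_items, the sample feed and the example.com URLs are all made up for illustration, and none of this is calibre API.

```python
import random
import xml.etree.ElementTree as ET


def pick_random_items(rss_text, limit=20, rng=None):
    """Return up to `limit` (title, link) pairs from an RSS 2.0 document, in random order."""
    if rng is None:
        rng = random.Random()
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        title = (item.findtext("title") or "").strip()
        link = (item.findtext("link") or "").strip()
        if link:  # skip entries that have no URL to fetch
            items.append((title, link))
    rng.shuffle(items)    # shuffle the list in place
    return items[:limit]  # chop to at most `limit` entries


# Made-up feed with 30 entries (hypothetical placeholder URLs)
SAMPLE_RSS = (
    "<rss version='2.0'><channel><title>demo</title>"
    + "".join(
        "<item><title>Comic %d</title>"
        "<link>http://example.com/comic/%d</link></item>" % (i, i)
        for i in range(30)
    )
    + "</channel></rss>"
)

chosen = pick_random_items(SAMPLE_RSS, limit=20, rng=random.Random(0))
for title, link in chosen[:3]:
    print(title, link)
```

Passing a seeded random.Random makes the selection repeatable while testing. In real use the text would come from the downloaded feed instead of SAMPLE_RSS.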

I tried to read the tutorial on Recipes, but I really couldn't find a good example that shows how the fetching is done.

Is it possible to shuffle an RSS feed and then pass it to the other functions of the GoComics recipe (make_links(self, url), preprocess_html(self, soup))?

I think parse_index(self) is no longer needed... or is it?
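
For reference, as far as I can tell from the recipe documentation, parse_index is expected to return a list of (feed title, list of articles) tuples, where each article is a dictionary with keys such as 'title', 'url', 'date' and 'description'. A small sketch of building that shape from a list of (title, url) pairs follows. The helper name to_index, the feed title and the sample URLs are hypothetical, and I have not tried this inside the actual GoComics recipe.

```python
def to_index(pairs, feed_title="GoComics (random)"):
    """Build a parse_index-style structure from (title, url) pairs.

    The result is a list with one (feed title, articles) tuple, where each
    article is a dict. Dates and descriptions are left empty because the
    pairs do not carry them.
    """
    articles = [
        {"title": title, "url": url, "date": "", "description": ""}
        for title, url in pairs
    ]
    return [(feed_title, articles)]


# Hypothetical input, e.g. the output of a shuffle-and-chop step
pairs = [
    ("Comic A", "http://example.com/comic/a"),
    ("Comic B", "http://example.com/comic/b"),
]
index = to_index(pairs)
print(index[0][0], len(index[0][1]))
```

If this is right, parse_index might still be the place where the shuffled list gets handed over to the rest of the recipe, but that is a guess on my part.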


Any information would be appreciated.
Thanks.