MobileRead Forums - View Single Post

ppriede · 12-02-2011, 11:07 PM

Nice idea NotTaken

Well, now i found a simple way to get the URL from updated comics from GoComics.com

With Dapper
http://open.dapper.net/transform.php...com%2Ffeatures

or with Feed43.com (only loads 100KB of the HTML, but is something)
http://www.feed43.com/random_gocomics.xml

With Dapper, only show the updated comics and with Feed43 in the content is a "Status: " that show if is updated or not (updated or empty)

BUT!
Now, i'm really lost..

I know i have to download the RSS first, then shuffle it, and the chop it (to 20 elements for example), and then pass it to the GoComics Recipe to clean it..

I try to read the the tutorial on Recipes, but really don't find a good example to understand the way the fetching is done.

Is possible to shuffle a RSS an then pass it to the other functions of the GoComics Recipe? (make_links(self, url), preprocess_html(self, soup))

I think the parse_index(self) is no longer needed.. or it is?

any information would be appreciated
Thanks.

12-02-2011, 11:07 PM	#5
ppriede Junior Member Posts: 6 Karma: 10 Join Date: May 2010 Location: Chile Device: Kindle 3G, Kindle Fire (soon)	Nice idea NotTaken Well, now i found a simple way to get the URL from updated comics from GoComics.com With Dapper http://open.dapper.net/transform.php...com%2Ffeatures or with Feed43.com (only loads 100KB of the HTML, but is something) http://www.feed43.com/random_gocomics.xml With Dapper, only show the updated comics and with Feed43 in the content is a "Status: " that show if is updated or not (updated or empty) BUT! Now, i'm really lost.. I know i have to download the RSS first, then shuffle it, and the chop it (to 20 elements for example), and then pass it to the GoComics Recipe to clean it.. I try to read the the tutorial on Recipes, but really don't find a good example to understand the way the fetching is done. Is possible to shuffle a RSS an then pass it to the other functions of the GoComics Recipe? (make_links(self, url), preprocess_html(self, soup)) I think the parse_index(self) is no longer needed.. or it is? any information would be appreciated Thanks.