View Single Post
Old 02-04-2009, 03:09 AM   #163
gregcd
Connoisseur
gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!gregcd rocks like Gibraltar!
 
gregcd's Avatar
 
Posts: 90
Karma: 100000
Join Date: Jan 2009
Location: New Zealand
Device: prs-t1, prs-650 to sell
Hi all, I'm working on setting up these feeds for full text (print versions):

New zealand news site Stuff http://www.stuff.co.nz/feeds/topstories.xml

The Stuff site seems simple :
def print_version(self, url):
return url.replace('http://www.stuff.co.nz/', 'http://www.stuff.co.nz/print/')
However when I run it the EPUB result appears to be the full HTML version (which the sony doesn't like) and not the print version.

What should be changed for it to work?
Quote:
class AdvancedUserRecipe1233732796(BasicNewsRecipe):
title = u'Stuff.co.nz'
oldest_article = 7
max_articles_per_feed = 100

feeds = [(u'top stories', u'http://www.stuff.co.nz/feeds/topstories.xml'), (u'Worldnews', u'http://www.stuff.co.nz/feeds/worldnews.xml'), (u'Music', u'http://www.stuff.co.nz/feeds/music.xml'), (u'Llifestyle', u'http://www.stuff.co.nz/feeds/lifestyle.xml'), (u'Sport', u'http://www.stuff.co.nz/feeds/sport.xml')]
def print_version(self, url):
return url.replace('http://www.stuff.co.nz/', 'http://www.stuff.co.nz/print/')

I have a similar issue with the CIO new zealand site http://cio.co.nz/rss/cio.xml
Also with this one, the "printer friendly" has fixed line length - is there python code to remove this?

Gamespot seems simple, but I"m not sure what the right code is as "replace" doesn't seem appropriate - the print version just has "print=1" at the end of the URL.

http://rss.gamespot.com/misc/rss/gam...es_reviews.xml
gregcd is offline