Getting even closer.
I can read all the articles now, but there's stuff before and after them that I'm picking up off the web site. I can't figure out how to
1. Get it to the print only page
2. Get the stuff at the beginning (really disruptive for reading) and the end (not as bad but would love to remove it)
Thanks for any assistance anyone can provide. I certainly wouldn't mind a little .rar pack with the answer in it either!
Grateful either way,
Matt
class WinnipegFreePress(BasicNewsRecipe):
title = u'Journal Gazette'
__author__ = 'somedayson'
description = 'News from Fort Wayne Indiana'
oldest_article = 1
max_articles_per_feed = 30
category = 'News, Fort Wayne, Indiana'
cover_url = 'http://media.winnipegfreepress.com/designimages/winnipegfreepress_WFP.gif'
no_stylesheets = True
encoding = 'UTF-8'
remove_javascript = True
use_embedded_content = False
language = 'en_CA'
feeds = [(u'Opinion', u'http://journalgazette.net/apps/pbcs.dll/section?Category=EDIT&template=blogrss&mime=xml'),
(u'Local News',u'http://journalgazette.net/apps/pbcs.dll/section?Category=LOCAL&template=blogrss&mime=xml') ,
(u'Sports',u'http://journalgazette.net/apps/pbcs.dll/section?Category=SPORTS&template=blogrss&mime=xml' ),
(u'Features',u'http://journalgazette.net/apps/pbcs.dll/section?Category=FEAT&template=blogrss&mime=xml'),
(u'Business',u'http://journalgazette.net/apps/pbcs.dll/section?Category=BIZ&template=blogrss&mime=xml'),
(u'Ice Chips',u'http://journalgazette.net/apps/pbcs.dll/section?Category=BLOGS11&template=blogrss&mime=xml '),
(u'Entertainment',u'http://journalgazette.net/apps/pbcs.dll/section?Category=ENT&template=blogrss&mime=xml'),
(u'Food',u'http://journalgazette.net/apps/pbcs.dll/section?Category=FOOD&template=blogrss&mime=xml')
]
|