10-13-2012, 01:00 PM | #1 |
Connoisseur
Posts: 58
Karma: 12
Join Date: May 2011
Location: Deland, Florida
Device: Kindle 3
|
USA Today "pageinfo data hidden"
I am trying to remove several tags from the USA Today recipe that I personally find annoying. Here is my remove_tags recipe:
Code:
remove_tags = [dict(name='aside', attrs={'class':['comp story-highlights','right partner','right']}), dict(name='span', attrs={'class':['last-updated']}), dict(name='div', attrs={'class':['pageinfo data hidden']}), ] Here is the original source code from the RSS feed: <div class="pageinfo data hidden"> { "assetid": "1631253", "aws": "tech", "aws_id": "tech", "blogname": "", "contenttype": "story pages ", "pagename": "Space shuttle Endeavour continues trek through L.A.", "seotitle": "Shuttle-endeavour-los-angeles", "seotitletag": "Space shuttle Endeavour continues trek through L.A.", "ssts": "tech", "taxonomykeywords":"Traffic congestion,Space exploration,Los Angeles,California,Manchester,Manchester,Long Beach,Manchester", "templatename": "stories/default", "topic":"traffic-congestion,space-exploration,los-angeles,california,manchester,manchester,long-beach,manchester", "videoincluded":"yes", "basePageType":"story" } </div> All of that garbage is appearing after every article and I would very much appreciate any help in assisting me in removing it from download. Thank you, Randy |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
"The Hidden Truth" - newly published by Wade C. Wilson | thehiddentruth | Self-Promotions by Authors and Publishers | 0 | 09-30-2012 05:14 AM |
How do I make a "hidden" pages for endnotes in an EPUB? | clemens14 | ePub | 9 | 04-30-2012 06:57 PM |
No data in "In Library" and "On Device" columns after upgrade | ily426 | Library Management | 8 | 04-03-2011 02:53 PM |
Why No "USA Today"? | HaggisMacJedi | Amazon Kindle | 18 | 07-03-2008 09:24 AM |