06-19-2013, 11:20 AM | #1 |
Junior Member
Posts: 4
Karma: 10
Join Date: Jun 2013
Device: Kindle Paperwhite
|
"keep_only_tags" doesn't work?
I'm using the test recipe to crawl infoq.com:
Code:
class InfoQ_Test(BasicNewsRecipe): title = u'InfoQ Test' auto_cleanup = True no_stylesheets = True keep_only_tags = [dict(id=['content'])] def parse_index(self): items = [] items.append({ 'title': 'Article1', 'url': 'http://www.infoq.com/news/2013/06/stratos-2' }) items.append({ 'title': 'Article2', 'url': 'http://www.infoq.com/news/2013/06/document-messaging-analysis' }) return [("Default", items)] Thanks |
06-19-2013, 11:05 PM | #2 |
creator of calibre
Posts: 43,930
Karma: 22669822
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Remove
auto_cleanup = True |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Library doesn't work, eInk side stuck with "Home" screen | andr2k | enTourage eDGe | 4 | 08-04-2012 08:37 AM |
"Most Recent First" doesn't work any more | Len666 | Amazon Kindle | 9 | 06-03-2012 06:56 AM |
T1: plugboard "title_sort" doesn't work? | salines | Devices | 0 | 11-05-2011 09:17 AM |
Border's "store" doesn't work? | EldRick | Kobo Reader | 4 | 06-29-2011 10:11 PM |
The option "--extra-css" doesn't work | slex | Conversion | 2 | 02-19-2011 06:26 AM |