05-14-2010, 02:16 PM | #1921 | ||
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
Code:
    def preprocess_html(self, soup):
        # Dump the parsed page so you can see what the recipe is working with
        print('The soup is: %s' % soup)
        return soup
|
||
05-14-2010, 08:39 PM | #1922 |
Connoisseur
Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
I just wanted to jump in and thank folks for trying with the whole prospect thing. This is well above my computer language skills (which are limited to html/css), and I appreciate the effort.
Didn't realize what a can of worms I was opening though! |
|
05-16-2010, 05:40 AM | #1923 |
Connoisseur
Posts: 98
Karma: 22
Join Date: Mar 2010
Device: IRiver Story, Ipod Touch, Android SmartPhone
|
Recipes
new recipe:
www.libero-news.it -- Italian daily newspaper
older recipes:
L'Espresso (Italian weekly news) -- better viewing, all feeds now work, and 2 new feeds
La Repubblica -- better viewing, all feeds now work, more efficient removal policy
Le Scienze -- better viewing, new feed |
05-17-2010, 03:57 AM | #1924 |
Junior Member
Posts: 2
Karma: 10
Join Date: May 2010
Device: lbook v3
|
Calibre not working with Instapaper fetch now
When Calibre fetches Instapaper, a file is generated and transferred successfully, but it has no content. The file size is 0.0 MB.
I think the Instapaper API has changed. |
05-17-2010, 10:06 AM | #1925 |
Guru
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
Updated recipe for instapaper.com:
|
|
05-17-2010, 11:37 AM | #1926 | |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2009
Device: kindle2
|
submitting a patched recipe for new york review of books
Hi Kovid & Calibre community,
I've repaired the "new york review of books" recipe, one of Calibre's core recipes. It was missing all article titles because of a change in the nybooks.com HTML layout. Where should I submit the patch?
Regards, Pablo Funes
PS: The patch is very simple. Where it says
    keep_only_tags = [dict(id='article-body')]
it should instead say
    keep_only_tags = [dict(id=['article-body','page-title'])]
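The one-line patch above widens the id filter from a single value to a list. A minimal sketch of that matching rule follows, assuming a list means "match any of these values". `tag_matches` and the sample tags are hypothetical names for illustration only; this is not calibre's own code, and it only handles the flat `dict(id=...)` form, not `attrs={...}`.

```python
def tag_matches(spec, name, attrs):
    """Return True if a tag satisfies a flat keep_only_tags-style spec.

    Hypothetical helper: a list value in the spec means "any of these".
    """
    for key, wanted in spec.items():
        actual = name if key == 'name' else attrs.get(key)
        allowed = wanted if isinstance(wanted, (list, tuple)) else [wanted]
        if actual not in allowed:
            return False
    return True


# Made-up tags standing in for an article page: (tag name, attributes)
page_tags = [
    ('h1', {'id': 'page-title'}),
    ('div', {'id': 'article-body'}),
    ('div', {'id': 'sidebar'}),
]

old_spec = dict(id='article-body')
new_spec = dict(id=['article-body', 'page-title'])

kept_old = [a['id'] for n, a in page_tags if tag_matches(old_spec, n, a)]
kept_new = [a['id'] for n, a in page_tags if tag_matches(new_spec, n, a)]
```

With the old spec only the body survives, which is consistent with the missing titles; with the new spec the title element is kept as well.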
|
|
05-17-2010, 11:47 AM | #1927 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@pablofunes: Thanks, I've applied your change.
|
05-17-2010, 01:46 PM | #1928 |
Connoisseur
Posts: 98
Karma: 22
Join Date: Mar 2010
Device: IRiver Story, Ipod Touch, Android SmartPhone
|
New recipe
infomotori
Italian Car and Motorcycle News |
05-17-2010, 05:10 PM | #1929 |
award-winning bozo
Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
American Prospect Recipe
sdow1 - try this recipe. It's very simple, strips out all formatting at the moment. Code:
    import re
    from calibre.web.feeds.news import BasicNewsRecipe

    class AdvancedUserRecipe1273850169(BasicNewsRecipe):
        title = u'American Prospect'
        oldest_article = 7
        max_articles_per_feed = 100
        recursions = 0
        no_stylesheets = True
        remove_javascript = True
        keep_only_tags = [dict(name=['p','img'])]
        # (pattern, replacement) pairs run against the raw HTML, in order
        preprocess_regexps = [
            # drop carriage returns
            (re.compile('\r'), lambda match: ''),
            # strip everything in <head> except the <title>
            (re.compile(r'<head.*?<title>', re.DOTALL|re.IGNORECASE), lambda match: '<head><title>'),
            (re.compile(r'</title>.*?</head>', re.DOTALL|re.IGNORECASE), lambda match: '</title></head>'),
            # drop everything in <body> before the article container
            (re.compile(r'<body.*?<div class="pad_10L10R">', re.DOTALL|re.IGNORECASE), lambda match: '<body><div>'),
            # drop everything after the first closing </div>
            (re.compile(r'</div>.*</body>', re.DOTALL|re.IGNORECASE), lambda match: '</div></body>'),
        ]
        feeds = [(u'Articles', u'feed://www.prospect.org/articles_rss.jsp')]
Last edited by mwheinz; 05-17-2010 at 07:44 PM. |
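To see what the regular expressions in this recipe do, here is a small stand-alone sketch that runs the same five patterns over a made-up page. The `apply_regexps` helper and the sample HTML are assumptions for illustration; it only assumes that each (pattern, replacement) pair is applied in order to the raw HTML, and it does not use calibre itself.

```python
import re

# The same (pattern, replacement) pairs as in the recipe above
preprocess_regexps = [
    (re.compile('\r'), lambda match: ''),
    (re.compile(r'<head.*?<title>', re.DOTALL | re.IGNORECASE),
     lambda match: '<head><title>'),
    (re.compile(r'</title>.*?</head>', re.DOTALL | re.IGNORECASE),
     lambda match: '</title></head>'),
    (re.compile(r'<body.*?<div class="pad_10L10R">', re.DOTALL | re.IGNORECASE),
     lambda match: '<body><div>'),
    (re.compile(r'</div>.*</body>', re.DOTALL | re.IGNORECASE),
     lambda match: '</div></body>'),
]


def apply_regexps(html, rules):
    """Hypothetical helper: apply each substitution in order."""
    for pattern, replacement in rules:
        html = pattern.sub(replacement, html)
    return html


# Made-up page with the same overall shape the patterns expect
sample = (
    '<html><head><meta charset="utf-8"><title>Story</title>'
    '<script>track()</script></head>\r\n'
    '<body><div id="nav">menu</div>'
    '<div class="pad_10L10R"><p>Article text.</p></div>'
    '<div id="footer">footer</div></body></html>'
)

cleaned = apply_regexps(sample, preprocess_regexps)
print(cleaned)
```

Note that the last pattern cuts from the first closing `</div>` to the end of the body, so an article that contains nested divs would probably be truncated at that point.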
05-18-2010, 07:38 AM | #1930 |
Connoisseur
Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
mwheinz:
That looks like it works! Thanks so much for the help |
05-18-2010, 12:47 PM | #1931 | |
Connoisseur
Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
Quote:
|
|
05-18-2010, 01:20 PM | #1932 |
award-winning bozo
Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
American Prospect, Politifact, Factcheck
@Sdow1 - thanks for the tip, I don't normally read AP.
@everybody - Here's a bundle of 3 "political" recipes: the American Prospect, Factcheck and Politifact.
Last edited by mwheinz; 05-18-2010 at 01:32 PM. |
05-18-2010, 06:40 PM | #1933 |
Enthusiast
Posts: 49
Karma: 2062
Join Date: May 2010
Device: iPad (one)
|
http://www.tomdispatch.com/
I can't figure out how to get through FeedBurner to the Google feed to the actual articles... |
05-18-2010, 08:25 PM | #1934 |
award-winning bozo
Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
mlstein,
Try this: Code:
    from calibre.web.feeds.news import BasicNewsRecipe

    class TomDispatch(BasicNewsRecipe):
        title = u'TomDispatch'
        __author__ = u'Michael Heinz'
        oldest_article = 21
        max_articles_per_feed = 100
        # Note: BasicNewsRecipe spells this attribute 'recursions';
        # as written, this line has no effect
        recursion = 2
        use_embedded_content = False
        no_stylesheets = True
        publication_type = 'magazine'
        masthead_url = 'http://www.tomdispatch.com/application/images/site/tomdispatch_logo_v1.gif'
        cover_url = 'http://www.tomdispatch.com/application/images/site/tomdispatch_logo_v1.gif'
        remove_tags = [
            dict(name='div', attrs={'id':'postSideBar'}),
        ]
        keep_only_tags = [dict(name='div', attrs={'id':'mainWide'})]
        feeds = [
            (u'Articles', u'feed://feeds.feedburner.com/tomdispatch/esUU'),
        ]

        def get_article_url(self, article):
            # FeedBurner keeps the real article address in this field
            return article.get('feedburner_origlink', None)
|
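The `get_article_url` override in the recipe above returns `None` when an entry has no `feedburner_origlink` field. A possible variant, sketched here with plain dicts standing in for feed entries, falls back to the entry's ordinary `link`. The `pick_article_url` name and the example.com URLs are made up for illustration, and this fallback is an assumption, not something the recipe above does.

```python
def pick_article_url(article):
    """Prefer FeedBurner's original link, else the entry's own link.

    Hypothetical helper; `article` is anything with a dict-style .get().
    """
    return article.get('feedburner_origlink') or article.get('link')


# Made-up entries: one routed through FeedBurner, one plain
proxied_entry = {
    'link': 'http://feeds.example.com/~r/some/redirect',
    'feedburner_origlink': 'http://www.example.com/post/1',
}
plain_entry = {'link': 'http://www.example.com/post/2'}

proxied_url = pick_article_url(proxied_entry)
plain_url = pick_article_url(plain_entry)
empty_url = pick_article_url({})
```

Inside a recipe, the same expression could serve as the body of `get_article_url(self, article)`.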
05-18-2010, 08:41 PM | #1935 |
Junior Member
Posts: 1
Karma: 10
Join Date: May 2010
Device: Kindle
|
I'm new here, so I'm sorry if I'm not doing this right.
I couldn't find a recipe for Proceedings or Naval History magazine. Both have a free section that requires registration:
http://www.usni.org/magazines/proceedings/index.asp
http://www.usni.org/magazines/navalhistory/index.asp
Thanks a lot.
-----------
Besides that request, I'd like to say thanks for the The Economist (free) and Foreign Affairs (subscription) recipes; both worked pretty well on my Kindle. |
|