|
|
#1921 | ||
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
Code:
def preprocess_html(self, soup):
print 'The soup is: ', soup
return soup
Quote:
|
||
|
|
|
|
#1922 |
|
Connoisseur
![]() Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
I just wanted to jump in and thank folks for trying with the whole prospect thing. This is well above my computer language skills (which are limited to html/css), and I appreciate the effort.
Didn't realize what a can of worms I was opening though! |
|
|
| Advert | |
|
|
|
|
#1923 |
|
Connoisseur
![]() Posts: 98
Karma: 22
Join Date: Mar 2010
Device: IRiver Story, Ipod Touch, Android SmartPhone
|
Recipes
new recipe:
www.libero-news.it italian daily newspaper older recipe: L'Espresso italian weekly news -- better viewing, now all feeds work, and 2 new feeds. La Repubblica -- better viewing, now all feeds work , more efficient remove policy Le Scienze -- bettwer viewing, new feed |
|
|
|
|
#1924 |
|
Junior Member
![]() Posts: 2
Karma: 10
Join Date: May 2010
Device: lbook v3
|
Calibre not working with Instapaper fetch now
When Calibre fetches Instapaper, there is file generated and transferred successfully, but no content. The size of the file is 0.0 mb.
I think, Instapaper API changed. |
|
|
|
|
#1925 |
|
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
|
Updated recipe for instapaper.com:
|
|
|
| Advert | |
|
|
|
|
#1926 | |
|
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Nov 2009
Device: kindle2
|
submitting a patched recipe for new york review of books
Hi Kovid & Calibre community,
I've repaired the "new york review of books" recipe - one of Calibre's core recipes. It was missing all article's titles because of a change in the nybooks.com HTML configuration. Where should I submit the patch to? Regards, Pablo Funes PS: The patch is very simple. Where it says keep_only_tags = [dict(id='article-body')] It should be instead, keep_only_tags = [dict(id=['article-body','page-title'])] Quote:
|
|
|
|
|
|
#1927 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,626
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@pablofunes: Thanks, I've applied your change.
|
|
|
|
|
#1928 |
|
Connoisseur
![]() Posts: 98
Karma: 22
Join Date: Mar 2010
Device: IRiver Story, Ipod Touch, Android SmartPhone
|
New recipe
infomotori
Italian Car and MotorCicle News |
|
|
|
|
#1929 |
|
award-winning bozo
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
American Prospect Recipe
American Prospect Recipe
sdow1 - try this recipe. It's very simple, strips out all formatting at the moment. Code:
import re
class AdvancedUserRecipe1273850169(BasicNewsRecipe):
title = u'American Prospect'
oldest_article = 7
max_articles_per_feed = 100
recursions = 0
no_stylesheets = True
remove_javascript = True
keep_only_tags = [dict(name=['p','img'])]
preprocess_regexps = [
(re.compile('\r'),lambda match: ''),
(re.compile(r'<head.*?<title>', re.DOTALL|re.IGNORECASE), lambda match: '<head><title>'),
(re.compile(r'</title>.*?</head>', re.DOTALL|re.IGNORECASE), lambda match: '</title></head>'),
(re.compile(r'<body.*?<div class="pad_10L10R">', re.DOTALL|re.IGNORECASE), lambda match: '<body><div>'),
(re.compile(r'</div>.*</body>', re.DOTALL|re.IGNORECASE), lambda match: '</div></body>'),
]
feeds = [(u'Articles', u'feed://www.prospect.org/articles_rss.jsp')]
Last edited by mwheinz; 05-17-2010 at 08:44 PM. |
|
|
|
|
#1930 |
|
Connoisseur
![]() Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
mwheinz:
That looks like it works! Thanks so much for the help
|
|
|
|
|
#1931 | |
|
Connoisseur
![]() Posts: 55
Karma: 10
Join Date: Apr 2010
Location: new york city
Device: nook, ipad
|
Quote:
|
|
|
|
|
|
#1932 |
|
award-winning bozo
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
American Prospect, Politifact, Factcheck
@Sdow1 - thanks for the tip, I don't normally read AP.
@everybody Here's a bundle of 3 "political" recipes - the American Prospect, Factcheck and Politifact. Last edited by mwheinz; 05-18-2010 at 02:32 PM. |
|
|
|
|
#1933 |
|
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 49
Karma: 2062
Join Date: May 2010
Device: iPad (one)
|
http://www.tomdispatch.com/
I can't figure out how to get through feedburner to the google feed to the ctual articles... |
|
|
|
|
#1934 |
|
award-winning bozo
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 258
Karma: 172703
Join Date: Sep 2009
Location: Philadelphia
Device: Kobo Libra 2
|
mlstein,
Try this: Code:
class TomDispatch(BasicNewsRecipe):
title = u'TomDispatch'
__author__ = u'Michael Heinz'
oldest_article = 21
max_articles_per_feed = 100
recursion = 2
use_embedded_content = False
no_stylesheets = True
publication_type = 'magazine'
masthead_url = 'http://www.tomdispatch.com/application/images/site/tomdispatch_logo_v1.gif'
cover_url = 'http://www.tomdispatch.com/application/images/site/tomdispatch_logo_v1.gif'
remove_tags = [
dict(name='div', attrs={'id':'postSideBar'}),
]
keep_only_tags = [dict(name='div', attrs={'id':'mainWide'})]
feeds = [
(u'Articles', u'feed://feeds.feedburner.com/tomdispatch/esUU'),
]
def get_article_url(self, article):
return article.get('feedburner_origlink', None)
|
|
|
|
|
#1935 |
|
Junior Member
![]() Posts: 1
Karma: 10
Join Date: May 2010
Device: Kindle
|
I'm new here, so I'm sorry if I'm not doing this right.
I couldn't find any recipe for Proceedings or Naval History magazines, they both have a free section that requires a registration: http://www.usni.org/magazines/proceedings/index.asp http://www.usni.org/magazines/navalhistory/index.asp Thanks a lot. ----------- Beside that request, I'd like to thank the The Economist (free) and the Foreign Affair (subscription) recipes, both worked pretty good on my Kindle. |
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Custom column read ? | pchrist7 | Calibre | 2 | 10-04-2010 03:52 AM |
| Archive for custom screensavers | sleeplessdave | Amazon Kindle | 1 | 07-07-2010 01:33 PM |
| How to back up preferences and custom recipes? | greenapple | Calibre | 3 | 03-29-2010 06:08 AM |
| Donations for Custom Recipes | ddavtian | Calibre | 5 | 01-23-2010 05:54 PM |
| Help understanding custom recipes | andersent | Calibre | 0 | 12-17-2009 03:37 PM |