07-02-2011, 11:27 AM | #1 |
Junior Member
Posts: 4
Karma: 12
Join Date: Jun 2011
Device: none
|
[Solved] Engadget recipe - full article text
Is there a way to alter the Engadget recipe so that it downloads the entire article?
E.g. for the recent Windows Phone 7.5 Preview, the recipe only gives the first paragraph and then a link. This is fine for short articles, but having to load up the browser to read the rest of the longer ones seems (to me) to undermine the point of saving to an eBook format in the first place.
I don't mind having a go myself (I have some experience with Python), but I don't know where to start. I have looked at other recipes and the API docs, but the few things I thought might work just broke the recipe.
Thanks!
Last edited by UnWeave; 07-03-2011 at 09:27 AM. Reason: Clarification |
07-02-2011, 11:37 AM | #2 |
creator of calibre
Posts: 43,839
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
You'll need to set use_embedded_content=False, then add code to clean up the article HTML.
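For readers who want a starting point, here is a minimal sketch of what that advice could look like in a recipe. It is not the recipe posted later in this thread. The class name, the feed URL, and the div class names used for cleanup are all assumptions for illustration; the real selectors have to be read from the site's current page source. Only the attribute names (use_embedded_content, keep_only_tags, remove_tags, and so on) are standard BasicNewsRecipe options.

```python
# Sketch of a full-text recipe. It is meant to run inside calibre; the
# fallback below only lets the file be loaded elsewhere for inspection.
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:  # not running inside calibre
    BasicNewsRecipe = object


class EngadgetFullSketch(BasicNewsRecipe):  # hypothetical name
    title = 'Engadget (full text sketch)'
    oldest_article = 2            # days
    max_articles_per_feed = 50
    no_stylesheets = True

    # False: follow each feed item's link and download the article page
    # instead of using the summary embedded in the RSS feed.
    use_embedded_content = False

    # ASSUMPTION: feed URL, verify it against the site.
    feeds = [('Engadget', 'http://www.engadget.com/rss.xml')]

    # ASSUMPTION: hypothetical class names. Keep only the article body...
    keep_only_tags = [dict(name='div', attrs={'class': 'post_content'})]

    # ...and strip scripts, embeds and sharing widgets from what is kept.
    remove_tags = [
        dict(name=['script', 'object', 'iframe']),
        dict(name='div', attrs={'class': 'post_share'}),
    ]
```

Inside calibre, a recipe file like this can be tried out with ebook-convert, passing the .recipe file as input and using the --test option to fetch only a couple of articles while adjusting the selectors.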
|
07-02-2011, 08:56 PM | #3 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
Kovid - This can probably replace the existing Engadget recipe, but I've named it Engadget_Full. Spoiler:
|
|
07-03-2011, 09:26 AM | #4 |
Junior Member
Posts: 4
Karma: 12
Join Date: Jun 2011
Device: none
|
Awesome! Thank you for the quick (and very helpful) responses. Might have a go at writing a couple of my own in light of this.
|
07-03-2011, 09:50 AM | #5 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
I never saw the longer articles, and I didn't realize that the current recipe pulled the feed summaries, not the articles, so I never found anything to fix.
I saw your post and tried to find the article you referenced, but Engadget changes so quickly that the post had already scrolled off. (There was a sublink to that article in an article on their podcast, but no direct RSS link.) I almost posted that you were seeing all there was to see, but I had my wife review your post, and she said "He's right!"
It wasn't until I looked at the recipe and read Kovid's post that I realized it would never find a long article - it just grabbed summaries. I had to hunt through a dozen RSS feed links to find a long article, but from there it was fairly easy to write the recipe.
Let me know if you ever find anything missing. A few of the articles were formatted oddly. I fixed those in the recipe, but there could be more from time to time that it won't handle correctly. |
|
07-03-2011, 11:01 PM | #6 |
Junior Member
Posts: 4
Karma: 12
Join Date: Jun 2011
Device: none
|
I think that the article was probably gone from the feed before I posted - it was the only long article from them I could think of off the top of my head.
And thanks, again - it seems to be working perfectly so far, but if I do notice any problems with it I will let you know (and also offer a fix, if I can work one out). |