#1 |
onlinenewsreader.net
Posts: 324
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
New York Times recipe update
Changes to nytimes recipe:
Here are typical file sizes for various recipe options. Run time is roughly proportional to file size, so, for example, the Web version with all articles downloaded can take several hours.
Headlines only: 6MB
Today's Paper: 9MB
Web, 1 day: 14MB
Web, 7 day: 27MB
Web, all: 40MB
#2 |
Enthusiast
Posts: 42
Karma: 20
Join Date: Jan 2012
Device: Kindle Paperwhite
|
Works great!
Gave this a shot this morning and it works beautifully. Thanks also for stripping the mostly unnecessary "multimedia" images from the articles.
I'm curious about the file size, though. The non-Sunday paper seems about 2x-3x larger than the one created by the earlier recipe. So far it's working fine, but I'm slightly concerned that the Sunday edition will choke my Kindle PW (as has happened in the past with the Sunday NYT).
#3 | |
onlinenewsreader.net
Posts: 324
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
Quote:
The related articles and inline links are only processed for top-level articles. So if an inline linked or related article has inline links or related articles of its own, they are not processed (otherwise it could go on forever).
Of course, the inline linked and related articles increase the file size. You can prevent the inline links and related articles from being processed and downloaded by setting recursions=0. Note, however, that setting recursions=0 may prevent some articles that are preceded by an ad from being included. As I noted in the original message, there is still an intermittent problem with these articles, where the ad sometimes sneaks in and the article is downloaded as a subsidiary link. Setting recursions=0 would stop the subsidiary link from being downloaded.
One final note: I regularly feed my Kindle Keyboard (K3) 25MB news download files with no problems, although I suppose the PW could be more limited. Note that you can use the includeSections and excludeSections variables to control which sections are processed, so, for example, if you don't care for the Sports section you could set excludeSections=['Sports'] and bypass all of that content.
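For anyone who wants to see how these settings fit together, here is a rough stand-alone sketch. The variable names recursions, includeSections and excludeSections come from the post above. The helper section_is_wanted() and the rule that an empty includeSections list means "include everything" are assumptions made for illustration, not code taken from the recipe.

```python
# Illustrative sketch only; not copied from the nytimes recipe.
recursions = 0                 # 0 = do not follow inline links or related articles
includeSections = []           # ASSUMPTION: an empty list means "no restriction"
excludeSections = ['Sports']   # sections to bypass entirely


def section_is_wanted(name, include=None, exclude=None):
    """Hypothetical helper: decide whether a section should be processed."""
    include = includeSections if include is None else include
    exclude = excludeSections if exclude is None else exclude
    # If an include list is given, the section must appear in it.
    if include and name not in include:
        return False
    # Anything named in the exclude list is skipped.
    return name not in exclude


sections = ['World', 'Sports', 'Business']
kept = [s for s in sections if section_is_wanted(s)]
print(kept)
```

With the values shown, Sports is dropped and the other two sections are kept. In the real recipe these variables are simply edited near the top of the file before running the download.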
|
#4 | |
onlinenewsreader.net
Posts: 324
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
Quote:
Interestingly, this only seems to happen when there are multiple downloads taking place simultaneously. The recipe I submitted has simultaneous_downloads=1 and this seems to prevent the problem from arising. If anyone encounters the ads slipping in with simultaneous_downloads=1 please let me know. Otherwise I'll assume the slight increase in runtime from using simultaneous_downloads=1 is a reasonable solution to the problem. |
|
#5 | |
onlinenewsreader.net
Posts: 324
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
simultaneous_downloads>1 now OK
Quote:
|
|
#6 |
Junior Member
Posts: 3
Karma: 10
Join Date: Dec 2012
Device: nook hd+
|
Is it possible to add the date to the title, like this:
New York Times [Mo, 31 Dez 2012]
Thanks in advance.
#7 | |
onlinenewsreader.net
Posts: 324
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
Quote:
title = 'New York Times'+strftime(' [%a, %d %b %Y]')
Put it in right before decode_url_date but AFTER the other tests that set title according to other parameters.
I think a lot of people like the standard recipe to not include the date in the title because that way different issues of the same publication get stacked on their e-readers instead of appearing as distinct documents.
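As a quick way to check the format string outside calibre, here is a small stand-alone sketch. It assumes strftime is the one from Python's standard time module. The function dated_title() and its parameters are made up for illustration and are not part of the recipe.

```python
from time import localtime, strftime, strptime


def dated_title(base='New York Times', when=None):
    """Hypothetical helper: append a date such as ' [Mon, 31 Dec 2012]' to base."""
    # Default to the current local time when no date is supplied.
    when = localtime() if when is None else when
    # %a = abbreviated weekday, %d = day of month, %b = abbreviated month, %Y = year
    return base + strftime(' [%a, %d %b %Y]', when)


# Fixed date so the result is reproducible.
sample = strptime('2012-12-31', '%Y-%m-%d')
print(dated_title(when=sample))
```

Note that %a and %b are locale-dependent, so the day and month abbreviations follow whatever locale the Python process is running with. In the default C locale the sample date prints as "Mon, 31 Dec 2012"; getting "Mo, 31 Dez 2012" would need a German locale to be set.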
|
#8 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Aug 2007
Location: Petaling Jaya, Malaysia
Device: Kindle Fire HD 8.9, Kobo Aura HD, Sony PRS-950
|
I noticed that nytimes has changed its web layout recently, and because of this the current calibre nytimes recipe no longer works as nicely as before. Besides a large increase in file size, every article is now preceded by "1. Loading ..... " etc.
I would appreciate it if the original authors could look into this. The NYTimes recipe has been great and I enjoy the downloaded articles tremendously.