#1 |
Member
Posts: 21
Karma: 190
Join Date: Nov 2017
Device: Kindle paperwhite
|
New York Times
For the last couple of days, the New York Times recipes have retrieved only the section headers. I checked the print and web versions, plus Book Review, Tech Beat, and Sports Beat. TIA
|
#2 |
creator of calibre
Posts: 45,414
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Looks like the NYTimes has started requiring JavaScript and captchas. Sigh. We will probably have to get it from the Wayback Machine instead, although the Wayback Machine doesn't seem to be storing NYTimes articles in a timely fashion.
Another possibility is using the GraphQL API, though that will require some reverse engineering. |
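For anyone curious what the Wayback Machine route might look like, here is a minimal sketch. It assumes the Internet Archive's public availability endpoint (`https://archive.org/wayback/available`) and its usual `archived_snapshots.closest` response shape. The function names, the date, and the sample response are made up for illustration, and this is not necessarily what the calibre recipe does.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint: the Internet Archive's snapshot availability lookup.
AVAILABILITY_API = "https://archive.org/wayback/available"


def availability_url(article_url, timestamp=None):
    """Build the lookup URL for the snapshot closest to `timestamp` (YYYYMMDD)."""
    params = {"url": article_url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_API + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the archived URL from a lookup response, or None if nothing is stored."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


# Hypothetical response, shaped the way the endpoint is assumed to answer.
sample = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20220101000000/https://www.nytimes.com/",
            "timestamp": "20220101000000",
            "status": "200",
        }
    }
})

print(availability_url("https://www.nytimes.com/", "20220101"))
print(closest_snapshot(sample))
print(closest_snapshot('{"archived_snapshots": {}}'))
```

An empty `archived_snapshots` object is treated as "not stored yet", which is the case complained about above when snapshots lag behind publication.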
#3 |
Guru
Posts: 630
Karma: 85520
Join Date: May 2021
Device: kindle
|
It worked until now because of the Googlebot headers. Looks like the bot itself got blocked.
The GraphQL API route looks hard. |
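The header trick mentioned above amounts to sending a crawler-style User-Agent with each request. A rough sketch using only the standard library follows; the User-Agent string is an assumed example of the Googlebot format and `build_request` is a hypothetical helper. Nothing is sent here, the request object is only built.

```python
from urllib.request import Request

# Assumed example of a Googlebot-style User-Agent string.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_request(url, user_agent=GOOGLEBOT_UA):
    """Build (but do not send) a request carrying the given User-Agent."""
    return Request(url, headers={"User-Agent": user_agent})


req = build_request("https://www.nytimes.com/")
# urllib stores header names in capitalized form, hence "User-agent".
print(req.get_header("User-agent"))
```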
#4 |
Junior Member
Posts: 5
Karma: 10
Join Date: Feb 2021
Device: iPad mini
|
I downloaded the Sunday NY Times multiple times today and also retrieved only the section headers. This is terrible.
|
#5 |
Junior Member
Posts: 6
Karma: 10
Join Date: Jan 2021
Device: Kindle app on android tablet
|
Following thread as I'm having the same issue. Hope to see a solution soon.
|
#6 |
creator of calibre
Posts: 45,414
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
This fixes it for me, but I don't know how long it will remain fixed.
https://github.com/kovidgoyal/calibr...0c145c09025cf0 |
#7 |
onlinenewsreader.net
Posts: 328
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
New York Times
The New York Times and New York Times (Web) recipes already use GraphQL and work fine.
|
#8 |
creator of calibre
Posts: 45,414
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
#9 | |
Guru
Posts: 630
Karma: 85520
Join Date: May 2021
Device: kindle
|
Quote:
|
|
#10 |
onlinenewsreader.net
Posts: 328
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
|
True, but the article content is encapsulated in JSON within the article page itself, so fiddling with the UA is not necessary: the JSON is there with a standard UA header.
It seems that major publications are moving to content management systems that do this JSON encapsulation as a way of defeating simple screen scrapers. RSS feeds also seem to be going out of style, so relying on RSS for indexes will become increasingly unreliable. |
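To illustrate the JSON-in-the-page idea, here is a minimal sketch. It assumes the data sits in a `<script>` tag as an assignment to a global variable; the variable name `window.__preloadedData`, the sample HTML, and the field names are all assumptions for illustration, not a description of what any particular site serves.

```python
import json
import re

# Hypothetical page: article data embedded as a JavaScript assignment.
SAMPLE_HTML = """
<html><body>
<div id="app"></div>
<script>window.__preloadedData = {"article": {"headline": "Example headline", "body": ["First paragraph.", "Second paragraph."], "updated": undefined}};</script>
</body></html>
"""

# Capture the object literal that ends right before the closing script tag.
PATTERN = re.compile(
    r"window\.__preloadedData\s*=\s*(\{.*?\});\s*</script>", re.DOTALL
)


def extract_embedded_json(html):
    """Pull the embedded object out of the page and parse it as JSON."""
    match = PATTERN.search(html)
    if match is None:
        raise ValueError("embedded JSON not found")
    raw = match.group(1)
    # JavaScript's `undefined` is not valid JSON, so map it to null.
    # This is naive: it would also rewrite ": undefined" inside a string value.
    raw = re.sub(r":\s*undefined\b", ": null", raw)
    return json.loads(raw)


data = extract_embedded_json(SAMPLE_HTML)
print(data["article"]["headline"])
for paragraph in data["article"]["body"]:
    print(paragraph)
```

A regex is enough for a sketch like this, but a real page with nested script content or a different terminator would likely need a more careful parser.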
#11 |
Guru
Posts: 630
Karma: 85520
Join Date: May 2021
Device: kindle
|
|
#12 | |
creator of calibre
Posts: 45,414
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Quote:
|
|
#13 | |
Junior Member
Posts: 6
Karma: 10
Join Date: Jan 2021
Device: Kindle app on android tablet
|
Quote:
|
|
#14 |
Junior Member
Posts: 1
Karma: 10
Join Date: Jan 2022
Location: Brooklyn, New York, USA
Device: OG Kindle DX
|
The recipe appears to be broken again, at least for me. I last successfully downloaded the paper yesterday, 12 August. Today it fails no matter what tweaks I try (both the web and non-web recipes, resetting all options to default, etc.). I tried with and without a VPN, and from a different IP by using a personal hotspot.
These are the job details (spoiler tag used to hide the section of code before the errors appear): Code:
Last edited by Michael-E; Today at 11:56 AM. Reason: corrected last time the recipe ran successfully from 11 August to 12 August |