Thread: New York Times
View Single Post
Old 06-20-2025, 12:20 AM   #12
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,427
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Quote:
Originally Posted by nickredding View Post
True, but the article content is encapsulated in JSON within the article content so fiddling the UA is not necessary--the JSON is there with a standard UA header.

It seems that major publications are moving to a Content Management System that does this JSON encapsulation as a method of defeating simple screen scrapers. It also seems that RSS feeds are going out of style and relying on RSS in the future for indexes will be increasingly unreliable.
that's irrelevant, you will get a captcha page if you try to download without an appropriate user-agent. nytimes uses captcha-delivery.com on its article pages.
kovidgoyal is offline   Reply With Quote