Originally Posted by BobbyVan
Curious about the file size though. The non-Sunday paper seems about 2x-3x larger than the one created by the earlier recipe.
The increased file size is because of the inline links and related articles processing. If you look at the NYT on the web you will see a lot of stories have a sidebar called "Related" or similar, with links to related articles. Articles also have inline links to other, related articles. The new recipe collects these related and inline linked articles and downloads them behind the main article. When you are reading the main article on your device, if you click/tap on an inline link that was downloaded you will land on that article. Similarly, you will find a list of links under the title "Related" at the end of the main article, which are the articles from the "Related" sidebar. You can click/tap on these to proceed to the corresponding article.
The related articles and inline links are only processed for top-level articles. So if an inline linked or related article has inline links or related articles, they are not processed (otherwise it could go on forever).
Of course, the inline linked and related articles increase the file size. You can prevent the inline links and related articles from being processed and downloaded by setting recursions=0.
Note however that setting recursions=0 may prevent some articles that are preceded by an ad from being included. As I noted in the original message, there is still an intermittent problem with these articles, where the ad sometimes sneaks in and the article is downloaded as a subsidiary link. Setting recursions=0 would stop the subsidiary link from being downloaded.
One final note: I regularly feed my Kindle Keyboard (K3) 25MB news download files with no problems, although I suppose the PW could be more limited. Note that you can use the includeSections and excludeSections variables to control what sections are processed, so for example if you don't care for the Sports section you could set excludeSections=['Sports'] and bypass all of that content.