The NYT recipe works better than ever! Here are a few observations that might help to improve it even more:
- There are two standard recipes, both called "The New York Times". One is Web Version, the other not. It might be helpful to give them different names in the list, and explain the difference. I use the non-web version, which gives me much smaller files.
- Bylines run into the dates, giving Times reporters interesting names like O'LOUGHLINFEB. Surely there's a way to insert a space.
- Some articles appear in the section table of contents, but consist entirely of a URL, with no headline, byline, or content. Example from today's paper: "Rewrite Iran Deal? Europeans Offer a Different Solution: A New Chapter". I don't see any obvious reason why this article, and not others, would be blank.
Thanks for the continued progress!
Dan