Thank you, this recipe works very well! It's fantastic that it could be rewritten in a fraction of the code.
There are a couple differences from before, but these are cosmetic and non-essential:
-There's some cruft at the end of each article. It starts with "A version of this article appears in print on..." The old script did not have this. However, this is pretty easy to ignore.
-The article text contains the same hyperlinks from the website -- when tapped, accidentally or not, on a Kindle, they open up the slow-as-molasses Kindle browser. The old script seemed to erase the hyperlinks, which I never found useful (can't speak for others though). Again, non-essential and easy to ignore.
-The resulting files seem larger than before (10 MB vs 3-4 MB for a weekday paper, 75 minutes vs 15 minutes to process on a Raspberry Pi 2, both with the setting compress_news_images_auto_size = 16). I will tool around with compress_news_images_max_size to see if I can get this back down to the same file size / processing time as before.
Thank you again!
|