I’d be interested in seeing what you come up with.
The pages I download now seem to come from Cloudflare and contain little besides JavaScript that needs to be executed. Some of that script seems to build complex JSON requests to other URLs, which then return the actual HTML.
So it might need to be run inside an actual browser, or maybe Node.js (?).
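For what it's worth, here is a rough sketch of how that could look in Python. It is only an illustration under some assumptions: `looks_like_js_shell` and its 0.8 threshold are a made-up heuristic for spotting a page that is mostly script, and `fetch_rendered` assumes the third-party Playwright package (with a Chromium build) is installed. I have not checked whether a headless browser actually gets past the Cloudflare page, so treat it as a starting point rather than a working downloader.

```python
from html.parser import HTMLParser


class _TextVsScript(HTMLParser):
    """Count characters inside <script> tags versus all other text."""

    def __init__(self):
        super().__init__()
        self.in_script = False
        self.script_chars = 0
        self.text_chars = 0

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.in_script = True

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False

    def handle_data(self, data):
        n = len(data.strip())
        if self.in_script:
            self.script_chars += n
        else:
            self.text_chars += n


def looks_like_js_shell(html, threshold=0.8):
    """Hypothetical heuristic: True if most of the page's text is script.

    The threshold is an arbitrary guess, not something Cloudflare documents.
    """
    parser = _TextVsScript()
    parser.feed(html)
    parser.close()
    total = parser.script_chars + parser.text_chars
    return total > 0 and parser.script_chars / total >= threshold


def fetch_rendered(url, timeout_ms=30000):
    """Load the URL in headless Chromium and return the DOM after scripts ran.

    Assumes Playwright is installed; imported lazily so the heuristic above
    still works without it.
    """
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        try:
            page = browser.new_page()
            # Wait until network activity settles, so follow-up requests
            # made by the page's scripts have a chance to finish.
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()
```

The idea would be to download the page the usual way first, and only fall back to `fetch_rendered` when `looks_like_js_shell` says the result is basically all script.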