You might want to try a two-step process: send each article through something like outline.com first, by prepending it to the article URL, like this:
https://outline.com/www.vanityfair.c...amantha-markle
This standardizes the format, but it sometimes takes a while, so you'd want a hefty timeout plus a check on the returned URL.
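Here is a rough Python sketch of that idea, using only the standard library. The function names, the 60-second default timeout, and the rule used to check the returned URL (that it is still on outline.com) are my own assumptions, not anything the service documents, so adjust them to whatever you actually see coming back.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen

OUTLINE_PREFIX = "https://outline.com/"  # the service mentioned above


def build_outline_url(article_url: str) -> str:
    """Prepend the outline.com prefix to an article URL, dropping its scheme."""
    parts = urlsplit(article_url)
    if parts.scheme and parts.netloc:
        bare = parts.netloc + parts.path
        if parts.query:
            bare += "?" + parts.query
    else:
        # No scheme given (e.g. "www.example.com/story"): use the string as-is.
        bare = article_url
    return OUTLINE_PREFIX + bare


def returned_url_ok(returned_url: str) -> bool:
    """Assumed check: the URL we ended up on is still an outline.com page."""
    host = urlsplit(returned_url).hostname or ""
    return host == "outline.com" or host.endswith(".outline.com")


def fetch_via_outline(article_url: str, timeout: float = 60.0) -> str:
    """Fetch the standardized page with a generous timeout.

    Raises on timeout or HTTP error (from urlopen), or ValueError if the
    returned URL fails the assumed check above.
    """
    with urlopen(build_outline_url(article_url), timeout=timeout) as response:
        final_url = response.geturl()  # URL after any redirects
        if not returned_url_ok(final_url):
            raise ValueError(f"unexpected returned URL: {final_url}")
        return response.read().decode("utf-8", errors="replace")
```

You would call fetch_via_outline() once per article and run your existing parsing on the HTML it returns. Wrapping the call in a try/except lets you log the slow or failed ones and retry them later.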