Quote:
Originally Posted by sdow1
But the recipe itself still returns nothing but a table of contents and then a bunch of empty sections.
I took a look at the recipe. It seems that the page structure has changed. The regular expressions were too greedy for the new layout and removed nearly the whole page.
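To illustrate the general problem (these are not the actual patterns from the old recipe, and the markup below is made up), here is a small sketch of how a greedy `.*` can swallow everything up to the last closing tag, while the non-greedy `.*?` stops at the first one:

```python
import re

# Hypothetical page markup, invented for this illustration only.
html = (
    "<html><body>"
    "<div class='nav'>menu</div>"
    "<div class='article'><p>Article text.</p></div>"
    "<div class='footer'>footer</div>"
    "</body></html>"
)

# Greedy: .* extends to the LAST </div> on the page, so the article is removed too.
greedy = re.compile(r"<div class='nav'>.*</div>", re.DOTALL)

# Non-greedy: .*? stops at the FIRST </div>, so only the nav block is removed.
lazy = re.compile(r"<div class='nav'>.*?</div>", re.DOTALL)

greedy_result = greedy.sub("", html)
lazy_result = lazy.sub("", html)

print(greedy_result)
print(lazy_result)
```

Even the non-greedy version is fragile with nested tags, which is one reason to prefer tag-based cleanup over regular expressions.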
I've modified the recipe to use keep_only_tags and remove_tags to clean up the pages. The result now contains the articles.
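For anyone who wants to see the general shape of this approach, here is a minimal sketch. It is not the attached recipe: the class name, title, feed URL and the tag and class names are all placeholders I made up, and the real ones depend on the site's current markup. Only `keep_only_tags` and `remove_tags` (and the `dict(name=..., attrs=...)` form) are the standard calibre recipe options.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Outside calibre: stand-in base class so the sketch can still be loaded.
    class BasicNewsRecipe(object):
        pass


class ExampleRecipe(BasicNewsRecipe):
    # Placeholder title and feed URL, not taken from the attached recipe.
    title = 'Example Site'
    feeds = [('Articles', 'http://example.com/rss')]

    # Keep only the element that holds the article body
    # (hypothetical tag and class name).
    keep_only_tags = [
        dict(name='div', attrs={'class': 'article'}),
    ]

    # Within what is kept, drop leftover clutter
    # (hypothetical selectors).
    remove_tags = [
        dict(name='div', attrs={'class': 'share'}),
        dict(name='script'),
    ]
```

The idea is that `keep_only_tags` whitelists the article container and `remove_tags` then strips unwanted pieces inside it, so a layout change is less likely to wipe out the whole page than a broad regular expression.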
aprospect.recipe.txt