11-18-2010, 08:06 AM
The method preprocess_html() gets a "soup" argument, but I have a situation where the article being fetched is a request for authentication. After doing so, one is expected to re-fetch the URL. Is the URL in the soup object or, better yet, is the article object (with title, URL, description, and date) available in the BasicNewsRecipe object (i.e., self)? I would love to add more attributes (e.g., byline) to the article object and have that available to preprocess_html() so that I can add more stuff to the fetched article. Thanks!