Quote:
Originally Posted by marbs
i cant belive i got there. thank you very much Starson!
|
Congratulations!
Quote:
now i am not sure what i do with all the new pages i can get. how do i finish append_page i dont see it returns anything in this example or any of the others. some more help?
|
append_page does nothing until it is used in preprocess_html as:
self.append_page(soup, soup.body, 3)
It's recursive, and grabs the current page in soup form from the "soup" parameter of the article being processed in preprocess_html. That page will have a "Next Page" button or equivalent, and when append_page is correctly written, it creates a new url from the url in the "Next Page" button, grabs the content of that new page, tacks it on to the bottom of the content in the current page, then recursively does it again, finding rhe "Next Page" button on page 2 to go to page 3, etc.
Quote:
edit:
it seems like br.follow_link does not actually open a page in the browser, it gets the responce, but i dont know how to have br. have the new page in it. is there a way to open the link or read the response somehow as a web page in the browser?
|
Look at any recipe that uses obfuscated feed links to print pages to see how it's usually done.