View Single Post
Old 07-07-2015, 10:40 PM   #3
anisotrope
Junior Member
anisotrope began at the beginning.
 
Posts: 5
Karma: 14
Join Date: Jul 2015
Device: none
Quote:
Originally Posted by kovidgoyal View Post
You can check that by passing raw=True to index_to_soup then it will return the raw html it got from the server. Save that to a file and examine it.
Hmm, I checked this:

Code:
soup = self.index_to_soup(self.FRONTPAGE, raw = True)
and I don't get the full page, which is consistent with what I was seeing. However, when I turn off javascript with Firefox, I still get the full page at the URL in the browser.

What else can I do to try to download the enitre page?
anisotrope is offline   Reply With Quote