Quote:
Originally Posted by bowbow
Some ideas on the login-procedure:
I have no clue about python programming, but the logic behind the login is the following:
- RSS for all articles: http://www.changex.de/Feed/Home/RSS20
- Leads to first subscribers-only page: http://www.changex.de/Article/report...g_fuer_bildung
- IF not logged in, THEN <div class="subscribers weiterlesen">
- IF div class=subscribers weiterlesen", THEN a, page + ?login, e.g. http://www.changex.de/Article/report..._bildung?login
- WHEN ?login, THEN prompt for username & password AND fill <input id="nutzername" type="text" value="" name="username"> AND fill <input id="passwort" type="password" value="" name="password">
- WHEN filled in, THEN <button class="login-send" type="submit">
- You should now get the page with its full content
May anyone help me translating this into python?
Cheers!
|
Look at any of the subscription recipes or
here.
AFAICT, you haven't described how the site determines if you are logged in. Usually it's cookies, so the recipe needs to go to the login page before following any of the article links. The link above shows you how to go to the right page, send the user/password and the recipe then sets the correct login cookies and never sees the redirect to the login page that you seem to be describing. If it's something other than cookies (headers, etc.) then you may need more than the basic tools already built in for handling this.