Quote:
Originally Posted by XanthanGum
I guess what I need to know is:
- When you guys come up with a well-working recipe for a site such as the New York Times or New Statesman, are you looking at the source HTML code from the site?
|
Yes. The quickest way to browse the HTML is to use Firefox with the Firebug plugin.
Quote:
Originally Posted by XanthanGum
How do you know what tags to remove, for example?
|
That is something you pick up with time.
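As a rough illustration of what the result usually looks like, here is a minimal sketch using the keep_only_tags and remove_tags attributes of BasicNewsRecipe. The class name, tag names, and ids/classes below are made up for the example; the real ones come from inspecting the site's HTML. The fallback class is only there so the snippet also parses outside calibre.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so the sketch can be read/checked outside calibre.
    class BasicNewsRecipe(object):
        pass


class TagCleanupSketch(BasicNewsRecipe):
    # Hypothetical recipe: every selector below is an assumption,
    # not taken from any real site.
    title = 'Tag cleanup sketch'

    # Keep only the element that wraps the article body.
    keep_only_tags = [dict(name='div', attrs={'id': 'article-body'})]

    # Inside what was kept, drop things that are not article text.
    remove_tags = [
        dict(name='div', attrs={'class': 'advertisement'}),
        dict(name='div', attrs={'class': 'share-tools'}),
        dict(name=['script', 'iframe']),
    ]
```

In practice you find the wrapper of the article in Firebug, put it in keep_only_tags, and then add to remove_tags whatever clutter is still left in the output.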
Quote:
Originally Posted by XanthanGum
- How do you fetch an entire article from a news site? What code segment does that?
|
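Setting use_embedded_content to False does this. calibre then follows the link of each feed item and downloads the full article page instead of using the text embedded in the feed.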
Code:
use_embedded_content = False
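To show where that line goes, here is a minimal sketch of a complete recipe. The class name, title, and feed URL are placeholders, not a real site. The fallback class is only there so the snippet also parses outside calibre.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so the sketch can be read/checked outside calibre.
    class BasicNewsRecipe(object):
        pass


class FullArticleSketch(BasicNewsRecipe):
    # Hypothetical recipe: title and feed URL are placeholders.
    title = 'Full article sketch'
    oldest_article = 7            # skip items older than this many days
    max_articles_per_feed = 25    # cap on items taken from each feed

    # Do not use the summary embedded in the feed; fetch the linked page.
    use_embedded_content = False

    # List of (section title, feed URL) pairs.
    feeds = [
        ('Example section', 'https://example.com/rss'),
    ]
```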
Quote:
Originally Posted by XanthanGum
Would the same code solve the Ars Technica problem or would it have to be changed in some way?
|
Yes, it would.
What you really need to do is read the documentation for BasicNewsRecipe and look at the actual code yourself, which is generally well commented.
The rest you can deduce from the multitude of existing recipes. Start with the simpler ones. The New York Times recipe is one of the more complex ones and is not recommended for beginners.