Quote:
Originally Posted by wintoid
Thanks again. Yes, what I'm hoping for is to get all "posts" within the "thread". I've programmed C before but not Python, and am completely new to Calibre. Is there an example somewhere I can see that iterates through the "posts"?
Also, does "article" mean the actual main "threads" or the "posts" within the "threads"?
Thanks for all your help
|
Calibre terminology is that the "feed" is a link to a page that has links to "articles." I checked the links, and in this case the "article" is the first page in a new thread. Depending on whether there are any replies to the first post, there may be additional pages. You might find something useful in the builtin slashdot recipe. I know there are some other forum recipes, but I don't recall any names.
You don't need to iterate through posts. They will all be on the linked "article" page, although in a very active thread, if you run the recipe late, there might be additional pages you would need to follow to get everything. You will need to strip the stuff you don't want. Read the "Using News Recipes" sticky here, particularly at the end for some useful starting links.