10-07-2005, 09:12 AM | #16
Nameless Being
hi alex,
still loving this a lot.
i wonder if there's a way we can think of to architect a solution. i only track 5 or so boards, but regularly miss messages from xmsr, a very popular board (it used to be even more so, regularly getting 3 posts a minute a while back).

i assume you cache only the headlines that you've scraped (for lack of a better word) for a particular board, and i assume you only cache those boards that have at least one subscriber. could there not be a way to scrape only the headlines starting from the last message # retrieved, instead of the last 40? and if there is no 'protocol' way to request messages from #X on, can you not then page back until you've gotten a page containing the next # after your most recent one?

your yahoo server hitting frequency would not have to change, and in fact could increase; net efficiency would actually be much greater, because right now the work is skewed toward re-retrieving the same headlines from infrequent boards. you would then merely have to store the headlines (potentially a lot more of them, but it's not a large amount of data per headline) for, let's say, a week, and then roll them off.

apologies if i've assumed incorrectly how you've implemented it, or if you have thought about all this already and it can't be done! (in another time and place and life i used to do nothing but worry about this very issue.)
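to make the idea concrete, here is a rough python sketch of what i mean. it is only an illustration under assumptions: i don't know how the real scraper works, so `fetch_page`, `fetch_new_headlines`, `prune_old`, and the page layout (newest first, page 0 being the most recent) are all hypothetical names and guesses, not anything yahoo or the existing code actually provides.

```python
import time
from typing import Callable, Dict, List, Tuple

# A headline is (message_number, title). This shape is an assumption.
Headline = Tuple[int, str]

WEEK_SECONDS = 7 * 24 * 3600


def fetch_new_headlines(
    fetch_page: Callable[[int], List[Headline]],
    last_seen: int,
    max_pages: int = 50,
) -> List[Headline]:
    """Page backwards until we reach messages we already have.

    fetch_page(i) is a hypothetical callable returning page i of a board,
    newest first, with page 0 being the most recent page.
    """
    found: Dict[int, str] = {}
    for page in range(max_pages):
        batch = fetch_page(page)
        if not batch:
            break  # ran off the end of the board
        for number, title in batch:
            if number > last_seen:
                # Keyed by number, so a message seen twice (pages can
                # shift while we are paging) is stored only once.
                found[number] = title
        # Stop once this page reaches back to the next # after the most
        # recent one we hold (or further).
        if min(number for number, _ in batch) <= last_seen + 1:
            break
    return sorted(found.items())


def prune_old(
    store: Dict[int, Tuple[str, float]],
    now: float,
    max_age: float = WEEK_SECONDS,
) -> None:
    """Roll off cached headlines older than max_age seconds, in place.

    store maps message number -> (title, time_fetched).
    """
    stale = [n for n, (_, fetched) in store.items() if now - fetched > max_age]
    for n in stale:
        del store[n]


if __name__ == "__main__":
    # Fake board with messages 1..100, 40 per page, newest first.
    def fake_page(page: int) -> List[Headline]:
        numbers = list(range(100, 0, -1))
        return [(n, f"msg {n}") for n in numbers[page * 40:(page + 1) * 40]]

    new = fetch_new_headlines(fake_page, last_seen=30)
    cache = {n: (title, time.time()) for n, title in new}
    prune_old(cache, now=time.time())
    print(len(new), "new headlines, oldest is #", new[0][0])
```

a quiet board costs a single page request per poll this way, while a busy one like xmsr costs as many pages as it takes to catch up, so nothing gets missed between polls.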
|
10-07-2005, 09:15 AM | #17
Nameless Being
oops, i meant to say your yahoo server hitting frequency could decrease.