NY Times - How Remove Garbage?
I just started using Sunrise. Overall I really like it, especially now that I've gotten logins to work! But one problem remains ... I'm getting a huge amount of "junk" on each page from such sites as NYTimes Sports & NYPost Sports. By "junk" I means links & text not related to sports articles. Also, it would be great to get the formatted for printing version of each article.
I've tried Showcase.sdl for NYTimes. But this uses the RSS feed, which contains very few articles. I've tried the provided .XSL for NYTimes ... but it not only deletes the junk, it also deletes all useful info!
I just remembered ... I forgot to try the NYTIMES-FEEDS.JS file, which I noticed is used in Showcase. So, I'll give that a try. The printing version would get rid of most, but not all, the junk. And I still have the problem with NYPost. I just looked & it uses PHP to get the printing version, so at the very least, modifying the URL will be more difficult; does anybody know if the same technique will work with PHP?
One other thing ... I've looked around, but so far have not found a Windows app to display Plucker files, or at least to convert a Plucker .PDB to a viewable format; that would make testing this stuff a lot easier than doing a hotsync every time.
Thanks!
Dave
|