Thread: Rss2Book
Old 03-03-2008, 04:07 AM   #234
moz
Addict
Posts: 368
Karma: 1553
Join Date: Feb 2008
Location: Melbun
Device: Sony PRS-505
The page is good but there's only source code to download. Can you add a link to where I can download the executable version? Presumably http://www.download.com/3000-20-10649163.html ? I'm about to have a play now.

Hmm, www.publicaddress.net doesn't work as either RSS or web site: as a web page it throws System.UriFormatException, and as RSS I only get headlines. http://blog.greens.org.nz/index.php/feed/ just doesn't work, with no output at all. mozbike.blogspot.com causes it to hang, and http://mozbike.blogspot.com/feeds/posts/default just produces no output. The log window gets a bit annoying after a while. Do you absolutely have to show it every time? I think you might need to write a wizard to set up feeds, or make the inputs more tolerant. But at least Process Explorer can kill it when it hangs, that bit works.
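By "more tolerant" I mean something along these lines. This is only a rough sketch in Python, not the app's own code, and it assumes (I'm guessing here) that the UriFormatException comes from typing an address without the http:// in front. The function name normalize_feed_url is made up for illustration.

```python
from urllib.parse import urlparse


def normalize_feed_url(raw: str) -> str:
    """Return an absolute http(s) URL, adding http:// when no scheme was typed.

    Hypothetical helper: the name and behaviour are assumptions for
    illustration, not anything taken from Rss2Book itself.
    """
    candidate = raw.strip()
    if not candidate:
        raise ValueError("empty feed address")
    # Assumption: a bare host such as "www.publicaddress.net" means http.
    if "://" not in candidate:
        candidate = "http://" + candidate
    parts = urlparse(candidate)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not a usable feed address: {raw!r}")
    return candidate
```

With that in place, typing www.publicaddress.net or mozbike.blogspot.com would be treated the same as typing the full http:// address, and anything that still can't be parsed gets a readable error instead of an exception dump.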

Cool, http://smh.com.au/text works. Is there any way to tell it "ignore links that don't start with http://smh.com.au/text"? Also, converting this to RTF takes a very long time (I think; after a minute or two I killed it). It looks as though you just cat the HTML of all the links together. Perhaps it would be better if you at least removed all the extra HTML and BODY tags? I suspect that stripping the non-text content would help too, as the HTML page currently produced has all sorts of images and formatting as well as embedded scripts and styles. Using OpenOffice Writer to import the HTML is slow to the point where I killed that too. Using a text editor to remove the start and end blocks of "stuff" plus all the img and href tags makes it possible to load the HTML. Now I need to use MS-Office to convert HTML to RTF because I don't (yet) have a better solution.
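To show what I'm after, here is a minimal sketch in Python using only the standard library. It is not how Rss2Book works internally (I'm only guessing at that from the output), and the names wanted, page_text and merge_pages are made up for illustration. It keeps only links with a given prefix, drops scripts, styles and images from each page, and writes everything into a single HTML/BODY pair.

```python
from html import escape
from html.parser import HTMLParser

SKIP = {"script", "style"}  # content inside these tags is discarded
BLOCK = {"p", "div", "br", "h1", "h2", "h3", "li", "tr"}  # start a new line


class TextOnly(HTMLParser):
    """Collect visible text; img and other tags carry no text, so they vanish."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP:
            self.skip_depth += 1
        elif tag in BLOCK:
            self.chunks.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if not self.skip_depth:
            self.chunks.append(data)


def wanted(links, prefix):
    """Keep only links that start with the given prefix."""
    return [url for url in links if url.startswith(prefix)]


def page_text(html_source):
    """Return the non-empty text lines of one page, whitespace collapsed."""
    parser = TextOnly()
    parser.feed(html_source)
    parser.close()
    lines = (" ".join(line.split()) for line in "".join(parser.chunks).splitlines())
    return [line for line in lines if line]


def merge_pages(pages):
    """Join several pages into one document with a single HTML and BODY tag."""
    body = []
    for source in pages:
        body.extend(f"<p>{escape(line)}</p>" for line in page_text(source))
    return "<html><body>\n" + "\n".join(body) + "\n</body></html>"
```

A result like that should be small enough for OpenOffice or Word to open without choking, though I haven't measured how much it helps the RTF conversion.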

Clicking Help-Report Bug takes me to a page that doesn't work.

I will download the source and have a play sometime, I hope.

Last edited by moz; 03-03-2008 at 06:08 AM. Reason: more discoveries.