Shiny New E-Book Gizmo: The Amazon Kindle


View Full Version : Splicing parts of webpages


emceephd
03-06-2003, 11:46 AM
I was wondering if there was an elegant (program available) or other (more challenging) way to splice a webpage for viewing in iSilo. I have a favorite news website that includes lots of nice content but do not want to download the whole front page (e.g. ads, subscriber links). I read the source code and saw that there are comments that begin and the content I want.

Any ideas?

Alexander Turcic
03-06-2003, 11:56 AM
Hmm this is my personal most wanted feature for a future version of iSiloX. To define what parts of a web page to download through Regular Expressions for instance. Currently there is no direct way of doing this. You can use some other tool like Sitescooper which does exactly this, prepares the web page content for you and runs it through the console version of iSiloX at the end.

gvtexas
03-06-2003, 12:47 PM
Originally posted by emceephd@Mar 6 2003, 11:46 AM
I was wondering if there was an elegant (program available) or other (more challenging) way to splice a webpage for viewing in iSilo.
You can do this cleanly with HandStory's scripting language. It works by identifying points in the page code to start and stop. But I haven't mastered it yet to the point where I can reliably apply it to a page, nor has anyone else that I've seen comment about the script language. But it is possible, and it does work by grabbing only the sections of the page you want.

Alexander Turcic
03-06-2003, 01:03 PM
Had a longer icq chat with emceephd and we solved the "problem". Wrote him a quick perl script that does its job using RegEx.

Tell me guys if you want me to attach it here... for you to see how it works.

Alexander Turcic
03-06-2003, 02:30 PM
Ok I attached it. Must be installed as a perl script.

If you have interest, we can start a little thread on how to do regex and write scripts such as this one.

Only if you are interested.... :D

Cheers!

PS: You must save the file and rename it to weekly.cgi before using it.

emceephd
03-07-2003, 09:07 AM
A thread would be a great idea. We're all pretty lucky that Alexander is providing this site. Thanks so much!