Hi all,
Here are two Italian sitescooper .site files I've made. Some of the PreProcess lines are probably unnecessary, since I used my Die Zeit files as a template. Without the character substitutions, though, special characters turn out lÃ²ÃŸike this on my Palm!
Here's Reuters Italia top news:
Code:
URL: http://today.reuters.it/news/default.aspx
Name: Reuters Italienisch
Description: Reuters Nachrichten auf Italienisch
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- Left Column -->
ContentsEnd: <td colspan="2"><img border="0" src="images/clear.gif"
StoryURL: http://today.reuters.it/PrinterFriendlyPopup.aspx\?type=topNews\&storyID=.*
StoryToPrintableSub: s,^http://today.reuters.it/news/.ewsArticle.aspx\?type=topNews\&storyID=(.*)\S*,http://today.reuters.it/PrinterFriendlyPopup.aspx\?type=topNews\&storyID=uri\%3a$1,
StoryStart: <td id="StoryDataCell" colspan="2" valign="top">
StoryEnd: <td colspan="2" align="right">
ContentsUseTableSmarts: 1
ImageURL: .*\.jpg.*
ContentsHTMLPreProcess: {
s,Ã²,ò,gis;
s,Ã©,é,gis;
s,Ã¹,ù,gis;
s,Ã¬,ì,gis;
s,Ã¨,è,gis;
s,Ã ,à,gis;
s,Â©,©,gis;
s,Ã¤,ä,gis;
s,Ã¼,ü,gis;
s,ÃŸ,ß,gis;
s,Ã¶,ö,gis;
s,Ãœ,Ü,gis;
s,Ã–,Ö,gis;
s,Ã„,Ä,gis;
s,<span class="artTitle">(.*?)</span>,<b>$1</b>,gis;
s,<span class="newsDate">(.*?)</span>,<br><i>$1</i>,gis;
s,<td colspan="2" valign="top" class="medium">,<HR>,gis;
}
StoryHTMLPreProcess: {
s,Ã²,ò,gis;
s,Ã©,é,gis;
s,Ã¹,ù,gis;
s,Ã¬,ì,gis;
s,Ã¨,è,gis;
s,Ã ,à,gis;
s,Â©,©,gis;
s,Ã¤,ä,gis;
s,Ã¼,ü,gis;
s,ÃŸ,ß,gis;
s,Ã¶,ö,gis;
s,Ãœ,Ü,gis;
s,Ã–,Ö,gis;
s,Ã„,Ä,gis;
s,<a.*?>,,gis;
s,<br.*?>,,gis;
s,<hr.*?>,,gis;
s,<span class="artTitle">(.*?)</span>,<b>$1</b>,gis;
s,<span class="newsDate">(.*?)</span>,<br><i>$1</i>,gis;
s,<td colspan="2" valign="top" class="medium">,<HR>,gis;
}
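A note on the character lines at the top of the PreProcess blocks: the garbage they clean up looks like what appears when UTF-8 bytes are read as Latin-1, so that each accented letter turns into two characters (for example `Ã` followed by a second symbol). That cause is an assumption, not something confirmed for these sites. The Python below is only a rough sketch of the effect; the function names are made up for illustration and nothing here is part of sitescooper.

```python
# Assumption: the page is served as UTF-8 but decoded as Latin-1 somewhere
# along the way. Function names are hypothetical, for illustration only.

def misread_utf8_as_latin1(text: str) -> str:
    """Simulate the garbling: encode as UTF-8, then decode as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(garbled: str) -> str:
    """Reverse the garbling: back to the raw bytes, then decode as UTF-8."""
    return garbled.encode("latin-1").decode("utf-8")


# Show the two-character sequence produced for each Italian accented letter.
for ch in "òéùìèà":
    garbled = misread_utf8_as_latin1(ch)
    print(repr(ch), "->", repr(garbled), "->", repr(repair(garbled)))
```

Each `s,...,...,gis;` line in the site file does by hand, for one character, what `repair` does for all of them at once.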
Code:
URL: http://www.panorama.it/mondo/index.html
Name: Panorama Mondo
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- Riga 2 -->
ContentsEnd: <!-- fine Riga 9 -->
StoryURL: http://www.panorama.it/home/stampa/articolo/ix1-.*/idxsl1-stampaarticolo
StoryToPrintableSub: s,^http://www.panorama.it/.*/.*/articolo/(ix1-.*)\S*,http://www.panorama.it/home/stampa/articolo/$1/idxsl1-stampaarticolo,
StoryStart: <span class="txtliv3b">
StoryEnd: <!-- fine Riga 1 -->
ContentsUseTableSmarts: 0
ContentsHTMLPreProcess: {
s,Ã²,ò,gis;
s,Ã©,é,gis;
s,Ã¹,ù,gis;
s,Ã¬,ì,gis;
s,Ã¨,è,gis;
s,Ã ,à,gis;
s,Â©,©,gis;
s,Ã¤,ä,gis;
s,Ã¼,ü,gis;
s,ÃŸ,ß,gis;
s,Ã¶,ö,gis;
s,Ãœ,Ü,gis;
s,Ã–,Ö,gis;
s,Ã„,Ä,gis;
s,<span class="artTitle">(.*?)</span>,<b>$1</b>,gis;
s,<span class="newsDate">(.*?)</span>,<br><i>$1</i>,gis;
s,<td colspan="2" valign="top" class="medium">,<HR>,gis;
}
StoryHTMLPreProcess: {
s,Ã²,ò,gis;
s,Ã©,é,gis;
s,Ã¹,ù,gis;
s,Ã¬,ì,gis;
s,Ã¨,è,gis;
s,Ã ,à,gis;
s,Â©,©,gis;
s,Ã¤,ä,gis;
s,Ã¼,ü,gis;
s,ÃŸ,ß,gis;
s,Ã¶,ö,gis;
s,Ãœ,Ü,gis;
s,Ã–,Ö,gis;
s,Ã„,Ä,gis;
s,<a.*?>,,gis;
s,<br.*?>,,gis;
s,<hr.*?>,,gis;
s,<span class="artTitle">(.*?)</span>,<b>$1</b>,gis;
s,<span class="newsDate">(.*?)</span>,<br><i>$1</i>,gis;
s,<td colspan="2" valign="top" class="medium">,<HR>,gis;
}
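To see what the Panorama StoryToPrintableSub line does to a link, here is a small Python sketch that applies the same pattern and replacement (Perl's `$1` becomes `\1`). The example article URL is invented purely for illustration and is not a real Panorama address; the helper name `to_printable` is also hypothetical.

```python
import re

# Pattern and replacement copied from the StoryToPrintableSub line above.
PATTERN = r"^http://www.panorama.it/.*/.*/articolo/(ix1-.*)\S*"
REPLACEMENT = r"http://www.panorama.it/home/stampa/articolo/\1/idxsl1-stampaarticolo"

# The StoryURL pattern from the same file, used to check the rewritten link.
STORY_URL = r"http://www.panorama.it/home/stampa/articolo/ix1-.*/idxsl1-stampaarticolo"


def to_printable(url: str) -> str:
    """Rewrite an article link to its printer-friendly form, if it matches."""
    return re.sub(PATTERN, REPLACEMENT, url)


# Hypothetical article URL, made up for this example.
example = "http://www.panorama.it/mondo/europa/articolo/ix1-EXAMPLE123"
printable = to_printable(example)
print(printable)

# The rewritten link should be accepted by StoryURL.
print(bool(re.fullmatch(STORY_URL, printable)))
```

Links that do not contain `/articolo/ix1-`, such as the index page itself, pass through unchanged, so only article links end up pointing at the print view.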
Regards, Geoffrey