Old 08-03-2004, 01:34 AM   #5
geoffreynz

Glad that I could contribute. I also made NYT ones for National and Arts; like I said, they're virtually identical apart from the URLs. I just fixed up "Die Zeit" at the weekend, and it rocks! Enjoy, and please post any new site files you make. I really want site files for the Washington Post. Because of the registration requirement this could be difficult, but there must be a way somehow, like there was for the NYT. Does anyone have any ideas?

Geoffrey

INDEX:
Die Zeit x5
gms Reise
NYT x2
Sunday Herald x7

URL: http://www.zeit.de/feuilleton/index
Name: Zeit Feuilleton
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <td width="20px"><img alt="" src="http://zeus.zeit.de/images/transparent_pixel.gif" width="20px"
ContentsEnd: src="http://zeus.zeit.de/images/transparent_pixel.gif" valign="bottom" align="right" width="65"
StoryURL: http://www.zeit.de/\d+/\d+/.*
StoryStart: <div class="text">
StoryEnd: <p class="mainnavigation">
StoryHTMLPreProcess: {
s,..8222;,,gis;
}
ContentsHTMLPreProcess: {
s,..8222;,,gis;
}
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/.*/.*\.jpg
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/politik/.*\.gif
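
The `s,..8222;,,gis;` rule in both pre-process blocks appears to delete the numeric entity `&#8222;` (the German low opening quotation mark), with `..` standing in for the `&#` characters. That reading is an assumption, not something the file states. A rough Python equivalent, using a made-up sample string:

```python
import re

# Perl's s,..8222;,,gis; removes any two characters followed by "8222;".
# Flags: g = every match, i = ignore case, s = "." also matches newlines.
ENTITY_RULE = re.compile(r"..8222;", re.IGNORECASE | re.DOTALL)


def strip_low_quote_entity(html: str) -> str:
    """Remove every '..8222;' match, for example the entity '&#8222;'."""
    return ENTITY_RULE.sub("", html)


# Hypothetical input: the opening entity is removed, the closing one is kept.
sample = "Er sagte: &#8222;Guten Tag&#8220;."
print(strip_low_quote_entity(sample))
```

Because `..` matches any two characters, the rule would also swallow unrelated text that happens to end in `8222;`; in practice that is unlikely to matter.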

URL: http://www.zeit.de/literatur/index
Name: Zeit Literatur
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <td width="20px"><img alt="" src="http://zeus.zeit.de/images/transparent_pixel.gif" width="20px"
ContentsEnd: src="http://zeus.zeit.de/images/transparent_pixel.gif" valign="bottom" align="right" width="65"
StoryURL: http://www.zeit.de/\d+/\d+/.*
StoryStart: <div class="text">
StoryEnd: <p class="mainnavigation">
StoryHTMLPreProcess: {
s,..8222;,,gis;
}
ContentsHTMLPreProcess: {
s,..8222;,,gis;
}
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/.*/.*\.jpg
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/politik/.*\.gif

URL: http://www.zeit.de/politik/
Name: Zeit Politik
AuthorName: Geoffrey Miller
Levels: 2
# ContentsStart: <td width="20px"><img alt=""
ContentsStart: <img alt="" border="0" src="http://zeus.zeit.de/bilder/elemente/aktuelle_ausgabe_386.gif" align="center" vspace="0" width="386" class="teaserimage">
# ContentsEnd: <td width="100%">
ContentsEnd: <td width="140px" valign="top">
StoryURL: http://www.zeit.de/\d+/\d+/.*
StoryStart: <div class="text">
StoryEnd: <p class="mainnavigation">
# StoryEnd: <p class="mainnavigation"><a href="#top">ZUM ARTIKELANFANG</a></p>
StoryHTMLPreProcess: {
s,..8222;,,gis;
}
ContentsHTMLPreProcess: {
s,..8222;,,gis;
}
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/.*/.*\.jpg
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/politik/.*\.gif

URL: http://www.zeit.de/reisen/index
Name: Zeit Reisen
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <td width="20px"><img alt="" src="http://zeus.zeit.de/images/transparent_pixel.gif" width="20px"
ContentsEnd: src="http://zeus.zeit.de/images/transparent_pixel.gif" valign="bottom" align="right" width="65"
StoryURL: http://www.zeit.de/\d+/\d+/.*
StoryStart: <div class="text">
StoryEnd: <p class="mainnavigation">
# StoryEnd: <p class="mainnavigation"><a href="#top">ZUM ARTIKELANFANG</a></p>
StoryHTMLPreProcess: {
s,..8222;,,gis;
}
ContentsHTMLPreProcess: {
s,..8222;,,gis;
}
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/.*/.*\.jpg
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/politik/.*\.gif

URL: http://www.zeit.de/wirtschaft/index
Name: Zeit Wirtschaft
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <td width="20px"><img alt="" src="http://zeus.zeit.de/images/transparent_pixel.gif" width="20px"
ContentsEnd: src="http://zeus.zeit.de/images/transparent_pixel.gif" valign="bottom" align="right" width="65"
StoryURL: http://www.zeit.de/\d+/\d+/.*
StoryStart: <div class="text">
StoryEnd: <p class="mainnavigation"><a href="#top">ZUM ARTIKELANFANG</a></p>
StoryHTMLPreProcess: {
s,..8222;,,gis;
}
ContentsHTMLPreProcess: {
s,..8222;,,gis;
}
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/.*/.*\.jpg
# ImageURL: http://zeus.zeit.de/bilder/\d+/\d+/politik/.*\.gif
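
All five Zeit files share the same StoryURL pattern. A quick way to check which links it would pick up is sketched below in Python. It assumes the pattern has to match the whole URL (an assumption about how Sitescooper applies it), and the sample story link is invented:

```python
import re

# Same pattern as the StoryURL lines above.
STORY_URL = re.compile(r"http://www.zeit.de/\d+/\d+/.*")


def is_story(url: str) -> bool:
    """True if the whole URL matches the StoryURL pattern."""
    return STORY_URL.fullmatch(url) is not None


links = [
    "http://www.zeit.de/2004/32/Beispiel",  # hypothetical story link
    "http://www.zeit.de/politik/",          # section page, not a story
    "http://www.zeit.de/feuilleton/index",  # section page, not a story
]
for link in links:
    print(link, is_story(link))
```

Only links of the form year/issue/something pass, which is what keeps the section index pages out of the scoop.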

URL: http://www.ikz-online.de/ikz/ikz.rei...ueberblick.php
Levels: 2
Name: gms Reise
ContentsStart: header.berichte2.gif
ContentsEnd: <!-- Ende - Z_2sp_dpa_Uebers_Fortl_SQL -->
StoryURL: http://www.ikz-online.de/.*
StoryStart: <!-- Ende - Z_2sp_Multicom_Lang_SQL -->
StoryEnd: <span class="contentfliess">
ImageURL: http://www.ikz-online.de/includes/bi....php?.*.nitf.*
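
ContentsStart/ContentsEnd and StoryStart/StoryEnd bracket the useful part of a page. Here is a minimal Python sketch of that idea. It assumes everything up to and including the start marker is dropped, and everything from the end marker onwards is dropped too (the real behaviour at the boundaries may differ). The markers are treated as plain strings rather than patterns, and the page is invented:

```python
def slice_between(page: str, start: str, end: str) -> str:
    """Return the text after the first `start` and before the next `end`."""
    begin = page.find(start)
    if begin == -1:
        return page  # start marker missing: leave the page untouched
    begin += len(start)
    stop = page.find(end, begin)
    if stop == -1:
        return page[begin:]  # end marker missing: keep the rest
    return page[begin:stop]


# Hypothetical contents page using the gms Reise markers.
page = (
    '<img src="header.berichte2.gif">'
    '<a href="story1.php">Story 1</a>'
    "<!-- Ende - Z_2sp_dpa_Uebers_Fortl_SQL -->"
    "<p>footer</p>"
)
print(slice_between(page, "header.berichte2.gif",
                    "<!-- Ende - Z_2sp_dpa_Uebers_Fortl_SQL -->"))
```

When a site file stops producing output, checking whether the markers still occur in the live page is usually the first thing to try.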

# Change YOURID and YOURPASSWORD to your own NYT details!
URL: http://www.nytimes.com/auth/chk_logi...ext/index.html

Name: NYT Arts
AuthorName: Edited by Geoffrey Miller mostly from Kennis Koldewyn's .site files
# Thanks to Kennis Koldewyn for helping me with the NYT files
# This format will probably work with most other NYT text-only URLs, try it and see
# Post the results when you're finished!
Levels: 2
ContentsStart: <td rowspan="3" width="1" bgcolor="#E3E3E3" valign="top">
ContentsEnd: <IMG src="http://graphics7.nytimes.com/images/misc/spacer.gif" width="459" height="1" border="0"/>

# Contents pre-processing:
ContentsHTMLPreProcess: {
# Change font-hacking into heading:
s,<FONT SIZE="\+1"><STRONG>(.*?)</STRONG></FONT><P></P>,<H1>$1</H1>,gis;

# Change empty paragraphs into breaks:
s,<P></P>,<BR>,gis;
}


StoryURL: http://www.nytimes.com/.*\.html.*


StoryStart: <NYT_HEADLINE
StoryEnd: </NYT_TEXT
StoryToPrintableSub: s,(.*),$1?position=&pagewanted=print&position=,


# Story pre-processing:
StoryHTMLPreProcess: {
# Remove lists of online links, inline tables, inline images, etc.:
s,<NYT_AD.*?</NYT_ADD?>,,gis;
s,<NYT_BANNER.*?</NYT_BANNER>,,gis;
s,<NYT_INLINEBLURB.*?</?NYT_INLINEBLURB>,,gis;
s,<NYT_INLINEIMAGE.*?</?NYT_INLINEIMAGE>,,gis;
s,<NYT_INLINETABLE.*?</?NYT_INLINETABLE>,,gis;
s,<NYT_LINKS.*?</NYT_LINKS>,,gis;
s,<NYT_LINKS_ONSITE.*?</?NYT_LINKS_ONSITE>,,gis;
s,<NYT_LINKS_OFFSITE.*?</?NYT_LINKS_OFFSITE>,,gis;

# Remove other NYT-specific tags:
s,<\/?NYT_.*?>,,gim;
}
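
The story pre-processing works in two passes: first whole blocks (inline images, tables, link lists) are deleted together with their contents, then any remaining NYT_ tags are stripped while their text is kept. A Python sketch of the same two passes, covering only three of the block tags and using an invented snippet of markup:

```python
import re

FLAGS = re.IGNORECASE | re.DOTALL

# Blocks removed together with their contents (a subset of the rules above).
BLOCK_TAGS = ["NYT_INLINEIMAGE", "NYT_INLINETABLE", "NYT_LINKS_ONSITE"]


def clean_story(html: str) -> str:
    for tag in BLOCK_TAGS:
        # Equivalent of s,<TAG.*?</?TAG>,,gis;
        html = re.sub(rf"<{tag}.*?</?{tag}>", "", html, flags=FLAGS)
    # Equivalent of s,<\/?NYT_.*?>,,gim; drops leftover tags, keeps the text.
    return re.sub(r"</?NYT_.*?>", "", html,
                  flags=re.IGNORECASE | re.MULTILINE)


# Hypothetical story fragment.
sample = (
    "<NYT_HEADLINE>Title</NYT_HEADLINE>"
    "<NYT_TEXT>Body <NYT_INLINEIMAGE><img src='x.gif'></NYT_INLINEIMAGE>"
    "text</NYT_TEXT>"
)
print(clean_story(sample))
```

The order matters: if the generic tag stripper ran first, the block rules would have nothing left to anchor on and the image and link markup would stay in the story.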


# Change YOURID and YOURPASSWORD to your own NYT details!
URL: http://www.nytimes.com/auth/chk_logi...ext/index.html

Name: NYT National
AuthorName: Edited by Geoffrey Miller mostly from Kennis Koldewyn's .site files
# Thanks to Kennis Koldewyn for helping me with the NYT files
# This format will probably work with most other NYT text-only URLs, try it and see
# Post the results when you're finished!
Levels: 2
ContentsStart: <td rowspan="3" width="1" bgcolor="#E3E3E3" valign="top">
ContentsEnd: <IMG src="http://graphics7.nytimes.com/images/misc/spacer.gif" width="459" height="1" border="0"/>


# Contents pre-processing:
ContentsHTMLPreProcess: {
# Change font-hacking into heading:
s,<FONT SIZE="\+1"><STRONG>(.*?)</STRONG></FONT><P></P>,<H1>$1</H1>,gis;

# Change empty paragraphs into breaks:
s,<P></P>,<BR>,gis;
}


StoryURL: http://www.nytimes.com/.*\.html.*


StoryStart: <NYT_HEADLINE
StoryEnd: </NYT_TEXT
StoryToPrintableSub: s,(.*),$1?position=&pagewanted=print&position=,


# Story pre-processing:
StoryHTMLPreProcess: {
# Remove lists of online links, inline tables, inline images, etc.:
s,<NYT_AD.*?</NYT_ADD?>,,gis;
s,<NYT_BANNER.*?</NYT_BANNER>,,gis;
s,<NYT_INLINEBLURB.*?</?NYT_INLINEBLURB>,,gis;
s,<NYT_INLINEIMAGE.*?</?NYT_INLINEIMAGE>,,gis;
s,<NYT_INLINETABLE.*?</?NYT_INLINETABLE>,,gis;
s,<NYT_LINKS.*?</NYT_LINKS>,,gis;
s,<NYT_LINKS_ONSITE.*?</?NYT_LINKS_ONSITE>,,gis;
s,<NYT_LINKS_OFFSITE.*?</?NYT_LINKS_OFFSITE>,,gis;

# Remove other NYT-specific tags:
s,<\/?NYT_.*?>,,gim;
}
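
The StoryToPrintableSub line in the NYT files simply tacks a query string onto each story URL so that the printer-friendly page is fetched. In Python terms it does roughly the following; the example URL is made up:

```python
import re


def nyt_printable(url: str) -> str:
    """Mimic s,(.*),$1?position=&pagewanted=print&position=, on one URL."""
    # count=1 mirrors the missing /g flag: only the first match is rewritten.
    return re.sub(r"(.*)",
                  r"\g<1>?position=&pagewanted=print&position=",
                  url, count=1)


url = "http://www.nytimes.com/2004/08/02/arts/example.html"  # hypothetical
print(nyt_printable(url))
```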

URL: http://www.sundayherald.com/newshome.shtml
Name: SH-News
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,
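
The active StoryToPrintableSub rule turns a numbered story URL into its print version by inserting `print` in front of the story number. A Python sketch of the same rewrite, with an invented story number:

```python
import re


def sh_printable(url: str) -> str:
    """Mimic s,^(http://www.sundayherald.com)/(\\d+)\\S*,\\1/print\\2,"""
    return re.sub(r"^(http://www.sundayherald.com)/(\d+)\S*",
                  r"\1/print\2", url, count=1)


print(sh_printable("http://www.sundayherald.com/43567"))  # hypothetical story
print(sh_printable("http://www.sundayherald.com/newshome.shtml"))  # unchanged
```

URLs without a leading story number, such as the section pages, do not match and are left as they are.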

URL: http://www.sundayherald.com/sevendayshome.shtml
Name: SH-7days
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,

URL: http://www.sundayherald.com/businesshome.shtml
Name: SH-Business
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,

URL: http://www.sundayherald.com/focushome.shtml
Name: SH-NewsFocus
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,

URL: http://www.sundayherald.com/internationalhome.shtml
Name: SH-World
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,

URL: http://www.sundayherald.com/magazinehome.shtml
Name: SH-Magazine
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,

URL: http://www.sundayherald.com/reviewhome.shtml
Name: SH-Review
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
#StoryStart: <div class="headline">
#StoryEnd: <script language="JavaScript">
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,

#StoryStart: <div class="bodyTextPrint">
#StoryEnd: <a href="javascript:history.back()">Back to previous page</a>
#StorytoPrintableSub: s,(.*),http://www.sundayherald/print\d+,
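
Since the seven Sunday Herald files differ only in URL and Name, they could also be generated from one template instead of being maintained by hand. A sketch in Python; the script, its function name and the decision to leave out the commented-out lines are illustrative choices, not part of the files above:

```python
# Raw string so the backslashes in the patterns are kept as written.
TEMPLATE = r"""URL: http://www.sundayherald.com/{page}.shtml
Name: SH-{name}
Description: Scottish Sunday newspaper
AuthorName: Geoffrey Miller
Levels: 2
ContentsStart: <!-- content -->
ContentsEnd: <td width="10" bgcolor="FFFFFF"></td>
StoryURL: http://www.sundayherald.com/.*\d+\.*
StoryStart: <table width="100%"
StoryEnd: Back to previous page
StoryToPrintableSub: s,^(http://www.sundayherald.com)/(\d+)\S*,\1/print\2,
"""

# (page name in the URL, suffix of the site Name), taken from the files above.
SECTIONS = [
    ("newshome", "News"),
    ("sevendayshome", "7days"),
    ("businesshome", "Business"),
    ("focushome", "NewsFocus"),
    ("internationalhome", "World"),
    ("magazinehome", "Magazine"),
    ("reviewhome", "Review"),
]


def render_all() -> str:
    """Return all seven site entries, separated by blank lines."""
    return "\n".join(TEMPLATE.format(page=page, name=name)
                     for page, name in SECTIONS)


print(render_all())
```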