Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > Miscellaneous > Archive > Sitescooper

Notices

 
 
Thread Tools Search this Thread
Old 08-23-2004, 06:56 AM   #1
geoffreynz
Member
geoffreynz began at the beginning.
 
Posts: 17
Karma: 44
Join Date: Jul 2004
Device: Palm m515
PrintableSub problem

Can anyone help me with this site file? I can't seem to work out how to write the StoryToPrintableSub line. It's driving me crazy!

This is an example of an ordinary URL: http://www.lefigaro.fr/magazine/20040820.MAG0015.html
And here is its printable version:
http://www.lefigaro.fr/cgi/edition/g...040820.MAG0015

I get the following error message (but the URL it doesn't like works!)

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/g...040820.MAG0014 to http://www.lefigaro.fr/magazine/20040820.MAG0014.html

The site file in question:

URL: http://www.lefigaro.fr/magazine/
Name: Le Figaro Magazine
Description: Le Figaro Magazine
Levels: 2
ContentsStart: <!--sdvD=tete-->
ContentsEnd: <!--sdvF=centred07-->
StoryURL: http://www.lefigaro.fr/magazine/\d+.MAG\d+\.html
StoryToPrintableSub: s,(http://www.lefigaro.fr)/(magazine)/(\d+.MAG\d+)\.html,$1/cgi/edition/genimprime?cle=$3,
StoryStart: <A CLASS=TIT-TETE
StoryEnd: <!-- VENTRE : F -->

Thanks in advance for any help - I'd greatly appreciate it.

Geoffrey
geoffreynz is offline  
Old 08-23-2004, 09:54 AM   #2
Alexander Turcic
Fully Converged
Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.
 
Alexander Turcic's Avatar
 
Posts: 17,107
Karma: 10995944
Join Date: Oct 2002
Location: Switzerland
Device: Sony PRS-650 / Nexus 7 / Kindle PW
Geoffrey,

try it with these pattern:

Code:
StoryToPrintableSub: s{http://www.lefigaro.fr/magazine/(\d+.MAG\d+)\.html}{http://www.lefigaro.fr/cgi/edition/genimprime\?cle=$1}i
 StoryURL: http://www.lefigaro.fr/cgi/edition/genimprime\?cle=\d+.MAG\d+
I haven't tested it, but I don't see why it shouldn't work. I just had a look at your code; StoryURL must match the print URL, not the original URL!

Greets
Alex
Alexander Turcic is offline  
Old 08-23-2004, 04:30 PM   #3
geoffreynz
Member
geoffreynz began at the beginning.
 
Posts: 17
Karma: 44
Join Date: Jul 2004
Device: Palm m515
Thanks a lot for your help! Now it works with most of the URLs on the page, but it doesn't like the first ones on the page. See the error messages below:

Found 30 links, examining them.
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0014
SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0015 to http://www.lefigaro.fr/magazine/20040820.MAG0015.html

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0016 to http://www.lefigaro.fr/magazine/20040820.MAG0016.html

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0017 to http://www.lefigaro.fr/magazine/20040820.MAG0017.html

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0018 to http://www.lefigaro.fr/magazine/20040820.MAG0018.html

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0019 to http://www.lefigaro.fr/magazine/20040820.MAG0019.html

SITE WARNING: "lefigaromagazine.site" line 1: Printable version does not match S
toryURL pattern, reverting from http://www.lefigaro.fr/cgi/edition/genimprime?cl
e=20040820.MAG0020 to http://www.lefigaro.fr/magazine/20040820.MAG0020.html

Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0009
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0013
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0010
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0021
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0022
Reading: http://www.lefigaro.fr/cgi/edition/g...040820.MAG0023

Thanks again for your help so far,

Geoffrey
geoffreynz is offline  
Old 08-23-2004, 08:23 PM   #4
geoffreynz
Member
geoffreynz began at the beginning.
 
Posts: 17
Karma: 44
Join Date: Jul 2004
Device: Palm m515
SOLVED!!! This site file is perfect. I'll probably upload a whole suite of .site files for Le Figaro next week. Thanks a lot to Alexander for all his help with this file!!!

URL: http://www.lefigaro.fr/magazine/
# Thanks to Alexander for all his help with making this site file!!!!
Name: Le Figaro Magazine
Description: Le Figaro Magazine
Levels: 2
ContentsStart: <!--sdvD=tete-->
ContentsEnd: <!--sdvF=centred07-->
StoryStart: <A CLASS=TIT-TETE
StoryEnd: <!-- VENTRE : F -->
StoryToPrintableSub:

s{http://www.lefigaro.fr/magazine/(\d+.MAG\d+.*)\.html}{http://www.lefigaro.fr/cgi/edition/genimprime\?cle=$1}i
StoryURL: http://www.lefigaro.fr/cgi/edition/g...e=\d+.MAG\d+.*
ContentsUseTableSmarts: 0
# ImageURL version about 500k, no images about 80k
ImageURL: http://www.lefigaro.fr/photos/.*\.jpg.*
geoffreynz is offline  
Old 08-24-2004, 10:04 AM   #5
Alexander Turcic
Fully Converged
Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.Alexander Turcic ought to be getting tired of karma fortunes by now.
 
Alexander Turcic's Avatar
 
Posts: 17,107
Karma: 10995944
Join Date: Oct 2002
Location: Switzerland
Device: Sony PRS-650 / Nexus 7 / Kindle PW
Glad to hear Was definitely an interesting chat!
Alexander Turcic is offline  
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Not sure what the problem is... damnyankeeuk Reading and Management 9 01-12-2010 10:44 AM
PRS-500 battery problem, but the battery's not the problem ZachC Sony Reader 7 01-11-2010 11:46 PM
Calibré problem (may be XP problem) Hildebrandt Calibre 3 07-23-2009 02:04 PM
0.4.91 problem alexxxm Calibre 4 10-07-2008 11:55 AM


All times are GMT -4. The time now is 05:22 PM.


MobileRead.com is a privately owned, operated and funded community.