11-15-2010, 06:02 AM | #1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2010
Device: PRS 350
|
MetroTime Belgium
http://www.metrotime.be/digipapernl.html
Then look for a .pdf file. Unfortunately this PDF is only one page, so you have to keep downloading every consecutive page. Luckily the URLs make sense.
Code:
page1: http://www.metrotime.be/UserFiles/DigiPaper/nl/20101110/1/MVLMP-0-20101110-01.pdf
page2: http://www.metrotime.be/UserFiles/DigiPaper/nl/20101110/2/MVLMP-0-20101110-02.pdf
Could these be merged into one PDF file? If not, you would have to "click" each story on the image of the page to get to a simple version. You would just have to look for a "storyId=" in the source of each of the 24 pages. (The front page also has short headlines with the page number they are on, and these have a storyId as well, which should be ignored. This could technically be done since those "stories" all end with the identifier "e_SRitp".) Is any of this possible?
Edit: forgot to mention that it doesn't update every day.
Last edited by Leprecon; 11-16-2010 at 04:45 AM. |
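For anyone who wants to try the PDF route, the URL pattern above can be written down as a short Python sketch. It assumes the pattern seen for pages 1 and 2 (folder number unpadded, file number zero-padded to two digits) holds for all 24 pages; the function name `page_pdf_urls` is made up for illustration. It only builds the URLs. Downloading and merging would need a separate PDF tool and is not shown.

```python
# Sketch only: builds the per-page PDF URLs from the pattern in the post.
# Assumption: every page follows the same scheme as page 1 and page 2.
BASE = "http://www.metrotime.be/UserFiles/DigiPaper/nl"


def page_pdf_urls(date, pages=24):
    """Return one URL per page for an issue date given as YYYYMMDD."""
    urls = []
    for n in range(1, pages + 1):
        # Folder uses the plain page number, file name uses two digits.
        urls.append(f"{BASE}/{date}/{n}/MVLMP-0-{date}-{n:02d}.pdf")
    return urls


if __name__ == "__main__":
    for url in page_pdf_urls("20101110", pages=2):
        print(url)
```

Since the paper doesn't update every day, some dates would presumably have no files at all, so a real script should expect failed downloads.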
11-16-2010, 02:32 AM | #2 |
Zealot
Posts: 122
Karma: 10
Join Date: Jul 2010
Device: nook
|
calibre does not support pdf file input. sorry.
|
11-16-2010, 04:49 AM | #4 |
ePaper Enthousiast
Posts: 5
Karma: 10
Join Date: Jun 2008
Location: Overijse, Belgium
Device: iRex DR1000S
|
What you could try is to download the HTML site using ScrapBook, then find out which HTML file has the most complete table of contents and use Calibre's ebook converter to get an EPUB.
J-F |
11-16-2010, 09:48 AM | #5 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
|
|
11-16-2010, 05:09 PM | #6 |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2010
Device: PRS 350
|
Apparently my English isn't as good as I thought.
My question was whether someone could make a recipe for me. You can get each article in plain text, or with a couple of simple images, by going to this site and clicking the individual articles on the picture of the newspaper's page. All of the articles have links that look like this:
Code:
http://www.metrotime.be/digipaperArticlenl.html?storyId=37748746
http://www.metrotime.be/digipaperArticlenl.html?storyId=37748752
Code:
http://www.metrotime.be/digipapernl.html?pag=1&kdate=15/11/2010
...
http://www.metrotime.be/digipapernl.html?pag=24&kdate=15/11/2010
Code:
/digipaperArticlenl.html?storyId=
Then you would have to remove every story that ends with "e_SRitp" (like this one) because it is useless fluff. |
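The steps described in this post could be sketched in Python roughly as follows. This is not a finished calibre recipe, only the URL and filtering logic a recipe would need. All function names are made up for illustration, and it assumes the "e_SRitp" marker shows up at the very end of the article text, which has not been checked against the live site.

```python
import re

SITE = "http://www.metrotime.be"
# Matches the article links found in the source of each page.
ARTICLE_RE = re.compile(r"/digipaperArticlenl\.html\?storyId=(\d+)")


def page_urls(kdate, pages=24):
    """Overview page URLs for one issue; kdate looks like 15/11/2010."""
    return [f"{SITE}/digipapernl.html?pag={n}&kdate={kdate}"
            for n in range(1, pages + 1)]


def story_ids(page_html):
    """Unique storyId values in page order, taken from the page source."""
    found = []
    for story_id in ARTICLE_RE.findall(page_html):
        if story_id not in found:
            found.append(story_id)
    return found


def article_url(story_id):
    """Plain-text article URL for one storyId."""
    return f"{SITE}/digipaperArticlenl.html?storyId={story_id}"


def is_fluff(article_text, marker="e_SRitp"):
    """Assumption: front-page teasers end with the marker; real stories don't."""
    return article_text.rstrip().endswith(marker)
```

A recipe would fetch every URL from `page_urls`, collect the IDs with `story_ids`, download each `article_url`, and skip the ones where `is_fluff` is true.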
|