![]() |
#1 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,160
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
web2disk
Released libprs500 v0.3.69 with the program web2disk. It allows you to download websites to your harddisk in a format that's suitable to run through html2lrf to generate a nice LRF file for your SONY Reader.
Code:
web2disk http://www.google.com Enjoy and report bugs. This is a few hours work, so there are bound to be many. |
![]() |
![]() |
![]() |
#2 |
Sir Penguin of Edinburgh
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Hey! I'm gonna download the Internet!
![]() web2disk *.*.*.* But seriously, can you set a delay of one request per second? That is the cutoff for Wikipedia. Any faster and they will block the URL. |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Member
![]() Posts: 22
Karma: 10
Join Date: Mar 2007
Location: France
Device: Sony Reader
|
AdBlock (and AdBlock Plus, its successor), two Firefox extensions can use a list of regex to block ads from webpages.
I put here a link to a good list (http://adblock.free.fr/adblock.txt), which can be useful to people wanting to retrieve lare bunch of sites, without following ads and stats links. |
![]() |
![]() |
![]() |
#4 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,160
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
When you say a good list you mean a list of websites that should not be blocked? Also why are there so many / in the regexps?
|
![]() |
![]() |
![]() |
#5 |
Member
![]() Posts: 22
Karma: 10
Join Date: Mar 2007
Location: France
Device: Sony Reader
|
I mean a list of site that *should* be blocked, since all those regexps are matching sites known for their ad campaigns or stats placement code.
My post wasn't very clear... By the way, the list wasn't done by me, but I use it everyday. And I have forgottent since a long time what's like to surf on pages with advertisements with this tool ! ![]() |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,160
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
![]() |
![]() |
![]() |
#7 |
Enthusiast
![]() Posts: 31
Karma: 44
Join Date: Feb 2009
Device: none
|
This is awesome, thanks so much Kovid!
|
![]() |
![]() |
![]() |
Tags |
libprs500, web2disk |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
web2disk - howto? | urbane_gorilla | Calibre | 3 | 03-29-2010 04:48 AM |
website crashes ebook-convert after web2disk | eksor | Calibre | 1 | 03-11-2010 06:58 AM |
chaining web2disk to html2lrf | beowulf573 | Calibre | 2 | 11-19-2008 04:48 PM |