10-19-2010, 10:49 PM | #1 |
Member
Posts: 12
Karma: 10
Join Date: Aug 2010
Device: nook
|
Website to ereader?
I have downloaded the entire web book from the website: http://openbookproject.net/thinkcs/python/english2e/. I don't know much about HTML or how it gets converted to the different ereader formats, but is there a way to take this downloaded site and convert it to an ereader format, such as .epub, with Calibre or another program?
|
10-20-2010, 10:42 AM | #2 |
Serpent Rider
Posts: 1,123
Karma: 10219804
Join Date: Jun 2009
Device: Sony 350; Nook STR; Oasis
|
Your best bet is to install "Sigil" [there's a sub-forum dedicated to it here at MR] and then just copy and paste each HTML section into one EPUB file. That is what I'd do anyway...
|
10-20-2010, 12:07 PM | #3 |
Reading and reading
Posts: 582
Karma: 8250144
Join Date: Oct 2010
Device: Infibeam Pi, iPod Touch 4G, iPad Air 2, iPad mini 2, Oneplus One
|
This is what you want: Instapaper!
Go to the website, then:
1. Create an account (easy: a username and password, nothing else, and even the password is optional).
2. Follow the site's instructions to add the bookmarklet to your favourites bar.
3. Whenever you visit a page that you want to save as a chapter of the file, click on that bookmarklet.
4. Do this for all chapters in sequence.
5. Go to your Instapaper account and click on either "Download to Kindle" or "Download to ePub reader". The file will be either a *.mobi or an *.epub file.
6. Transfer the file to your reader.
A table of contents is even added.
|
10-20-2010, 06:06 PM | #4 |
Member
Posts: 12
Karma: 10
Join Date: Aug 2010
Device: nook
|
I actually ended up figuring this out just by playing around a bit. The following steps give you a perfectly formatted, completely indexed ebook with virtually no effort:
1. Download the website using wget (already completed in my case, but listed in case someone else is interested).
2. Locate the contents of the book within the download; they should be contained in a sub-folder.
3. Delete all cascading style sheets (.css files) and any text files within the folder. This is where my problems came from. There was a .txt file within the download, so when I tried to convert the HTML I kept getting an excerpt from "Alice in Wonderland", because Calibre was converting the .txt instead of the HTML. After I fixed that, Calibre tried to preserve the original formatting of the site rather than wrapping the text and images. That was because of the cascading style sheets.
4. Rename the folder with the book contents to "HTML".
5. Zip the folder.
6. Rename the archive to "Whatever you want the book to be called".zip.
7. Use Calibre to convert it to the desired format.
Perfect output!
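For anyone who would rather script the cleanup and zip steps, here is a minimal Python sketch of the same idea. It is not what I actually ran, just an illustration: the function name `build_html_zip` and the example paths are made up, and instead of deleting the .css and .txt files it simply leaves them out of the archive, so the original download stays untouched.

```python
import zipfile
from pathlib import Path


def build_html_zip(book_dir, out_zip, drop_suffixes=(".css", ".txt")):
    """Zip the book folder under a top-level "HTML" directory.

    Files whose extension is in drop_suffixes (stylesheets and text
    files by default) are skipped rather than deleted from disk.
    Hypothetical helper: the name and defaults are illustrative only.
    """
    book_dir = Path(book_dir)
    out_zip = Path(out_zip)
    with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(book_dir.rglob("*")):
            if not path.is_file():
                continue
            # Never add the archive to itself if it lives inside book_dir.
            if path.resolve() == out_zip.resolve():
                continue
            if path.suffix.lower() in drop_suffixes:
                continue
            # Store every file as HTML/<path relative to the book folder>.
            arcname = Path("HTML") / path.relative_to(book_dir)
            zf.write(path, arcname.as_posix())
    return out_zip


if __name__ == "__main__":
    # Example paths only; point these at your own download.
    build_html_zip("english2e", "How to Think Like a Computer Scientist.zip")
```

The resulting .zip can then be added to Calibre and converted as in the last step. Calibre also ships a command-line tool, so something like `ebook-convert "Book.zip" "Book.epub"` should do the conversion without the GUI, though I have not tested that on this particular book.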
|