11-18-2009, 02:34 AM   #10
okalyddude
Enthusiast
Posts: 41
Karma: 602
Join Date: Oct 2009
Device: E600
Quote:
Originally Posted by eksor
calibre (calibre.kovidgoyal.net/download) will work fine. Just download the tar, uncompress it, load the top-level html file into calibre, and convert it to lrf or epub.

But the file will be huge (a 3.5 GB DVD version is available, to give you an idea) and it will take a long time to load onto the device, if it finishes at all. In fact you will need an SD or a Pro Duo card to store it.

Alternatively, you could use calibre's web2disk on the site itself:

web2disk http://schools-wikipedia.org/wp/index/a.htm will download everything linked from a.htm (one level deep) and leave an index.htm in the download directory.
When it has finished:
ebook-convert index.htm a.lrf (or a.epub) will convert all the linked htm files into a self-contained lrf or epub file. Sadly, even if you repeat the same with b.htm, none of its content will be linked with a.htm. In other words, you will not be able to navigate freely across the wikipedia if you download and convert a, b, c, etc. separately.
I have tested this approach on the site you are interested in with some subjects (performers, for instance, or WWII) and it works fine. Results are better with the epub format (some images are missing with lrf).

Both web2disk and ebook-convert are command-line utilities that come with the calibre software.

regards.
Hmm, OK. The version of the wikipedia I have does not have an index html page for each letter, merely a folder containing all the html and jpg files (and the jpg files are not well labeled).

I will try to look at what you linked, and see if I can get it working that way.

I did intend to buy a memory card for the expansion slot, but only if I got this working in an efficient manner.

With the approach you're describing, would I have a single epub file for each letter? And would the chapter catalogue be links to all the html pages? How do you include the jpgs in the html pages? Are they already in the 'index' html? (I'm new to this and quite clueless, and have yet to try it out.)

Then, with the search function of the 600, would you have to open the file for the letter you want and then search, with the first result coming from the index?

Would there be any way to index them all in one file? Would the reader be able to handle this? Would it even be able to handle the individual letters, some of which are quite large?

I've been busy, so I haven't had time to play around with the wikipedia version I have. I will check out the link you provided to see how the indexing works by letter...

The poster above you mentioned what I thought I'd have to do (as what I have is a bunch of randomly named html and jpg files), and it would be a ton of work. Then again, it would let me index them all in one file, by letter, etc...
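
For what it's worth, building that single index probably does not have to be done by hand. Below is a minimal Python sketch of the idea, not something from calibre itself: it assumes all the html files sit in one flat folder and that each page has a usable <title> tag (it falls back to the file name otherwise). The names build_index and page_title are made up for illustration.

```python
import html
import re
from collections import defaultdict
from pathlib import Path
from urllib.parse import quote

# Assumption: each page carries its article name in a <title> tag.
TITLE_RE = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def page_title(path):
    """Return the page's <title> text, or the file stem if no title is found."""
    text = path.read_text(encoding="utf-8", errors="replace")
    match = TITLE_RE.search(text)
    title = html.unescape(match.group(1)) if match else ""
    title = " ".join(title.split())  # collapse newlines and repeated spaces
    return title or path.stem


def build_index(folder, index_name="index.html"):
    """Write one index page linking every .htm/.html file in `folder`,
    grouped under a heading for the first letter of its title."""
    folder = Path(folder)
    groups = defaultdict(list)
    for path in sorted(folder.iterdir()):
        if not path.is_file() or path.name == index_name:
            continue
        if path.suffix.lower() not in (".htm", ".html"):
            continue  # skips the jpg files and anything else
        title = page_title(path)
        first = title[0].upper()
        groups[first if first.isalpha() else "#"].append((title, path.name))

    lines = ['<html><head><meta charset="utf-8"><title>Index</title></head><body>']
    for key in sorted(groups):
        lines.append(f"<h2>{html.escape(key)}</h2>")
        lines.append("<ul>")
        for title, name in sorted(groups[key], key=lambda item: item[0].lower()):
            # Relative links, so the converter can follow them from the index.
            lines.append(f'<li><a href="{quote(name)}">{html.escape(title)}</a></li>')
        lines.append("</ul>")
    lines.append("</body></html>")

    out = folder / index_name
    out.write_text("\n".join(lines), encoding="utf-8")
    return out
```

Calling build_index on the folder should leave an index.html next to the pages, which could then be given to ebook-convert the same way as the index.htm in the quoted post. Whether the reader can cope with a single file that large is a separate question, and the script does nothing about the badly labeled jpgs; it only relies on the pages already pointing at them.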