Wikipedia on Sony Reader
Hello all, this is something I've been looking around for since i got my sony reader, but noone ha been able to do it.
Last night, i remembered some projects where people were able to put wikipedia on theys ipods, so i started from there. For those of you that didnt knew, wikipedia has a dowloadable version, on all it's languages and this can be in XML or HTML format (i think there are some more).
The thing is, you can convert HTML files using calibre, from html to ePub. Why ePub? Because the conversion is done directly and the outputted file is compressed, saving some more space on the device.
Now, here's the bad part. Only in spanish (my language) the file is 1.3 GB compressed. The result is a little more than 1,300,000 files in different folders and also including the user comments or discuddion for each article, for a nice total of 19 GB of data.
Lets say i remove the user comments/discussions, maybe i'll get the file to a mere 2 GB uncompressed. After running the file through calibre and compressing/converting them to ePub, lets say it stays on 1.3 GB again (this are just guesses i'm trying). (remember, this is for the spanish wikipedia).
How on earth am i going to put 1.3GB of data on my reader? I can get a cheap 2 or 4 GB sandisk memory stick duo (or SD card) and store the files there, the thing is that it will be more than 600,000 files to store and that the reader will have to read an manage. Is the reader even able to handle such an ammount of files??? will the baterry drain just from trying to list the files on the book collection?
The point of leaving each article as a separate ePub file, is that the reader manages the collection easily by letther and all, so it wont really be necessary to have a "search" function, the reader would arrange them by title making it kinda easy to search.
That is the reason why you couldnt also put a lot of file into one singe ePub file to decrease the file count on the memory stick/SD.
-Does anybody have an idea on how this could be handled?
-any idea on how to rapidly convert all the files to ePub woithout having to import them to calibre? (kind of like a batch process).
-does anybody know if there is a lighter wikipedia downloadable version?
-i know there is another downloadable wikipedia version for Tomeraider, but that isnt compatible with sony reader, maybe we can convert it.
Ill keep searching and posting my findings, but it would be great to see if somebody has ny feedback to give.
thank you all.
|