#1 |
Wearer of Pants
Posts: 1,050
Karma: 7634
Join Date: Jan 2008
Location: Norman, OK
Device: Amazon Kindle DX / iPhone
|
Wikipedia to Ebook
So I've needed to familiarize myself briefly with a few topics lately, and Wikipedia is perfect for the job of "the basics" (and please, no arguments about it as a source, etc.).
I have a Kindle, so of course I can use Whispernet, but it really kind of bites, and Sprint is unreliable where I live. So I'd like to set up some "packets" of Wikipedia articles that are in decent condition for use as an ebook. Now, I can go to print mode, copy and paste, delete a lot, etc., and then send it to my Kindle, but that takes a lot of time. I was wondering if there is anything automated, or at least a bit more streamlined. The German version of Wikipedia does seem to have such a thing, but it's not available for the English version yet. |
#2 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
Download the Wikipedia CD?
http://en.wikipedia.org/wiki/Wikiped...ia-CD/Download I have no idea what format it is. I assume HTML? BOb |
#3 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
#4 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
|
#5 |
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
|
#6 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
It certainly has its own format, although it can use some HTML statements. Have you looked at the source on our wiki?
It is certainly not XML. It claims to be based on email, but I don't find that very convincing. It gives its own special interpretation to a lot of markup, yet it covers most of what HTML offers and will even accept many HTML constructs to make importing easier. The native language is quite different, though. Dale |
#7 |
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
I stand corrected - I thought that it was XML-based. Thanks for putting me right!
|
#8 | |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
Quote:
Basically, it HAS to be HTML for a browser to render it, unless it uses Flash or Java, of course. BOb |
|
#9 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
This forum is rendered in my browser as well, but it is not HTML either, so the claim that it has to be HTML is incorrect. The rendering engine converts on the fly. Dale |
|
#10 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en" dir="ltr"> so you are correct in that it must be rendered as html. Dale |
|
#11 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
Right, so this is what I am saying. I am "assuming", of course, that the Wikipedia CD is the rendered output, because I am sure they aren't shipping a web server engine, which is what does all the transforms and such.
I very much doubt that the Wikipedia CD is all the source files. But, of course, it could be. BOb |
#12 |
Zealot
Posts: 135
Karma: 142
Join Date: Jul 2008
Device: iPod Touch, iPad
|
I don't know how many remember one of the older ebook formats called TomeRaider, not much for general reading really, but it was particularly useful for browsing and searching large reference texts -- I used it on my Psion quite a bit, which should tell you how old it is.
Well, it is still around, and one of its uses these days is condensed versions of large web sites for offline access, such as IMDb. They have a Wikipedia file, but it is just the abstracts with links to the full articles (download link). While they still support the older and more limited handhelds (they still list EPOC as supported!), they have also announced forthcoming versions for Android and iPhone, though I never have much faith in such announcements until they at least produce a beta. Anyway, that is the only 'ebook' or offline version of Wikipedia that I know of, aside from the CD mentioned. The Bandit |
#13 | |
Grand Sorcerer
Posts: 9,707
Karma: 32763414
Join Date: Dec 2008
Location: Krewerd
Device: Pocketbook Inkpad 4 Color; Samsung Galaxy Tab S6
|
Quote:
In MediaWiki, you can use ==head==, or you can use <h2>head</h2>. Both will work. So, is wiki code HTML or a custom format? It is both! In that respect, wiki language is just like any other server-side language (and it's fun to play with!). If you save the pages, you don't save the server code; you save the generated code, i.e. the HTML pages. |
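To make the "it is both" point concrete, here is a minimal Python sketch of the kind of transform a wiki engine does before the browser ever sees the page. It is only an illustration, not MediaWiki's actual parser: the function name `wiki_headings_to_html` is made up for this example, and it handles nothing but heading lines. Lines written as `==head==` become heading tags, while a literal `<h2>head</h2>` passes through untouched, so both spellings end up as the same HTML.

```python
import re

# Matches a whole line such as "==head==" or "=== Sub ===".
# The same run of "=" (2 to 6 characters) must open and close the line.
_HEADING = re.compile(r"^(={2,6})\s*(.+?)\s*\1\s*$")


def wiki_headings_to_html(text: str) -> str:
    """Convert wiki-style heading lines to HTML heading tags.

    Hypothetical helper for illustration only; every other line,
    including literal HTML, is left exactly as it was.
    """
    out = []
    for line in text.splitlines():
        m = _HEADING.match(line)
        if m:
            level = len(m.group(1))  # "==" -> h2, "===" -> h3, ...
            out.append(f"<h{level}>{m.group(2)}</h{level}>")
        else:
            out.append(line)
    return "\n".join(out)


if __name__ == "__main__":
    print(wiki_headings_to_html("==head=="))
    print(wiki_headings_to_html("<h2>head</h2>"))
```

Saving a page from the browser captures only the output of a step like this, which is why a saved page is plain HTML regardless of how the source was written.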
|
#14 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
OK, I downloaded the Wiki CD. It's 400MB. It comes with a viewer called "Kiwix".
All the articles on the CD/DVD are in .html files, and I can view them with Firefox. However, there is no default or index .html page. There is a .index file and a .map file, which I assume Kiwix can read. I am going to zip up the html folder, then add it as a book in calibre and see what it makes of it. BOb |
#15 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
Well... that didn't go too well. It looks like calibre doesn't traverse into folders when turning a zip of HTML files into a book. I converted the .zip to mobi in calibre... and I seem to have got just the list of contributors.
So, this is probably doable. Someone is going to have to write something that can traverse all the source files and put them into a flat file... then maybe calibre can do something with it. I think this could be done and put into a .mobi book. I just don't have the energy to do it right now. BOb |
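For anyone who wants to try the "traverse all the source files and put them into a flat file" idea, here is a rough Python sketch. It is untested against the actual CD: the function name `flatten_html_tree`, the UTF-8 encoding and the use of each file name as a chapter heading are all assumptions made for this example. It does not rewrite the links between articles, and whether calibre produces a usable book from the result is an open question.

```python
import html
import re
from pathlib import Path

# Grabs whatever sits between <body ...> and </body>, if the tags exist.
_BODY = re.compile(r"<body[^>]*>(.*?)</body>", re.IGNORECASE | re.DOTALL)


def flatten_html_tree(src_dir, out_file):
    """Merge every .html file under src_dir (subfolders included) into one
    flat HTML file and return the number of files merged.

    Hypothetical helper: assumes UTF-8 input and uses each file name
    (without extension) as an <h1> chapter heading.
    """
    out_path = Path(out_file).resolve()
    parts = ['<html><head><meta charset="utf-8">'
             "<title>Wikipedia selection</title></head><body>"]
    count = 0
    for path in sorted(Path(src_dir).rglob("*.html")):
        if path.resolve() == out_path:
            continue  # never swallow our own output file
        raw = path.read_text(encoding="utf-8", errors="replace")
        m = _BODY.search(raw)
        body = m.group(1) if m else raw  # fall back to the whole file
        parts.append(f"<h1>{html.escape(path.stem)}</h1>")
        parts.append(body)
        count += 1
    parts.append("</body></html>")
    out_path.write_text("\n".join(parts), encoding="utf-8")
    return count
```

Calling something like `flatten_html_tree("html", "wikipedia_flat.html")` would leave a single file that can be added to calibre in place of the zip. The 400MB of articles would probably need to be split into smaller batches first.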