View Full Version : Government EPUB - CIA World Factbook
Bossman 07-25-2008, 12:12 PM Hi Guys
I wanted to try out the epub format with the new prs505 firmware.
To see how well it handles big files i converted the CIA World Factbook to epub format with a bit of quick and dirty text editing.
I used this guide to create the epub file
http://www.teleread.org/blog/2007/10/31/experimenting-with-epub-creation/
and used this to convert the Website to xhtml
http://www.it.uc3m.es/jaf/html2xhtml/
It might not be perfect but good enough to use it.
Its much faster than the pdf version that i used before.
p.s. are we gonna have a seperate epub section?
Updated 02.08.2008
Cleanup and Multi level Toc
Updated 18.08.2008
added Flags
made xhtml 1.1 compliant
Cleanup
igorsk 07-25-2008, 12:18 PM Why the thumbs down icon? This is awesome!
Bossman 07-25-2008, 12:23 PM didnt realise i clicked it
Donnageddon 07-26-2008, 02:54 AM Not sure why, but the pages end off screen to the right for me. Even at the smallest font. Even in portrait orientation.
Looks like a great portable resource, but I can't get it to function correctly.
Just some friendly feedback.
Bossman 07-26-2008, 03:33 AM Yes that seems to only happen in the geography section.
Firefox always renders that section correctly only in Adobe DE and on the reader is that section sometimes shifted to the right.
On thing i also found is that if one table field entry is to big to be rendered on one page, if you scroll Adobe DE and the reader will not show the rest of the field.
TedPark 07-26-2008, 11:22 AM Yes, please - a new EPUB section will be needed - - -
And, while you are at it, I would like to see a RTF section as well, since it is really a "universal" format - rather than a misc. (read: "oddball") format.
TedPark 07-26-2008, 11:23 AM I read somewhere that the new firmware allows you to scroll. Is this something new and different - or just the limited capability of the previous firmware?
Donnageddon 07-28-2008, 11:47 AM I read somewhere that the new firmware allows you to scroll. Is this something new and different - or just the limited capability of the previous firmware? On the Sony 505? No, no scrolling.
MishaS 07-29-2008, 12:03 PM I wanted to try out the epub format with the new prs505 firmware.
To see how well it handles big files i converted the CIA World Factbook to epub format with a bit of quick and dirty text editing.
I used this guide to create the epub file
http://www.teleread.org/blog/2007/10/31/experimenting-with-epub-creation/
and used this to convert the Website to xhtml
http://www.it.uc3m.es/jaf/html2xhtml/
It might not be perfect but good enough to use it.
Its much faster than the pdf version that i used before.
Hopefully, this output of epubcheck (http://code.google.com/p/epubcheck/) is useful.
Bossman 08-02-2008, 09:56 AM Thx MishaS
i made some cleanup.
The "same target" error from the TOC is supposed to be that way.
Isnt it allowed to have multible Entries linking to the same Point?
However i cant work out the problem with the
"I/O error reading OEBPS/countrylisting.xhtml".
i made the zip according to the spec with
zip -Xr9D cwfb.epub mimetype *
MishaS 08-03-2008, 02:36 AM Thx MishaS
i made some cleanup.
The "same target" error from the TOC is supposed to be that way.
Isnt it allowed to have multible Entries linking to the same Point?
However i cant work out the problem with the
"I/O error reading OEBPS/countrylisting.xhtml".
i made the zip according to the spec with
zip -Xr9D cwfb.epub mimetype *
I'll post a message in epubcheck group (http://groups.google.com/group/epubcheck)
MishaS 08-08-2008, 02:51 PM I'll post a message in epubcheck group (http://groups.google.com/group/epubcheck)
I got some recommendations privately:
You have to re-arrange the playorder attribute in the toc.ncx file, it should be always in a ascending order. And for the I/O reading error, you should removed the doctype declaration in the HTML files. Pls do this and run the epubcheck again.
I'm not sure all of them are clear enough... I'll see what that could mean :)
Meanwhile, you could use online ePub validation service (http://www.threepress.org/document/epub-validate/) offered by ThreePress (http://www.threepress.org/).
Bossman 08-09-2008, 11:41 AM Thanks Mishas
The Problem was indeed the xhtml beein in version 1.0.
I converted the Paged to xhtml 1.1.
Unfortunately in the conversoin process some layout information got lost due to xhtml 1.1 restrictions.
I tried to recreate them in css.
Adobed DE behaviour regarding CSS entries it cant understand is a bit harsh. If such a file is encountered the entire CSS gets ignored. That might be something worth while adding to the epubcheck.
The bug with Text beeing cut in a Table cell, if there is more text that can be displayed on one page seems to be a bug in Adobe DE. Nothing i can do here.
Bossman 08-09-2008, 11:46 AM One more thing i came across.
Adobe DE does not support Text align: Justify. The Epub standard does specify support for it. I really hope that they add this soon because it makes books look so much neater.
MishaS 08-09-2008, 01:20 PM Thanks Mishas
I just realized that I did not mention the person who actually provided the help :smack: It's "Senthil Kumar". Kudos go to him.
|