Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Readers > Sony Reader

Notices

Reply
 
Thread Tools Search this Thread
Old 10-20-2006, 08:52 AM   #1
FangornUK
Addict
FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.
 
FangornUK's Avatar
 
Posts: 205
Karma: 317
Join Date: Oct 2006
Location: England
Device: Sony PRS-505, iPad, Kindle 3
Utility for converting Gutenberg books.

Here's a little perl script I knocked together for converting Gutenberg HTML books. It produces a HTML format suitable for the Librie Toolbar (which creates BBEB books).

You additionally need wget and unzip installed to get it to work. It should work fine on MacOSX and Linux. For Windows I use Cygwin (cygwin.com).

To use it simply pass the location of the ZIPed HTML file on the Gutenberg page for the Book you're interested in, for example:
guthtml.pl http://www.gutenberg.org/files/17290/17290-h.zip

Then you just open the "new.htm" file in Internet Explorer and use the Librie Toolbar to create BBeBs I find the Librie Toolbar at the moment produces the best eBooks as it has the cleanest fonts and these are the fastest on the Reader. It doesn't produce perfect conversions but until a proper tool comes out I use this for now.

To convert text based versions of Gutenberg I simply use GutenMark and convert the produced HTML file with Librie Toolbar.

The script can also call htmldoc if you want to create PDFs for the Reader, simply uncomment the last few lines in the script.

If anyone knows how to get page breaks to work in Librie Toolbar please let me know.
Attached Files
File Type: pl guthtml.pl (2.4 KB, 521 views)

Last edited by FangornUK; 10-25-2006 at 08:23 AM.
FangornUK is offline   Reply With Quote
Old 10-20-2006, 04:32 PM   #2
TadW
Uebermensch
TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.TadW ought to be getting tired of karma fortunes by now.
 
TadW's Avatar
 
Posts: 2,583
Karma: 1094606
Join Date: Jul 2003
Location: Italy
Device: Kindle
Good work, Fangorn.
TadW is offline   Reply With Quote
Advert
Old 11-02-2006, 10:16 AM   #3
FangornUK
Addict
FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.FangornUK has a complete set of Star Wars action figures.
 
FangornUK's Avatar
 
Posts: 205
Karma: 317
Join Date: Oct 2006
Location: England
Device: Sony PRS-505, iPad, Kindle 3
There has been some dicsussion on Gutenberg books not having any markup but for quite a while most new books have HTML versions with the markup from the original books. Many older text versions are being updated with HTML versions. For those old text ones that have no HTML, I really recommend GutenMark as it does a good job of putting back the markup and putting back in none ascii characters such as umlauts.

I've attached an ebook for the Sony Reader converted from a Gutenberg HTML using the perl script above and then put through Librie Toolbar to create a BBeB file (LRF). Nothing was edited in the files to produce this ebook. I haven't managed to figure out how to create Page Breaks with the toolbar though.
Attached Files
File Type: zip York Minster.zip (2.04 MB, 513 views)
FangornUK is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Utility for Project Gutenberg Text Files rocketgranny Deals and Resources (No Self-Promotion or Affiliate Links) 7 03-20-2010 02:44 AM
Importing Books from Project Gutenberg dhume01 Calibre 9 02-04-2010 12:04 PM
Project Gutenberg books ALL available in LRF coolbooks LRF 28 12-23-2009 06:40 PM
EPUB books now available at Project Gutenberg Kris777 News 13 03-28-2009 12:49 AM
SciFi e-books at Project Gutenberg Bob Russell Deals and Resources (No Self-Promotion or Affiliate Links) 2 08-24-2006 09:42 PM


All times are GMT -4. The time now is 10:30 PM.


MobileRead.com is a privately owned, operated and funded community.