View Single Post
Old 04-01-2009, 07:09 AM   #77
Hadrien
Feedbooks.com Co-Founder
Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.
 
Hadrien's Avatar
 
Posts: 2,263
Karma: 145123
Join Date: Nov 2006
Location: Paris, France
Device: Sony PRS-t-1/350/300/500/505/600/700, Nexus S, iPad
Quote:
Originally Posted by mikecook View Post
Seems like I've missed out on all the fun here! I won't jump in on the argument regarding UID's, although I am tempted ;-)

I just want to discuss the reason for the poor quality Gutenberg EPUB/MOBI auto formatted books.

If we're truthful with ourselves, Marcello's work on these conversions is actually a waste time! That doesn't mean I don't think he nor PG are doing a great job, but I believe they are focusing their efforts on the wrong thing.

The current books are somewhat ugly because the source files they have don't use a standard format -- automation needs a standard source format - once you have that, Marcello's job of creating EPUB, MOBI or whatever other format they desire, will be so much easier.

Now it seems that for several/many years there has been discussions within the PG community for a 'Master Format', but the powers-that-be kept refusing. I guess they are now paying the consequences of that decision.

Once you have a standard source format, such as XML based (a 'very strict' ASCII formatting and layout would be okay (the current PG .TXT files are a real hodge-podge), but still not a good as XML), it is relatively easy to convert to most any other format, automatically and with all the correct markup that your new reading system needs.

You may loose out on some hand-coded 'uniqueness' between books, but all that hard work the proofreaders have done can really start to shine.
Agree and this is exactly what I do on Feedbooks (and what you're working on with your TEI subset).
Hadrien is offline   Reply With Quote