![]() |
#77 | |
Feedbooks.com Co-Founder
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,263
Karma: 145123
Join Date: Nov 2006
Location: Paris, France
Device: Sony PRS-t-1/350/300/500/505/600/700, Nexus S, iPad
|
Quote:
|
|
![]() |
![]() |
![]() |
#78 |
Enthusiast
![]() Posts: 35
Karma: 10
Join Date: Jun 2007
Location: United Kingdom
Device: iPad Mini, Nexus 7, Sony Reader, Kindle, and others.
|
|
![]() |
![]() |
![]() |
#79 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,293
Karma: 27111240
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I totally agree that from the perspective of doing automatic conversions having a strict source format is a great thing (not strictly necessary, but great). The problem is that the perspective of doing automatic conversion is not the only one, nor even the most important one.
The most important perspective is to grow the adoption of ebooks. And to do that you have to encourage content producers as well as consumers. Content producers in general like flexibilty, they like a format that lets them do what they want to do instead of telling them that they can do only a very limited subset of things. So there's a tradeoff between making life easy for automatic converters and giving content producers what they want |
![]() |
![]() |
![]() |
#80 | |
Enthusiast
![]() Posts: 35
Karma: 10
Join Date: Jun 2007
Location: United Kingdom
Device: iPad Mini, Nexus 7, Sony Reader, Kindle, and others.
|
Quote:
Please note, the content of my post was about Project Gutenberg, not the publishing industry. Last edited by mikecook; 04-01-2009 at 05:34 PM. Reason: spelling error |
|
![]() |
![]() |
![]() |
#81 | |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 170
Karma: 2000
Join Date: Apr 2008
Location: San José, CA
Device: Amazon Kindle 1, Sony PRS-300, Amazon Kindle 3
|
Quote:
|
|
![]() |
![]() |
![]() |
#82 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,452
Karma: 7185064
Join Date: Oct 2007
Location: Linköpng, Sweden
Device: Kindle Voyage, Nexus 5, Kindle PW
|
Quote:
|
|
![]() |
![]() |
![]() |
#83 | |
Enthusiast
![]() Posts: 35
Karma: 10
Join Date: Jun 2007
Location: United Kingdom
Device: iPad Mini, Nexus 7, Sony Reader, Kindle, and others.
|
Quote:
![]() @cerement Sometime back Marcello was working on a new pg2tei version but I've not heard anything since. Maybe he realised full automation to TEI was impossible...there will always be some manual work. |
|
![]() |
![]() |
![]() |
#84 | |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 170
Karma: 2000
Join Date: Apr 2008
Location: San José, CA
Device: Amazon Kindle 1, Sony PRS-300, Amazon Kindle 3
|
Quote:
|
|
![]() |
![]() |
![]() |
#85 | |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,293
Karma: 27111240
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Quote:
|
|
![]() |
![]() |
![]() |
#86 | ||
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 170
Karma: 2000
Join Date: Apr 2008
Location: San José, CA
Device: Amazon Kindle 1, Sony PRS-300, Amazon Kindle 3
|
Quote:
Generating plaintext from XML is trivial, generating pretty-printed plaintext from XML is just as easy. Generating whichever XML variant from plaintext is harder and relies on the volunteers, but when you already have volunteers churning out half a dozen formats, some automated, some not, choosing a secondary "master format" (since primary is plaintext) would focus the volunteer work and allow easier automatic generation of multiple output formats (a la Feedbooks). As an example: imagine if Calibre allowed 6 formats, both as input and as generated output. With an internal "master format", you need 12 conversion templates, 6 for input format to master format, and 6 for master format to output format. Without a master format, that would be 30 conversion templates (excluding a self-to-self conversion). With a master format, adding a 7th format would mean only adding 2 new conversion templates. Without would mean adding 12 new conversion templates. Quote:
And as mikecook mentioned, there's certainly been plenty of arguments already, pro and con, for PG to adopt a master format. Currently, it looks like that master format is HTML for generating ePub, Mobi, and Plucker, but PG's HTML file quality varies even more randomly than their plaintext quality. |
||
![]() |
![]() |
![]() |
#87 |
Enthusiast
![]() Posts: 35
Karma: 10
Join Date: Jun 2007
Location: United Kingdom
Device: iPad Mini, Nexus 7, Sony Reader, Kindle, and others.
|
What you must realise Kovid is that over 50% of the current gutenberg.org titles have come from the various Distributed Proofreaders channels, where they use there own software to allow people to proof read - no technical knowledge required.
Although I don't know the exact numbers, I don't believe too many new PG titles have come from outside the DP websites within the last few years. DP have had various discussions on using TEI as a master, but had said that it was not good enough to record the books properly...like .TXT is! Saying that, they have produced quite a few TEI versions over the last couple of years, I'm just not sure whether they are from individuals or their own system. @cerement I think the idea was to use pg2tei to convert the back catalogue. I presume the DTD was for newer books. I don't know if he ever had plans for this to be used as a master, or whether is was just another format to add to the archives, like they now have with EPUB and MOBI. |
![]() |
![]() |
![]() |
#88 | |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,293
Karma: 27111240
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Quote:
|
|
![]() |
![]() |
![]() |
#89 | |
Junior Member
![]() Posts: 1
Karma: 28
Join Date: Apr 2009
Device: none
|
Quote:
-- Marcello |
|
![]() |
![]() |
![]() |
#90 |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 170
Karma: 2000
Join Date: Apr 2008
Location: San José, CA
Device: Amazon Kindle 1, Sony PRS-300, Amazon Kindle 3
|
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Project Gutenberg Australia | ballast | Deals and Resources (No Self-Promotion or Affiliate Links) | 9 | 07-31-2010 04:18 PM |
Project Gutenberg | levi_john | Workshop | 17 | 07-26-2010 06:02 PM |
How are the mobi and epub files at Project Gutenberg? | ficbot | General Discussions | 2 | 04-16-2010 06:57 PM |
What's wrong with Project Gutenberg? | mtravellerh | News | 13 | 04-22-2009 03:17 AM |