Quote:
Originally Posted by jadhvaryu
Hi Quoth,
I am part of team working on a purpose-built reader around Gutenberg free books. And need enriched metadata to help in books searching and selection. Essentially: we need title, abstract/summary, author(s), publisher(s), genre(s), keyword(s)/tag(s) and ISBN#. For now, we only care for English books.
Also, the book format we prefer is HTML.
And like i mentioned earlier, I randomly checked more than 100 books and found the completeness of meta data is consistently poor.
And hence need a way to enrich it.
I played with Calibre a bit. But seems it allows:
- search results to be only 25 books
- only interactive download of one format at a time
- and metadata gathered still seems limited. (I tried the popular book "Complete Works" by William Shakespeare but still metadata was not enough.
Also, i have already downloaded big set of gutenberg ebooks (HTML version zip file). Can i 'import' these books into calibre?
Lastly, the purpose built reader will be priced for profit. We will not charge for the Gutenberg books, just the reader. If we end up using Calibre to maintain our books and update metadata, who can we talk to to understand usage / licensing terms.
Many thanks!
|
Kind of reinventing the wheel, aren't you? I mean...why? Why not just use Calibre for Gutenberg books, or any other eReader? (Does FF ePUBReader still exist? It was fantastic for my use, early on in the aughts.)
It feels...kludgy, like you're trying to find a solution for a "problem" that doesn't exist really.
That's my only comment on that front. On the second front, for the love of heaven, unless YOU are going to then convert the files from HTML into some other, new and improved (?) eBook format, stick with ePUB or MOBI or both. ePUB is vastly superior to HTML only and yes, I really would know.
Everybody here knows. As someone said, that's WHY ePUB and MOBI exist, and the world isn't reading HTML-only eBooks.
Hitch