Old 03-25-2021, 05:21 PM   #1
jadhvaryu
Junior Member
Posts: 8
Karma: 10
Join Date: Mar 2021
Device: iPad
Question Metadata

Hi!

I recently started maintaining Project Gutenberg ebooks (HTML format), and I have found that the books' metadata is extremely limited.

I rarely find proper authors, genres, or ISBNs.

I am not using Calibre.

I am hoping to get guidance on the best way to complete the metadata. I am also open to considering the EPUB format if it has better metadata quality and completeness.

(For full disclosure: the end objective of the project I am part of is 'for profit' from the reading device.)
03-26-2021, 12:37 PM   #2
Quoth
the rook, bossing Never.
Posts: 11,161
Karma: 85874891
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
An ISBN applies to a particular paper edition or format of the source content, so there can be more than one per Gutenberg text. An ebook can have an ISBN, but it's not relevant to ebooks from Gutenberg or Amazon.
The ISBN is used by retailers (or the public) to order a specific version of a book. Library programs can store an ISBN, but should use a unique per-copy ID (barcode, QR code, RFID, etc.) that isn't the EAN/ISBN-type barcode on the book.

Genres are subjective and not all published works have an officially assigned genre.

Either use Gutenberg or a professional Library program.

Also titles are not at all unique. There is no copyright on book titles. I've two unrelated books called "Dancer's Luck".
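
A minimal sketch of the points above, assuming a hypothetical in-house catalogue. None of the class or field names below come from Gutenberg or from any library program, and the author names are placeholders. Each record gets its own unique ID, the title is just a non-unique attribute, and ISBNs are an optional list because one text can exist in several print editions.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from uuid import uuid4


@dataclass
class BookRecord:
    """One catalogue entry; every field name here is hypothetical."""
    title: str
    authors: list[str] = field(default_factory=list)
    # Zero or more ISBNs: each print edition of the same text has its own.
    edition_isbns: list[str] = field(default_factory=list)
    # The catalogue's own identifier, independent of title and ISBN.
    record_id: str = field(default_factory=lambda: uuid4().hex)


def find_by_title(catalogue: dict[str, BookRecord], title: str) -> list[BookRecord]:
    """Return every record with this title; titles are not unique."""
    return [r for r in catalogue.values() if r.title == title]


# Two unrelated books that share a title (placeholder authors).
first = BookRecord("Dancer's Luck", authors=["Author A"])
second = BookRecord("Dancer's Luck", authors=["Author B"])
catalogue = {r.record_id: r for r in (first, second)}
matches = find_by_title(catalogue, "Dancer's Luck")
```

Looking up by title returns a list rather than a single record, so the calling code is forced to handle the duplicate-title case.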

Last edited by Quoth; 03-26-2021 at 12:40 PM.
03-29-2021, 10:33 AM   #3
Hitch
Bookmaker & Cat Slave
Posts: 11,462
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Well, I said this on your other thread, but...fwiw: The sort of people that go to Gutenberg for reading material pretty much already have their own readers. OR, you can just click "HTML" and read in the browser, if you wish. OR, download whatever format and put it on your own device or phone.

Not to mention, some huge percentage of them are already available on Amazon, as (somewhat) formatted eBooks--click and they magically appear on your device. It's not just Pride & Prejudice. We're talking about the same business that has 90% of the English-speaking marketplace.

I'm delighted that anyone thinks that all the FREE books in Gutenberg are somehow going to have enough of a marketplace, a buying demographic out there, to invest money in building a dedicated app, but...I mean, we're living in a world in which few people under the age of 50 watch movies filmed in B&W or before the 80's. So...???

{shrug}. It's your time and your nickel, but damned if I see how you'll monetize this in any sensible way.

Hitch
03-29-2021, 04:05 PM   #4
phillipgessert
Addict
Posts: 311
Karma: 3196258
Join Date: Oct 2015
Location: Madison, WI
Device: Kindle 5th Gen
Love PG and hate to use this expression for it, but it's garbage in, garbage out. If the data's missing, there's nothing for it. Hypothetically you could cobble something together that scrapes some other free resource for the missing data, but it really just moves the problem--someone somewhere had to list it. And it's another potential point of failure.
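
As a rough sketch of what "cobbling something together" might look like once the second source's data is already in hand (no actual scraping here): the function below fills only the empty fields and records where each value came from, so a bad secondary source can be traced later. The function name, the field names, and the sample records are all illustrative assumptions.

```python
from __future__ import annotations


def fill_missing(primary: dict, secondary: dict, fields: tuple) -> tuple[dict, dict]:
    """Return (merged record, provenance); primary values are never overwritten."""
    merged = dict(primary)
    provenance = {}
    for name in fields:
        if merged.get(name):
            provenance[name] = "primary"
        elif secondary.get(name):
            merged[name] = secondary[name]
            provenance[name] = "secondary"
        else:
            # Neither source lists it: garbage in, garbage out.
            provenance[name] = "missing"
    return merged, provenance


# Illustrative records only, not real catalogue output.
pg_record = {"title": "Pride and Prejudice", "author": "Austen, Jane", "genre": ""}
other_record = {"title": "Pride & Prejudice", "genre": "Fiction"}
merged, provenance = fill_missing(
    pg_record, other_record, ("title", "author", "genre", "isbn")
)
```

Fields that neither source lists stay marked as "missing", which is exactly the part that still needs a person.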

You might be better off scaling down to their top 100 or something, and having a human add the missing info. There are some real obscurities on the site and I would be surprised to learn there's a hidden trove of rich metadata for everything, or even most things, on PG.
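
One way the scaled-down approach could be sketched, assuming each record carries some popularity figure (the `downloads` field, the other field names, and the sample data are made up for illustration): rank the records, keep the top N, and list which required fields a human still has to fill in.

```python
def review_queue(records, required, top_n=100):
    """List (id, missing fields) for the top_n most downloaded records."""
    ranked = sorted(records, key=lambda r: r["downloads"], reverse=True)[:top_n]
    queue = []
    for record in ranked:
        missing = [name for name in required if not record.get(name)]
        if missing:
            queue.append((record["id"], missing))
    return queue


# Made-up sample data; a real run would use top_n=100 or similar.
records = [
    {"id": 1, "title": "A", "author": "X", "genre": "", "downloads": 500},
    {"id": 2, "title": "B", "author": "", "genre": "", "downloads": 900},
    {"id": 3, "title": "C", "author": "Z", "genre": "Fiction", "downloads": 700},
    {"id": 4, "title": "D", "author": "", "genre": "", "downloads": 10},
]
queue = review_queue(records, ("author", "genre"), top_n=3)
```

Records that are already complete drop out of the queue, and anything below the cutoff is simply left as it is.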
All times are GMT -4.