Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 07-18-2019, 08:59 PM   #1
stan999
Member
stan999 began at the beginning.
 
Posts: 21
Karma: 24
Join Date: Jul 2019
Device: none
Angry Metadata search frustration & other

Coming from a Mac, although the interface is the interface is not the prettiest (hate anything windows aesthetics really), I made an exception in this case because calibre is a powerful, nest piece of software. With that said, the metadata search utility is the worst of any metadata search tool I’ve come across - and I use several other tools making sure my information is accurate. It almost never finds the correct information when inserting an isbn. I know all the rules of metadata search and I’m not doing anything wrong in that respect. It consistently pulls out metadata for books in wrong editions, publication release dates, etc. it is becoming a nightmare working with caliber's metadata search and leaves me manually curate metadata fields. I’ve tried every combination possible e.g. amazon (all servers), amazon international plugin (all server combination), google books, and all other book stores. I’ve ticked and untucked all possible combinations of dates, publishers .... it is not pulling the right information form these databases. Delicious, LibraryThing, bookpedia, bookbuddy, bookcrawler, Pocketpedia, CLZ books (some I use for accuracy reference) are all extracting the right information. Clearly this suggests there is an inherent flaw with caliber’s metadata search engine. Not to say it cannot pull one tag from amazon anymore. This is a nice tool but please do something about your metadata search feature, it’s horendous.
Regards,

Stan
stan999 is offline   Reply With Quote
Old 07-19-2019, 06:55 AM   #2
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
[Smile on] Sorry that you can't find a Apple solution... [Smile off]

To be honest, why do you think, this is an calibre problem? You are on the wrong side of the chain. You need to go to the source and this is obviously not calibre. calibre is only pulling what it becomes from metadata source providers and nothing else. You can take a look into the protocol and see what exactly providers are sending back. It often is crap and quite a wonder if you get the correct information. Use their API's and test it yourself if you don't believe it. An other point is that users tend to provide too much information for a query. E.g. if you have a ISBN available, provide only this for a query and nothing else. A ISBN is mostly unique for a publication and every data more will give you more not accurate results.

I did and do this quite often and all I can say is that you will be faster if you do it manual and check the data afterwards with other sources you may trust (in case you want accurate metadata).

Your problem in the end is, that there is no source available that have for all books of the world all kind of metadata available. Also publishers and authors (equal if they are self publishers or not) do not provide accurate metadata at all. It is a pain in the neck that they are not able to produce a correct set of DC metadata these days. It seems to be quite new for them. The description is public available since end of the nineties last century...

I'm doing this kind of research since 10 or 15 years and can trust you this is a never ending mess and quite impossible to find a source you can trust to 100%. It begins with publishers and authors who don't want to do this job accurate. Best resources I use are mostly national library systems around the world. But the quality there also is depending on how old the metadata are as metadata systems and definitions change and grow during the past decades. This is why The Dublin Core Metadata Initiative (DCMI) came in place long time ago (driven by libraries and their systems).

And because you mention Amazon - this is for my metadata searches one of the worst source of metadata. They not only change quite often their API (I feel like they don't want that anyone else use their API's), the "quality" ... is most time I take a look to it also a mess.

By the way, there is a nice plugin from DaltonST that is quite good when you need more than Amazon quality. It is called Library codes. It queries DC metadata from Library of Congress, VIAF, ISNI and Worldcat and save you a lot of manual work if data are available. An other available plugin is Citronalco's DNB_de (metadata from Deutsche Nationalbibliothek API, you need to ask for access and register at DNB before you can use it).

As a last point, feel free to make your own plugin for collecting correct metadata. calibre have a wonderful and well documented interface - you only need to use it. You can also try to improve / modify existing plugins. Guess, Kovid will be quite happy to implement your modifications once they are available.
Divingduck is offline   Reply With Quote
Advert
Old 07-19-2019, 07:23 AM   #3
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,041
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
I will second the other duck

Inconsistent is the best I can say. And that is for a SINGLE source. Use multiple sites

Who is to blame? Everyone from the book Author, Publisher to the Fan who contributed to review sites . Even YOU. We all want OUR metadata in just a certain form. Never mind, that others use a different one. We use OUR version

Visit the index of plugins. I use a number, but Import List is one of my Fav's for pre-populating (wish list) or updating the Library.

Don't even start on Series naming.
Tags? Some even think their name should also be a Tag. Toss in the Kitchen sink seems to be the rule some Indies use.
theducks is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Regex in search problems (NOT Search&Replace; the search bar) lairdb Calibre 3 03-15-2017 07:10 PM
Arc & Incompatible Apps = Frustration chattylibrarian Kobo Tablets 26 03-23-2014 04:11 PM
Edit Metadata Search & Replace tarisea Library Management 8 12-26-2012 02:46 PM
Metadata Search & Replace - when it doesn't match Aldebaranian Library Management 4 09-28-2011 11:35 AM
Setting series index in bulk metadata search&replace bubak Calibre 4 12-19-2010 04:04 PM


All times are GMT -4. The time now is 12:46 PM.


MobileRead.com is a privately owned, operated and funded community.