Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 03-12-2017, 06:56 AM   #1
sarasas
Junior Member
sarasas began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
Metadata download matches too generously

I just upgraded from Calibre 2.75-ish to 2.81 and ran a batch metadata download on items in my database with no previous match and it went horribly wrong.

A lot of entries got matched of with completely unrelated books based on the title name being mentioned in a comment to another book somewhere. Those entries got updated with new titles, authors, tags and comments and were basically replaced with a different book.

I went through them manually to try to restore them and noticed that for some of the book, the right data was available, just not as the first match in the list.

I think matching titles against comments and descriptions of books is a mistake but I cannot find a way to turn it off. I hope this is just a bug that can be easily fixed.
sarasas is offline   Reply With Quote
Old 03-12-2017, 08:47 AM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
There has been no change to how metadata is matched, the only change is that now instead of searching amazon directly, amazon is searched via a search engine instead. Presumably for you problem books, the search engine's top link is not to the book page but to another book.

You can get back the old behavior by customizing the amazon metadata download plugin to download from amazon's servers, but be warned that doing so might result in failures because amazon is trying to block calibre.
kovidgoyal is offline   Reply With Quote
Advert
Old 03-12-2017, 08:49 AM   #3
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
And post the title and author for some of these problem books, and I wil see if I can tighten the results a bit.
kovidgoyal is offline   Reply With Quote
Old 03-12-2017, 06:02 PM   #4
sarasas
Junior Member
sarasas began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
An example is my entry named with
title: "Bitter Almonds"
author: "Dorothy L. Sayers"

Download metadata searches "Goodreads, Barnes & Noble, Google, Amazon.com".
It gets 2 hits, both from Amazon.com. One is "Bitter Almonds: Recollections and Recipes From a Sicilian Girlhood" by "Maria Grammatico" and the other is "Bitter Almonds : The True Story of Mothers, Daughters, and the Seattle Cyanide Murders" by Gregg Olsen. Neither is right, but is at least in the same general area.

A more extreme example is an entry
title: "Nebuchadnezzar"
author: "Dorothy L. Sayers"

Here the download matches it to "Psalm 119: The Diary of a Captive" by "Gene Cunningham". The description of that book contains the text "This study of Psalm 119 is based on the assumption that the author was one of the captives taken in Nebuchadnezzar's third and last deportation of the Jews to Babylon" which includes the string Nebuchadnezzar.
sarasas is offline   Reply With Quote
Old 03-13-2017, 03:51 AM   #5
Eve1972
Junior Member
Eve1972 began at the beginning.
 
Posts: 4
Karma: 10
Join Date: May 2016
Device: Samsung Tab S2
I have been having the same issue actually. I always batch download metadata for the books I import and before I updated it would accurately fetch the correct book info 98% of the time. Since the update, that accuracy has dropped to 50% if that.

I am going to try the server trick and see if that helps!
Eve1972 is offline   Reply With Quote
Advert
Old 03-14-2017, 04:05 AM   #6
sarasas
Junior Member
sarasas began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
kovidgoyal, maybe Calibre could do some internal matching of title and author after fetching the results from the search, and if they don't match, discard them? Because it does seem the search interface is a bit too "helpful".
sarasas is offline   Reply With Quote
Old 03-14-2017, 06:51 AM   #7
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Yes, that's what I meant when I said


"And post the title and author for some of these problem books, and I wil see if I can tighten the results a bit."
kovidgoyal is offline   Reply With Quote
Old 03-15-2017, 12:14 AM   #8
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
This should take care of it: https://github.com/kovidgoyal/calibr...b56f98687efc7b
kovidgoyal is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Regarding using metadata objects in identify method of metadata download plugin api aprekates Development 1 07-06-2014 03:35 AM
Failed to download metadata (both metadata & cover) EddieSean Calibre 0 01-31-2013 09:49 PM
Why tags download only after second click on "Download metadata"? fufu42 Library Management 2 12-08-2012 12:08 PM
[Metadata Download Plugin] Goodreads Metadata **Deprecated** kiwidude Plugins 30 04-23-2011 02:10 PM
Does "Download Metadata & Covers" also download social metadata? iridius Library Management 3 02-22-2011 12:50 PM


All times are GMT -4. The time now is 06:17 AM.


MobileRead.com is a privately owned, operated and funded community.