03-12-2017, 06:56 AM | #1 |
Junior Member
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
|
Metadata download matches too generously
I just upgraded from Calibre 2.75-ish to 2.81 and ran a batch metadata download on items in my database with no previous match and it went horribly wrong.
A lot of entries got matched of with completely unrelated books based on the title name being mentioned in a comment to another book somewhere. Those entries got updated with new titles, authors, tags and comments and were basically replaced with a different book. I went through them manually to try to restore them and noticed that for some of the book, the right data was available, just not as the first match in the list. I think matching titles against comments and descriptions of books is a mistake but I cannot find a way to turn it off. I hope this is just a bug that can be easily fixed. |
03-12-2017, 08:47 AM | #2 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There has been no change to how metadata is matched, the only change is that now instead of searching amazon directly, amazon is searched via a search engine instead. Presumably for you problem books, the search engine's top link is not to the book page but to another book.
You can get back the old behavior by customizing the amazon metadata download plugin to download from amazon's servers, but be warned that doing so might result in failures because amazon is trying to block calibre. |
Advert | |
|
03-12-2017, 08:49 AM | #3 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
And post the title and author for some of these problem books, and I wil see if I can tighten the results a bit.
|
03-12-2017, 06:02 PM | #4 |
Junior Member
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
|
An example is my entry named with
title: "Bitter Almonds" author: "Dorothy L. Sayers" Download metadata searches "Goodreads, Barnes & Noble, Google, Amazon.com". It gets 2 hits, both from Amazon.com. One is "Bitter Almonds: Recollections and Recipes From a Sicilian Girlhood" by "Maria Grammatico" and the other is "Bitter Almonds : The True Story of Mothers, Daughters, and the Seattle Cyanide Murders" by Gregg Olsen. Neither is right, but is at least in the same general area. A more extreme example is an entry title: "Nebuchadnezzar" author: "Dorothy L. Sayers" Here the download matches it to "Psalm 119: The Diary of a Captive" by "Gene Cunningham". The description of that book contains the text "This study of Psalm 119 is based on the assumption that the author was one of the captives taken in Nebuchadnezzar's third and last deportation of the Jews to Babylon" which includes the string Nebuchadnezzar. |
03-13-2017, 03:51 AM | #5 |
Junior Member
Posts: 4
Karma: 10
Join Date: May 2016
Device: Samsung Tab S2
|
I have been having the same issue actually. I always batch download metadata for the books I import and before I updated it would accurately fetch the correct book info 98% of the time. Since the update, that accuracy has dropped to 50% if that.
I am going to try the server trick and see if that helps! |
Advert | |
|
03-14-2017, 04:05 AM | #6 |
Junior Member
Posts: 4
Karma: 10
Join Date: Mar 2017
Device: Kindle
|
kovidgoyal, maybe Calibre could do some internal matching of title and author after fetching the results from the search, and if they don't match, discard them? Because it does seem the search interface is a bit too "helpful".
|
03-14-2017, 06:51 AM | #7 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Yes, that's what I meant when I said
"And post the title and author for some of these problem books, and I wil see if I can tighten the results a bit." |
03-15-2017, 12:14 AM | #8 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
This should take care of it: https://github.com/kovidgoyal/calibr...b56f98687efc7b
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Regarding using metadata objects in identify method of metadata download plugin api | aprekates | Development | 1 | 07-06-2014 03:35 AM |
Failed to download metadata (both metadata & cover) | EddieSean | Calibre | 0 | 01-31-2013 09:49 PM |
Why tags download only after second click on "Download metadata"? | fufu42 | Library Management | 2 | 12-08-2012 12:08 PM |
[Metadata Download Plugin] Goodreads Metadata **Deprecated** | kiwidude | Plugins | 30 | 04-23-2011 02:10 PM |
Does "Download Metadata & Covers" also download social metadata? | iridius | Library Management | 3 | 02-22-2011 12:50 PM |