Quote:
Originally Posted by kiwidude
One point I did not mention which you brought up again with the find_identical_books comment. One of the common things I find is that the built-in logic that Calibre has at time of import does you no good if the filename was not "close enough" at the time of import. For instance a common thing I miss is to not spot a missing space between the series hyphen and the title. So my title gets imported with a name like "Series X-Title name" which the Calibre logic cannot pick up. Now that is easy to spot in Calibre when you review your newly added books, you fix the title/series up correctly and think job done. However of course that can now result in a duplicate.
|
Yes. This specific issue is an interaction between the regex used to identify the title and series, and the autosort/automerge code that compares the title passed to it by the regex with the title of existing book records. The missing space caused the regex to think the title was "Series X-Title name" and that didn't match the book title of "Title name."
Quote:
My point being that regardless of how much cleverness goes into the "merge" logic, there will always be situations where as the result of an edit you now have a duplicate that only some sort of post-check can pickup, replicating similar and other more extensive checks.
|
Agreed. I've started to put together a dupe checker a few times, but my motivation is low now that my library is in good shape.