Quote:
Originally Posted by Giuseppe Chillem
If you have a directory full of e-books, each ebook has size and CRC32 which are always the same. If you store the CRC32 into the DB of Calibre and you sercch for it (and file size) during a new import you are able to match with 100% accurancy if the file is duplicate and already been imported.
|
This is true, but it doesn't help you very much to know that it's 100% the same book. Maybe the user wants to add it anyway. I had lots of books that were CRC matched, but had different filenames. Each multiple author book was stored under both author names with the author name as part of the title. I wanted them added until I could edit the metadata and list the multiple authors on one copy, then delete the other.
It seems more useful to me to identify duplicates based on title and/or author, then ask. Most of my duplicates weren't 100% CRC duplicates anyway.