One more thing ...
Calibre has a very powerful tool for identifying duplicates. It lets you search for duplicate books and has many interesting options, including fuzzy searches, soundex searches ... . In some cases the duplicates it finds (or mis-identifies ;-) ) look like found by a black magic. Plus you can mark similar, but not duplicate books as "not duplicate" so when you search for duplicates again they do not come up. As I said, very powerful tool. You will be surprised how many duplicates you have in your database.
|