Quote:
Originally Posted by kovidgoyal
The problem isn't that files with the same hash will be different, the problem is that files with different hashes may be the same.
Exactly. During my importing process, I found ebooks I'd obtained from Project Gutenberg years ago, along with later versions of the same books that PG had edited to fix scanning errors. Many of the near-duplicates I'd originally obtained in format 1 had also been converted to formats 2 and 3 in one of my mass conversion efforts, which produced still more near-duplicates.
I'm not saying it's useless to know which files have the same hash, but Calibre can't use that information to do anything for me automatically. It will still have to ask what I want done. Sometimes, even when the hash matches, I want the file added anyway (multiple-author situations), and other times, even when the hashes differ, I don't want it added (it's the same book, but an earlier version without my bookmarks or with scanning errors not yet corrected).
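To make that concrete, here's a rough sketch of what hash matching can and can't tell you. This is not how Calibre works internally; the `group_by_hash` helper, the file names, and the contents are all made up for illustration, and SHA-256 is just an assumed choice of hash.

```python
import hashlib
from collections import defaultdict


def group_by_hash(files):
    """Group file names by the SHA-256 digest of their raw bytes."""
    groups = defaultdict(list)
    for name, data in files.items():
        groups[hashlib.sha256(data).hexdigest()].append(name)
    return dict(groups)


# Hypothetical in-memory "library": file name -> file contents.
library = {
    "book_v1.txt": b"It was the best of tirnes",    # old copy with a scanning error
    "book_v2.txt": b"It was the best of times",     # later, corrected edition
    "book_copy.txt": b"It was the best of times",   # byte-for-byte copy of v2
}

groups = group_by_hash(library)

# Only byte-identical files end up in the same group.
exact_duplicates = [sorted(names) for names in groups.values() if len(names) > 1]
print(exact_duplicates)
```

The byte-identical copy is caught, but the old version with the scanning error hashes differently even though it's the same book, so it slips through. Either way, something still has to ask the user which copy to keep.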