Quote:
Originally Posted by dwanthny
Excellent point, anyone wishing to use MD5 hashes to find dupes should do so prior to adding the books.
|
Couldn't agree more - there are reasons why we haven't bothered to mention any kind of hash duplicate comparisons in the proposals for duplicate functionality in Calibre and you have covered most of them.
Personally I am very much in the "before you add to Calibre" camp. Why waste your time cleaning up filenames of files (or fixing up metadata inside Calibre)? Just run a hash comparison using any one of a number of free utilities out there on the internet first on your source folder and Calibre, then delete from the source folder. Don't directly delete from Calibre's folders though - or if you do you will need to run one of the repair database options to get Calibre's internal database matching the fact that a book format is no longer present.
Quote:
Just viewing a epub in calibre changes the book file and thus the hash due to adding or changing the bookmark.
|
Ok, so this used to "always" be the case. Then a number of us campaigned to have an option available such that the EPUB would *not* be touched. It screws with incremental backups and obviously hash comparisons for those of use who have no interest in "touching" an EPUB just by opening it in the viewer. I wrote a hacky patch then chaley from memory did the job "properly" in a Calibre release a few months ago.
In the ebook viewer preferences if you disable "Remember the current page when quitting" and don't add bookmarks then your EPUB should remain untouched - or at least that was the hope

.