That is what I am doing currently (well except with sha256). I just tried it with sha512 and that made no difference on my smaller test library anyways in terms of the false positives.
I ran a third party product (Duplicate Cleaner) to see what it's performance was like. Maybe 4.5 minutes isn't so bad after all.
Perhaps I now just need to add a third pass to compare file streams - is there an efficient way of doing that?
|