Just curious. How do you tell that a file is physically the same? File name and size, or scanning the whole book? If it is file name and size, that is not foolproof, but it is pretty easy for a person to do manually.
Calibre seems to have a much more sophisticated approach, which does not find every duplicate book but does find a remarkable number.
AFAIK it has not caught all the duplicates (impossible, I think), but I imported 47,000 files and ended up with 21,000 ebooks.
I had already sorted out the obvious duplicates based on file name and size (that took about an hour). Needless to say, I was impressed at how much work Calibre did for me. It might have missed some duplicates but, unlike what you imply, it does not seem to have mismatched any. And Calibre never destroys your original copy. Nothing dangerous there that I can see.
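For anyone who wants to try the manual route, here is a minimal sketch in Python of the two checks mentioned above: a cheap comparison by file size, followed by scanning the whole file. It is only an illustration and not how Calibre works internally. The names `file_digest` and `find_duplicates` are made up for this example.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group files under root that are byte-for-byte identical."""
    # Cheap first pass: only files of the same size can be identical.
    by_size = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            by_size[os.path.getsize(path)].append(path)

    # Expensive second pass: hash only the files that share a size.
    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) > 1:
            for path in paths:
                by_hash[file_digest(path)].append(path)

    # Keep only groups that contain more than one file.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicates("/path/to/ebooks")` (a placeholder path) returns a list of groups, each a list of paths with identical contents. A check like this only catches exact copies. It will not match the same book saved in two formats or with different embedded metadata, which is the kind of case where a metadata-based comparison would be needed instead.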
I don't imagine it will ever perform magic tasks such as figuring out each individual user's file-naming conventions and directory structures, but if you spend a little time using it, it will make it easier for you to do this yourself.
For instance, you could add the files from one directory or group of directories and then use bulk edit to put in the appropriate tags.
BTW, Calibre crashed on my first import attempt, but it did import my 47,000 files without crashing when I used my spare laptop solely for that purpose. It took about a day, but it did it.
Helen