View Single Post
Old 09-10-2012, 02:02 PM   #15
thehawkman
Junior Member
thehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheesethehawkman can extract oil from cheese
 
Posts: 9
Karma: 1234
Join Date: Jul 2012
Device: none
Quote:
I personally wouldn't use checksums to compare ebooks anyway. I'm less interested in whether a book is byte-for-byte identical to another, than if they are just similar. An epub of Gulliver's Travels downloaded from Project Gutenberg is likely going to have a different checksum than an epub of Gulliver's Travels downloaded from somewhere else, and yet by pretty much any definition they're the same book.
Not necessarily.


See this? Same file, different locations. And that's the lucky case where it has the same name, in both locations. Since the checksums were identical I was able to find them, but if I had processed them using calibre, I would be unable to find them. That's two files, 17 MB wasted. But what if there are hundreds. To speak nothing of the mess of having the same file under multiple names. So. The way I am doing it is, drag the file to Calibre, click on it -> edit metadata -> download metadata individually -> Click the "download metadata button" -> Accept the proposed covers -> OK -> then "save to disk". Either writing the metadata to the file, or the saving to disk part changes the file.
See here:


The first file is the one I renamed using Calibre. The second is the original.
The size is different: 6.50 MB for the renamed file, 6.52 for the original. For all intents and purposes this makes them two different files, which I can't spot in any way (checksum is different, file name is different). Unless Calibre can rename my files without altering them, I will be forced to rename everything manually. And if it can please tell me how.
thehawkman is offline   Reply With Quote