|12-31-2012, 02:51 PM||#1|
Join Date: Jun 2005
Epub comparison tools?
Ok, while the binary comparison in Calibre will eliminate true duplicate files, the use of different versions of Calibre (or other tools) and differing UUIDs mean that even identical versions of an ebook prepared by 2 different people are not byte-for-byte identical.
is a great tool, that essentially unzips 2 epubs and diff compares them.
I've hacked on it (adding some command line options to diff) to ignore .opf (where the main trivial differences are), .otf (font differences), and .ncx (minor diffs), so that books which are otherwise similar will not show those differences, and thus be considered the same.
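For anyone who wants to try the same idea without an external diff, here is a minimal Python sketch. It is not the tool mentioned above and not anything from Calibre; it just treats each epub as a zip archive, hashes every entry, and skips the three extensions listed in this post. The names `IGNORED_SUFFIXES`, `content_digests` and `differing_entries` are made up for illustration.

```python
import hashlib
import zipfile
from pathlib import PurePosixPath

# Assumption: the same extensions the post chooses to ignore.
IGNORED_SUFFIXES = {".opf", ".otf", ".ncx"}


def content_digests(epub_path, ignored=IGNORED_SUFFIXES):
    """Map each non-ignored entry name in the epub to a SHA-256 of its bytes."""
    digests = {}
    with zipfile.ZipFile(epub_path) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue  # directory entries carry no content
            if PurePosixPath(info.filename).suffix.lower() in ignored:
                continue  # metadata, fonts, and NCX are deliberately skipped
            digests[info.filename] = hashlib.sha256(zf.read(info)).hexdigest()
    return digests


def differing_entries(epub_a, epub_b):
    """Return the sorted names of entries that are missing from one book
    or whose bytes differ between the two books."""
    a = content_digests(epub_a)
    b = content_digests(epub_b)
    return sorted(name for name in a.keys() | b.keys() if a.get(name) != b.get(name))
```

An empty list from `differing_entries("one.epub", "two.epub")` would mean the two books match on everything except the ignored file types. Note that this compares entries by their exact path inside the archive, so two books with the same text but a different internal folder layout would still show up as different.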
It would be _really_ nice if Calibre had a similar way to check whether a book's _actual_ content (and not metadata, etc.) is identical. A batch method to run through an entire library (or 2) would be amazing.
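A rough sketch of what such a batch pass could look like outside Calibre, in Python. This is not an existing Calibre feature. It assumes each library is simply a directory tree containing `.epub` files, and the function names (`content_fingerprint`, `find_content_duplicates`) are hypothetical. Each book is reduced to a single hash over its non-ignored entries, and books sharing a hash are grouped together.

```python
import hashlib
import zipfile
from collections import defaultdict
from pathlib import Path

# Assumption: skip the same file types discussed in this thread.
SKIPPED_SUFFIXES = (".opf", ".otf", ".ncx")


def content_fingerprint(epub_path):
    """One SHA-256 over the names and bytes of all non-skipped entries,
    visited in sorted name order so zip ordering does not matter."""
    h = hashlib.sha256()
    with zipfile.ZipFile(epub_path) as zf:
        for name in sorted(zf.namelist()):
            if name.endswith("/") or name.lower().endswith(SKIPPED_SUFFIXES):
                continue
            data = zf.read(name)
            # Length-prefix the data so entry boundaries stay unambiguous.
            h.update(name.encode("utf-8") + b"\0")
            h.update(len(data).to_bytes(8, "big"))
            h.update(data)
    return h.hexdigest()


def find_content_duplicates(*library_dirs):
    """Group epub paths from one or more directory trees by fingerprint
    and return only the groups holding more than one book."""
    groups = defaultdict(list)
    for root in library_dirs:
        for path in sorted(Path(root).rglob("*.epub")):
            groups[content_fingerprint(path)].append(path)
    return [paths for paths in groups.values() if len(paths) > 1]
```

Calling `find_content_duplicates("library_one", "library_two")` would return lists of paths whose content matches once the skipped file types are left out. A corrupt epub would raise `zipfile.BadZipFile` here, so a real run over a large library would probably want to catch and log that.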
How are other people dealing with this issue?
|12-31-2012, 11:55 PM||#3|
creator of calibre
Join Date: Oct 2006
Location: Mumbai, India
It should be fairly simple to add an ignore uuids/other metadata option to the binary comparison routine in the find duplicates plugin, but you should post in that plugin's thread.