12-31-2012, 01:51 PM | #1 |
Junior Member
Posts: 6
Karma: 10
Join Date: Jun 2005
|
Epub comparision tools?
Ok, while the binary comparison in Calibre will eliminate true duplicate files, the use of different versions of Calibre (or other tools), and UUIDs mean that even identical version of an ebook prepared by 2 different people are not identical.
https://github.com/takahashim/epubdiff is a great tool, that essentially unzips 2 epubs and diff compares them. I've hacked on it (adding some command line options to diff) to ignore .opf (where the main trivial differences are), .otf (font differences), and .ncx (minor diffs), so that books which are otherwise similar will not show those differences, and thus be considered the same. It would be _really_ nice, if a similar means was in Calibre, to compare if a book's _actual_ content (and not metadata, etc) was identical. A batch method, to run through an entire library (or 2) would be amazing. How are other people dealing with this issue? |
12-31-2012, 01:58 PM | #2 |
Junior Member
Posts: 6
Karma: 10
Join Date: Jun 2005
|
http://zarb.org/~gc/html/diffzips.html
Will essentially do the same thing (using Perl, and tweaks to handle epub) |
Advert | |
|
12-31-2012, 10:55 PM | #3 |
creator of calibre
Posts: 43,835
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
It should be fairly simple to add an ignore uuids/other metadata option to the binary comparison routine in the find duplicates plugin, but you should post in that plugins' thread.
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Kindle DX and Boox M92 comparision | Marrko | Onyx Boox | 12 | 02-26-2012 11:35 AM |
Tools for converting MOBI->EPUB | Syncopated | Workshop | 1 | 09-25-2011 04:42 PM |
Announcing: epub-tools software | dino8352 | General Discussions | 0 | 04-23-2011 01:06 PM |
tools for epub creation | Toxaris | ePub | 15 | 03-05-2010 04:54 AM |
epub creation tools | jbenny | ePub | 20 | 03-13-2009 12:30 PM |