Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management

Notices

Reply
 
Thread Tools Search this Thread
Old 12-31-2012, 01:51 PM   #1
sethcohn
Junior Member
sethcohn began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Jun 2005
Epub comparision tools?

Ok, while the binary comparison in Calibre will eliminate true duplicate files, the use of different versions of Calibre (or other tools), and UUIDs mean that even identical version of an ebook prepared by 2 different people are not identical.

https://github.com/takahashim/epubdiff

is a great tool, that essentially unzips 2 epubs and diff compares them.
I've hacked on it (adding some command line options to diff) to ignore .opf (where the main trivial differences are), .otf (font differences), and .ncx (minor diffs), so that books which are otherwise similar will not show those differences, and thus be considered the same.

It would be _really_ nice, if a similar means was in Calibre, to compare if a book's _actual_ content (and not metadata, etc) was identical. A batch method, to run through an entire library (or 2) would be amazing.

How are other people dealing with this issue?
sethcohn is offline   Reply With Quote
Old 12-31-2012, 01:58 PM   #2
sethcohn
Junior Member
sethcohn began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Jun 2005
http://zarb.org/~gc/html/diffzips.html
Will essentially do the same thing (using Perl, and tweaks to handle epub)
sethcohn is offline   Reply With Quote
Advert
Old 12-31-2012, 10:55 PM   #3
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,835
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
It should be fairly simple to add an ignore uuids/other metadata option to the binary comparison routine in the find duplicates plugin, but you should post in that plugins' thread.
kovidgoyal is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Kindle DX and Boox M92 comparision Marrko Onyx Boox 12 02-26-2012 11:35 AM
Tools for converting MOBI->EPUB Syncopated Workshop 1 09-25-2011 04:42 PM
Announcing: epub-tools software dino8352 General Discussions 0 04-23-2011 01:06 PM
tools for epub creation Toxaris ePub 15 03-05-2010 04:54 AM
epub creation tools jbenny ePub 20 03-13-2009 12:30 PM


All times are GMT -4. The time now is 05:43 AM.


MobileRead.com is a privately owned, operated and funded community.