![]() |
#1 |
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 115
Karma: 12788
Join Date: Nov 2010
Device: Kindle 3
|
How to merge and eliminate duplicates
Hi,
I have a lot of duplicate books and others where the same book but different formats are listed separately. Is there any easy way to merge formats and delete duplicates? Thanks |
![]() |
![]() |
![]() |
#2 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Select all entries of one book and right click. There should be several merge options available.
|
![]() |
![]() |
![]() |
#3 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,897
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
1) Select the Destination entry FIRST, then select additional entries. M (for merge with delete) or Right-click for more options.
|
![]() |
![]() |
![]() |
#4 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
That's the manual merge method, which is usually the best. OTOH, if he has lots of these duplicates and all have approximately the same metadata, another option is to do it automatically. To do that he can turn on the autosort/automerge option in Export/Import|Adding Books, then copy the entire library into a new library. This process will check each book as it is copied into the new library and when it finds a book that has the same author and nearly the same title as a book that was previously copied, Calibre will copy the new format into the previous record. This method is not suitable for cases where the author/title differ significantly or where the metadata of the first record is worse than the metadata for later books.
|
![]() |
![]() |
![]() |
#5 |
Member
![]() Posts: 19
Karma: 10
Join Date: Aug 2010
Device: iPad
|
In the automerge option method you describe, Starson, how would I determine which version would be copied?
|
![]() |
![]() |
![]() |
#6 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
It's automatic. You don't have control. The first version of each format for each book sent into the new library is kept and any duplicates of that format are ignored.
|
![]() |
![]() |
![]() |
#7 | |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Jan 2011
Device: none
|
Quote:
Part of what I am wondering if it would be worth organizing the books properly (author, title, series) or downloading metadata or any other prework that one could do that would make the duplicate matching process more effective or streamlined. P.S. Happy to help out in testing or other tech stuff as needed too since I am currently out of work. ![]() Last edited by Calliastra; 02-07-2011 at 01:49 AM. Reason: incomplete thought |
|
![]() |
![]() |
![]() |
#8 | ||
Calibre Plugins Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,720
Karma: 2197770
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
As to "instructions", from a Calibre perspective Starson has given you what you need to do if you decide to try that approach. You just need to be aware of the implications: - It will only find duplicates where the authors exactly match. There is no "fuzzy matching" on authors. - You really have very little control over which version will be kept if you have duplicates of a format. As Starson says above it is done by order of "selection" - but if you are doing a bulk library all at once that "selection order" won't mean too much. You could maybe sort by date or something but unless you investigate each book one by one you won't know which version to keep and it could be pot luck. And if you were doing it one by one controlling selections, you wouldn't need Starson's approach and would just use Merge instead ![]() Quote:
Certainly the 1.0 version may "only" have the exact same comparison logic Starson's automerge functionality has - of exact match on author, fuzzy on title. In which case in terms of cleanup preparation getting any author dups sorted is going to greatly increase the success of any dup search on top. If you dont want to resort to sql, just use the tag browser on the left to look down your authors list and with it's alphabetical sorting you can hopefully spot a lot of the common issues like typos, initials, spacings, abbreviations of names etc. Stuff like "E.E.Doc Smith", "E. E. 'Doc' Smith", "E. E. Smith" etc etc - rename the "wrong" author variations and get them down to one. Then at least if you decide to try Starson's described method above (not caring for instance about which EPUB to keep if you have two of them) you are in the best position to do so. That's my 2p for what its worth. |
||
![]() |
![]() |
![]() |
#9 | |||||||
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
![]() Quote:
Quote:
Quote:
Note that the new code in the linked thread only applies to automerge of incoming books. I did not replicate that code to automerge for Copy to Library. My suggestion to Calliastra (the OP) was to consider using Copy to Library. (In that scenario, there is always only one "identical book" since automerge in it's current form silently ignores duplicate formats. It's guaranteed to merge up all books with identical titles and similar titles. It can't currently make duplicate records with automerge on. Quote:
Quote:
Quote:
|
|||||||
![]() |
![]() |
![]() |
#10 | |
Calibre Plugins Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,720
Karma: 2197770
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
![]() Last edited by kiwidude; 02-07-2011 at 07:11 PM. Reason: Linked to other response |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Classic Eliminate margins | primetime34 | Barnes & Noble NOOK | 6 | 12-26-2010 10:07 PM |
Merge feature request (different merge) | Tarran | Calibre | 1 | 05-24-2010 10:57 AM |
eliminate iphone glare | scottjl | Apple Devices | 2 | 04-29-2010 10:05 PM |
How to eliminate blank lines between paragraphs with Calibre | Mr. Goodbar | Calibre | 8 | 06-02-2008 07:39 AM |
utility to eliminate unwanted line breaks in txt | profnachos | Workshop | 11 | 11-27-2007 06:24 PM |