Old 09-01-2013, 02:56 PM   #421
Merischino
Groupie
Merischino ought to be getting tired of karma fortunes by now.
Posts: 183
Karma: 357868
Join Date: Jul 2010
Location: somewhere south of the mason dixon line
Device: Nexus 7 FHD (aka 2013, 2nd gen), Kindle 2, Samsung Galaxy s3
Thanks kiwidude, I will. (I did call that merge discussion OT in my original post... sorry for taking us off topic.)
Old 09-02-2013, 05:50 PM   #422
Sidetrack
Enthusiast
Sidetrack began at the beginning.
 
Posts: 34
Karma: 10
Join Date: Jan 2009
Location: South Pacific
Device: Kindle DX
I'm sure this has been asked before, but what are the chances of adding the ability to check for series spread out across two libraries?
Old 09-04-2013, 10:55 AM   #423
LadyKate
Groupie
LadyKate ought to be getting tired of karma fortunes by now.
 
Posts: 175
Karma: 673232
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
A question about this plugin: does the binary search ignore the bookmark file that is inserted when the book is opened and looked at?
Is there any chance of a duplicate finder that would compare the book part but not the tags and/or bookmark file?
Old 09-07-2013, 07:13 AM   #424
kiwidude
calibre/Sigil Developer
kiwidude ought to be getting tired of karma fortunes by now.
 
Posts: 4,228
Karma: 1334002
Join Date: Oct 2010
Location: London, UK
Device: Kindle Paperwhite 3G, iPad 3, iPad Air
@Sidetrack - if it has been asked for before then it was a long time ago; then again, I do have a goldfish memory and it has been a long time since I updated this plugin. It's an interesting idea, and I will have a think about it the next time I look at this plugin.

@LadyKate - a binary search is an *exact* match type search on the underlying files, so yes bookmarks screw things up. Personally I turned off calibre bookmarks a long, long time ago because (a) I don't read books using the calibre viewer, and (b) I was getting annoyed with the contents and timestamps changing every time I opened one up to look at it.

All I can suggest is turning off the feature (in the Preferences in the calibre ebook viewer application itself), and then using the Quality Check and Modify ePub plugins to find ePubs with Bookmarks and remove the bookmarks respectively.
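
To illustrate why an exact match is thrown off by bookmarks, and what a comparison of "the book part" only might look like, here is a minimal Python sketch. It is not how the plugin works. It assumes the viewer keeps its bookmarks in an archive member named META-INF/calibre_bookmarks.txt (check your own files), and both function names are made up for this example.

```python
import hashlib
import zipfile

# Assumption: the calibre viewer stores its bookmarks in this archive member.
IGNORED_MEMBERS = {"META-INF/calibre_bookmarks.txt"}


def file_digest(path):
    """SHA-256 of the raw file bytes, i.e. an exact (binary) comparison key."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def content_digest(epub_path, ignored=IGNORED_MEMBERS):
    """SHA-256 over member names and bytes inside the EPUB, skipping ignored members."""
    h = hashlib.sha256()
    with zipfile.ZipFile(epub_path) as zf:
        for name in sorted(zf.namelist()):
            if name in ignored or name.endswith("/"):
                continue  # skip the bookmark file and directory entries
            data = zf.read(name)
            h.update(name.encode("utf-8"))
            h.update(len(data).to_bytes(8, "big"))  # length prefix keeps members distinct
            h.update(data)
    return h.hexdigest()
```

Two copies of a book that differ only by the bookmark member get different file_digest values but the same content_digest. Tags written into the OPF metadata would still make the two differ; skipping those would need the OPF to be parsed rather than ignored wholesale.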
Old 10-02-2013, 02:41 PM   #425
halfcore
Junior Member
halfcore began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Sep 2013
Device: ibook
kiwidude,

Is there a way to do a Title/Author search between two libraries that will find all the books from the authors in the 2nd library and mark all titles by those authors in the first? I tried the library search with exact author search and Ignore title, but it only gives me a log of matches. I would like to mark those matched books as well, for removal or a move to another library. FYI, I am trying to remove all the erotic books from the main library so that it is suitable for general consumption.

Thanks,

halfcore

Last edited by halfcore; 10-02-2013 at 03:58 PM. Reason: Typos
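
One possible workaround outside the plugin, sketched in Python under stated assumptions: read the two libraries' metadata.db files directly and list the books in the main library whose authors also appear in the second one. The table and column names (books, authors, books_authors_link) are my understanding of calibre's database layout, and the function names are invented for this example. Run it against copies of the databases, never the live files.

```python
import sqlite3


def author_names(metadata_db):
    """Return the set of author names stored in one library's metadata.db."""
    con = sqlite3.connect(metadata_db)
    try:
        return {row[0] for row in con.execute("SELECT name FROM authors")}
    finally:
        con.close()


def books_by_authors(metadata_db, names):
    """Return sorted (book id, title) pairs for books with an author in `names`."""
    con = sqlite3.connect(metadata_db)
    try:
        rows = con.execute(
            "SELECT b.id, b.title, a.name FROM books b "
            "JOIN books_authors_link l ON l.book = b.id "
            "JOIN authors a ON a.id = l.author"
        ).fetchall()
    finally:
        con.close()
    return sorted({(book_id, title) for book_id, title, name in rows if name in names})


def search_expression(book_ids):
    """Build a search string of the form 'id:1 or id:3' (assumed calibre search syntax)."""
    return " or ".join(f"id:{book_id}" for book_id in book_ids)
```

The resulting expression could be pasted into the search box of the main library, after which the matched books can be selected and moved or removed. For very large result sets the expression gets long, so it may be more practical to work in batches.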
Old 10-07-2013, 09:26 AM   #426
AnselmD
Member
AnselmD began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Oct 2013
Device: none
find duplicate covers

to be able to find duplicate covers would be great.
Old 10-07-2013, 10:06 AM   #427
theducks
Grand Sorcerer
theducks ought to be getting tired of karma fortunes by now.
Posts: 14,843
Karma: 5654321
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Quote:
Originally Posted by AnselmD
to be able to find duplicate covers would be great.
That would take image analysis (and a hefty CPU).
All covers are stored inside calibre under the same name: cover.jpg

Switch to Grid view with the Details and Tag Browser hidden to show the maximum number of covers, then page through the library.
When you see a candidate, select the image and tap I for information, or turn off the Grid (and decide).
Old 10-07-2013, 07:30 PM   #428
Sabardeyn
Guru
Sabardeyn ought to be getting tired of karma fortunes by now.
Posts: 629
Karma: 1242364
Join Date: May 2009
Location: The Right Coast
Device: PC (Calibre), Nexus 7 2013 (Moon+ Pro), HTC HD2/Leo (Freda)
With regards to duplicate covers...

I have no idea what would be involved, but finding approximate similarity might be acceptable for images so something like an MD5 checksum comparison might work. The results wouldn't be error free, particularly for a series of books with a rigidly enforced layout/logo, but it would substantially cut down on the number of possible duplicates. Eyeballing a couple thousand covers is not reasonable.

Of course, it also depends on what kind of duplication the OP is looking for. Either multiple books with the same cover image, or multiple cover images within a single ebook.

(Personally this feature would be of no use to me as I explode anthologies and short story collections. I prefer story titles over book titles. So I am going to have multiple books with the same cover.)
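
A minimal sketch of the checksum idea in Python, for anyone who wants to try it. One caveat: an MD5 comparison only groups files that are byte-for-byte identical, so it will not catch covers that are merely similar (resized, recompressed or cropped). The sketch assumes each book folder in the library holds its cover under one fixed file name (cover.jpg here, which is an assumption; pass another cover_name if your library differs). The function names are invented for this example.

```python
import hashlib
import os
from collections import defaultdict


def md5_of_file(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, read in chunks to keep memory use low."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def group_identical_covers(library_root, cover_name="cover.jpg"):
    """Map each digest shared by two or more cover files to the sorted list of their paths."""
    groups = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(library_root):
        if cover_name in filenames:
            path = os.path.join(dirpath, cover_name)
            groups[md5_of_file(path)].append(path)
    # Keep only digests that occur more than once, i.e. actual duplicates.
    return {digest: sorted(paths) for digest, paths in groups.items() if len(paths) > 1}
```

The function only reports groups and deletes nothing, so what happens to the duplicates stays a manual decision.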
Old 10-07-2013, 08:07 PM   #429
BetterRed
null operator
BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 3,611
Karma: 2183656
Join Date: Mar 2012
Location: NSW Australia
Device: none
Quote:
Originally Posted by AnselmD
to be able to find duplicate covers would be great.
@AnselmD - I use a defunct program called DupDetector from Prism Systems; it's a bit clunky UI-wise, but it's the best free duplicate image finder I know of. There are a few programs out there with the same name - make sure you get the one from Prism.

It's been a couple of years since I last looked for similar programs; nothing came close, including some add-ons (plugins) to high-end, expensive commercial products.

I don't have it installed on my 'calibre' system, so I can't test it on a library right now. But I've used it on friends' systems to dedup tens of thousands of images scattered around their disk drives. It analyses the image properties, not the file properties: edge analysis, shape detection, colour histograms, that sort of thing. And it detects mirrors, flips, monochrome, etc.

The only thing I'm unsure of is whether it could handle 1,000s of images all with the same name; theoretically it shouldn't care, but... So don't use it or anything like it without doing a backup first, because they all have options to delete the duplicates, etc.

As theducks said, these things ain't super fast, but the last system I deduped was a fairly modest Toshiba laptop. It probably took me a day, and it had tens of thousands of high-res images (8-12MP+) on it. I can't recall how much disk space I recovered, but along with the crud and bloat I removed, it should mean the owner will get another couple of years' use from it.
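
To give a feel for comparing image properties rather than file bytes, here is a small Python sketch of an "average hash", a common perceptual-hash technique. It is not DupDetector's algorithm, which I have no details of. It works on an already decoded grayscale image given as a list of rows of pixel values, so decoding the JPEG (with whatever imaging library you prefer) is left out. The function names are invented for this example.

```python
def average_hash(pixels, hash_size=8):
    """Return a (hash_size * hash_size)-bit integer fingerprint of a grayscale image.

    `pixels` is a list of rows of grayscale values; every row has the same length,
    and the image must be at least hash_size pixels in each dimension.
    """
    height, width = len(pixels), len(pixels[0])
    cells = []
    for r in range(hash_size):
        r0, r1 = r * height // hash_size, (r + 1) * height // hash_size
        for c in range(hash_size):
            c0, c1 = c * width // hash_size, (c + 1) * width // hash_size
            # Average the block of pixels that falls into this grid cell.
            block = [pixels[y][x] for y in range(r0, r1) for x in range(c0, c1)]
            cells.append(sum(block) / len(block))
    mean = sum(cells) / len(cells)
    bits = 0
    for value in cells:
        # One bit per cell: set if the cell is brighter than the overall mean.
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming_distance(a, b):
    """Number of differing bits between two fingerprints; small means similar."""
    return bin(a ^ b).count("1")
```

Two covers would be treated as likely duplicates when the distance between their fingerprints is below some threshold that you pick by experiment. A resized or slightly brightened copy tends to keep the same fingerprint, while a mirrored image does not, so detecting flips would need extra comparisons.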

BR

Last edited by BetterRed; 10-07-2013 at 08:16 PM.
Old 10-08-2013, 03:35 AM   #430
AnselmD
Member
AnselmD began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Oct 2013
Device: none
Smile

Quote:
Originally Posted by AnselmD
to be able to find duplicate covers would be great.

Thanks to all for your answers.

With your answers I have an idea how to solve my special use case.

I imported the ebooks from Gutenberg. Most of them (thousands) have an identical cover image. I wanted to find these automatically and then find better covers, or at least delete these covers. I do not want to do it manually.

So I think I will do it like this: find all binary duplicates with some external tool, then delete all the duplicate covers. After this, I can find all books which have "no cover" with calibre and do whatever I like.

If the images are not binary duplicates, I will try to install Dup Detector with Wine on Linux.

Thanks a lot!!
Old 10-08-2013, 10:22 AM   #431
theducks
Grand Sorcerer
theducks ought to be getting tired of karma fortunes by now.
Posts: 14,843
Karma: 5654321
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Quote:
Originally Posted by AnselmD
Thanks to all for your answers.

With your answers I have an idea how to solve my special use case.

I imported the ebooks from Gutenberg. Most of them (thousands) have an identical cover image. I wanted to find these automatically and then find better covers, or at least delete these covers. I do not want to do it manually.

So I think I will do it like this: find all binary duplicates with some external tool, then delete all the duplicate covers. After this, I can find all books which have "no cover" with calibre and do whatever I like.

If the images are not binary duplicates, I will try to install Dup Detector with Wine on Linux.

Thanks a lot!!
KISS
Leave the details pane open.

Use Grid View and eyeball Mk I,

Get the "Search the Internet" plugin (PI). Right-click on the 'bad image' (this assumes the PI was set to use the context menu).
I use "Search Google for Images".
When I find a 'good one', I just drag it over the old one on details

Having a 2nd Monitor makes this a LOT easier
Old 10-11-2013, 07:08 AM   #432
BetterRed
null operator
BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 3,611
Karma: 2183656
Join Date: Mar 2012
Location: NSW Australia
Device: none
@kiwidude - for your information/attention - http://www.mobileread.com/forums/sho...d.php?t=224527 - whatever.

ETA : Good news at http://www.mobileread.com/forums/sho...6&postcount=14

BR

Last edited by BetterRed; 10-11-2013 at 04:23 PM. Reason: see ETA
Old 10-13-2013, 09:32 AM   #433
JohnnyBook
Zealot
JohnnyBook for a long time would go to bed early.
 
Posts: 123
Karma: 17646
Join Date: Dec 2008
Location: Out There
Device: K3 WF/3G (dead screen) & K4NT
Quote:
Originally Posted by AnselmD
Thanks to all for your answers.

With your answers I have an idea how to solve my special use case.

I imported the ebooks from Gutenberg. Most of them (thousands) have an identical cover image. I wanted to find these automatically and then find better covers, or at least delete these covers. I do not want to do it manually.

Thanks a lot!!
I have been using a program called "uniquefiler" for years; it is designed for images, but will handle all file types.

It is lightning fast at exact matches (binary duplicates) and duplicate file names, and it will also do image comparisons, finding close matches (typically different sizes of the same picture, cropped pictures, or pictures with slight variations), with the ability to set how close a match (in %) you want to search for.

It's also probably defunct ware, but I am pretty sure it is available out there somewhere.

EDIT: the website is still there. http://www.uniquefiler.com/ I use V1.4
Old 11-01-2013, 05:08 PM   #434
AnselmD
Member
AnselmD began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Oct 2013
Device: none
Quote:
Originally Posted by theducks
KISS
Leave the details pane open.

Use Grid View and eyeball Mk I,

Get the "Search the Internet" plugin (PI). Right-click on the 'bad image' (this assumes the PI was set to use the context menu).
I use "Search Google for Images".
When I find a 'good one', I just drag it over the old one on details

Having a 2nd Monitor makes this a LOT easier
I needed some time to understand "eyeball Mk I".
I tried the Internet PI; it is nice for finding images. But for this special case it is too much work. I put ~15000 books from Project Gutenberg into calibre. Maybe it is better to download everything in bulk and use Grid View and eyeball Mk I to throw away the bad ones. Or generate the covers...
Old 11-01-2013, 05:11 PM   #435
AnselmD
Member
AnselmD began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Oct 2013
Device: none
Quote:
Originally Posted by JohnnyBook
I have been using a program called "uniquefiler" for years; it is designed for images, but will handle all file types.

It is lightning fast at exact matches (binary duplicates) and duplicate file names, and it will also do image comparisons, finding close matches (typically different sizes of the same picture, cropped pictures, or pictures with slight variations), with the ability to set how close a match (in %) you want to search for.

It's also probably defunct ware, but I am pretty sure it is available out there somewhere.

EDIT: the website is still there. http://www.uniquefiler.com/ I use V1.4
This seems to be a nice tool; I will try it (but for another purpose).
BTW, I found 2 groups of identical images. As far as I can remember, the 1st group had a few hundred duplicates, the 2nd more than 9000.
Tags
cross library duplicates, in library duplicates

All times are GMT -4.