Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book General > General Discussions

Notices

Reply
 
Thread Tools Search this Thread
Old 11-03-2010, 09:34 AM   #16
oggelbe2007
Limited Warranty
oggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enough
 
oggelbe2007's Avatar
 
Posts: 89
Karma: 576
Join Date: Jul 2007
Location: North Georgia, USA
Device: A sweet PRS-500, DXG
Ah, it seems that what you really need is some code with Judgemental Heuristics. Which would scan your ebook files (regardless of format, title errors, etc.) and product a list of duplicate ebook files as output.
oggelbe2007 is offline   Reply With Quote
Old 11-03-2010, 09:57 AM   #17
Ken Maltby
Wizard
Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.Ken Maltby ought to be getting tired of karma fortunes by now.
 
Ken Maltby's Avatar
 
Posts: 4,466
Karma: 6900052
Join Date: Dec 2009
Location: The Heart of Texas
Device: Boox Note2, AuraHD, PDA,
Quote:
Originally Posted by oggelbe2007 View Post
Ah, it seems that what you really need is some code with Judgemental Heuristics. Which would scan your ebook files (regardless of format, title errors, etc.) and product a list of duplicate ebook files as output.
The NSA (and perhaps the FBI now a days) could provide such a program,
but wouldn't be inclined to do so. There would not be much of an incentive
for one of their contractors to create such a spin-off product, either.

Luck;
Ken
Ken Maltby is offline   Reply With Quote
Advert
Old 11-03-2010, 10:52 PM   #18
oggelbe2007
Limited Warranty
oggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enoughoggelbe2007 will become famous soon enough
 
oggelbe2007's Avatar
 
Posts: 89
Karma: 576
Join Date: Jul 2007
Location: North Georgia, USA
Device: A sweet PRS-500, DXG
Quote:
Originally Posted by Ken Maltby View Post
The NSA (and perhaps the FBI now a days) could provide such a program,but wouldn't be inclined to do so. There would not be much of an incentive for one of their contractors to create such a spin-off product, either.

Luck;
Ken
They would probably drop by your house/apt and ask you what kind of shenanigans you're up too...but all we need here is a simple adaptive sort;

1) take 2 files: A & B
2) convert them to a common format
3) take a random sentence from file A
4) compare this sentence with all of file B
5) if it hits - remove both files, place in dup file list & log results
6) if it doesn't hit - toss file B & grab another one
7) repeat until finished...or xmas arrives
8) verify and clean the dup file list.
9) bubble sort the remaining file list.
10) Take rest of the day off.
oggelbe2007 is offline   Reply With Quote
Old 11-04-2010, 12:55 AM   #19
wannabee
Media Bloke
wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.wannabee ought to be getting tired of karma fortunes by now.
 
Posts: 2,382
Karma: 113956855
Join Date: Sep 2010
Location: NSW - Australia
Device: iOS
I'll bet some of those books you got rid of were more than 16 years old. That's how long ago I was storing files on removable Syquest and tape drives that I tossed out.

I saw on telly that the museum of technology here has electronic data storage devises that aren't too old but they have no way of accessing the data because there are no readers that read it anymore.

So I hope you start a backup regime to future proof your collection when you finally get it sorted.
wannabee is offline   Reply With Quote
Old 10-19-2011, 03:02 PM   #20
T.D.02809
Enthusiast
T.D.02809 doesn't litterT.D.02809 doesn't litterT.D.02809 doesn't litter
 
T.D.02809's Avatar
 
Posts: 38
Karma: 200
Join Date: Sep 2011
Location: Fort Lauderdale, FL, USA
Device: Kindle 3rd Gen WiFi, Kindle App for MAC
Quote:
Originally Posted by dpayment View Post
Thanks Susan & Worldwalker, both of these answers are excellent, I hadn't thought to use Calibre to do the comparison, it just never occurred to me, but it makes perfect sense. Even if the file titles are different, the metadata should help to identify "most" of the duplicates, if not all.

Thanks again,
Dan
I am unsure if any of you (Dan, Susan,Worldwalker) will see this; however, I want to say Gracias-Thanks for this information. I have hit a similar difficulty. There are duplicate copies of ebooks on my Kindle 3Gen Keyboard model. When I have the time I have said I would organize them. Then I would say, 'another day'. Thanks to this info I am indeed organizing my ereader's ebooks eliminating duplicate copies. Gracias, T.D.02809
T.D.02809 is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Duplicate files on SD card drdman Astak EZReader 6 08-08-2010 07:03 PM
Some basics - duplicate files, filenames clintbradford Kobo Reader 3 07-11-2010 04:18 AM
Duplicate books - multiple formats mranlett Calibre 5 09-26-2009 07:02 AM
Deleting duplicate collections Gazman Introduce Yourself 3 01-25-2009 10:19 AM
Duplicate database files Zach Reading and Management 2 05-31-2005 05:47 AM


All times are GMT -4. The time now is 03:48 PM.


MobileRead.com is a privately owned, operated and funded community.