![]() |
#1 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
Does anyone know of an ebook sourcing search engine?
Let's say I found an ebook somewhere on the Internet. All I have is the filename, the book inside, in some specific format. Or possibly a zip, with a handful of html files inside, which form a single book. I'd like to know if that ebook is an illegally created one, or possibly if it's a legal offering, for example that some site offers it on promotion so I could go there and get it from its real source.
I was thinking there should be a search engine where I could go, upload the book or, as it's usually done, point some web form to it so it can read the file, calculate crc32 or md5 sum from it, and look up those sums in its database, and tell me what is the book's source. Often, when I think of such things, and then look for them, they turn out to be existing sites with big databases already. This time, I could find nothing. Does anybody know of such a site? |
![]() |
![]() |
![]() |
#2 |
Feral Underclass
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,622
Karma: 26821535
Join Date: Jan 2010
Location: Yorkshire, tha noz
Device: 2nd hand paperback
|
MD5, etc wouldn't really work on a pirate ebook because it would change when the DRM restrictions were removed.
|
![]() |
![]() |
![]() |
#3 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
Quote:
Also, in that case the database would just have two entries for that book, one with DRM, the other one after it was removed - or wouldn't include DRMed books at all (I don't know exactly how DRMed books work, it's possible that the resulting DRM-ed file from the same book would be different for every user). Edit: though I remember I read here once about books being offered for free, but with DRM (so requiring to have an account on the site, and often also working credit card, even if there would be no charge for the book). |
|
![]() |
![]() |
![]() |
#4 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,812
Karma: 26912940
Join Date: Apr 2010
Device: sony PRS-T1 and T3, Kobo Mini and Aura HD, Tablet
|
can't you just search for the title/author?
|
![]() |
![]() |
![]() |
#5 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
No, because well-known book would likely return such a number of results that wouldn't be able to go through them all to find any meaningful promotion links, and you wouldn't be able to say if it's available anywhere for free or legally just by going through Google results.
|
![]() |
![]() |
![]() |
#6 |
mrkrgnao
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 241
Karma: 237248
Join Date: May 2010
Device: PRS650, K3 Wireless, Galaxy S3, iPad 3.
|
How fantastic would it be to have an equivalent of ISBN that followed each book on the WWW, even if there were OCR errors or if they were PDFs?
A hypothetical way for the system to work might be how Windows Media Player finds album art for music, whatever its provenance, or those pieces of software that allow you to hum a few bars of a tune and tell you its name and the composer. This is an idea that hadn't occurred to me, and a really interesting basis for a thread. |
![]() |
![]() |
![]() |
#7 |
Home Guard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,730
Karma: 86721650
Join Date: Jun 2007
Location: Alpha Ralpha Boulevard
Device: Kindle Oasis 3G, iPhone 6
|
I seem to remember years ago on the Usenet e-book groups someone wrote software that could tell if two files were actually the same file or from a different source. I'm not sure if he kept a database or not.
|
![]() |
![]() |
![]() |
#8 |
Banned
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,344
Karma: 1028477047
Join Date: Aug 2010
Location: Nueva Andalucía
Device: Sony PRS 650
|
Indeed. I downloaded some free ebooks from Amazon and most have DRM.
|
![]() |
![]() |
![]() |
#9 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
There are many programs that can compare files by content to see if they're identical. If you mean comparing with some kind of database, I'd be happy to find that program and learn if the database was systematically expanded with new ebooks showing up around the Net.
|
![]() |
![]() |
![]() |
#10 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,466
Karma: 6900052
Join Date: Dec 2009
Location: The Heart of Texas
Device: Boox Note2, AuraHD, PDA,
|
Perhaps it is the cynic in me, but I would think the most obvious players that whould try
and compile and keep such a data base would be the legal staff of the DRM proponents. Luck; Ken |
![]() |
![]() |
![]() |
#11 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
Quote:
Even if they did have all illegal books in such database, I'm not sure it would benefit them. If they spot some book on Rapidshare etc., going by a link posted on the Net, they probably can't even ask Rapidshare to check its database for all copies of the ebook on various user accounts and remove them all, because some of those may be legally made backup copies by the original creator of the ebook. (I assume that Rapidshare saves its space, by calculating various checksums internally, and if they have identical 700MB video file on 100 different accounts, they store it only once on their servers). |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Neotake.com: Ebook Search Engine | Neotake | Introduce Yourself | 8 | 05-29-2010 10:24 AM |
Ebook Search Engine | Drezin | News | 2 | 01-05-2010 05:16 PM |
What is a good name for a cross-publisher ebook search engine? | acidzebra | News | 95 | 09-24-2008 09:23 AM |
New eBook Search Engine using Google Technology | chunkabacon | Deals and Resources (No Self-Promotion or Affiliate Links) | 17 | 06-13-2007 07:17 PM |
eBook Search Engine German | TadW | Deals and Resources (No Self-Promotion or Affiliate Links) | 0 | 08-06-2003 02:18 AM |