Thanks, all, for your input. I am building a local database of all 250k+ Amazon Kindle books using the API, so unfortunately downloading individual book samples or looking at file extensions is not an option for me.
@wallcraft: That's valuable info. It looks like Amazon does not provide file size in the Kindle response group in the API. I don't want to crawl the product pages just yet, but that might be the best-effort way to do it eventually. Re DRM: I might also build a database of Mobipocket.com books at some stage, so I will be able to flag at least some books as DRM'd.
I will try to put up interesting statistics about the aggregate dataset soon, e.g. the average/median/max/min price. Do let me know if you are interested in something specific. My dataset is from last week, and I will probably refresh it as needed.
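For anyone curious what such a summary might look like, here is a minimal sketch in Python. It assumes the prices have already been pulled out of the dataset into a plain list of numbers; the function name `price_summary` and the sample values are made up for illustration and are not from the actual data.

```python
from statistics import mean, median


def price_summary(prices):
    """Return average, median, max and min for a non-empty list of prices."""
    if not prices:
        raise ValueError("prices must be non-empty")
    return {
        "average": mean(prices),
        "median": median(prices),
        "max": max(prices),
        "min": min(prices),
    }


# Hypothetical sample values, not real Kindle prices.
sample_prices = [0.99, 9.99, 4.50, 12.00]
summary = price_summary(sample_prices)

for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

The median is worth reporting alongside the average because a handful of very expensive titles can pull the average up noticeably.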
Last edited by anurag; 04-24-2009 at 12:45 AM.