06-25-2008, 03:50 AM | #1 | |
Wizard
Posts: 3,454
Karma: 10484861
Join Date: May 2006
Device: PocketBook 360, before it was Sony Reader, cassiopeia A-20
|
Copyright renewal records for US books online
I found the following information on BoingBoing today:
Quote:
Database download page: Link Last edited by Alexander Turcic; 06-25-2008 at 04:24 AM. Reason: slightly edited for frontpage |
|
06-25-2008, 08:07 AM | #2 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Question: What do I use to view a 56MB XML file?
|
06-25-2008, 08:23 AM | #3 |
eBuchReisender
Posts: 41
Karma: 208
Join Date: May 2008
Location: Münster
Device: Palm Tungsten-E, iLiad
|
Nate the Great, I just downloaded it, unpacked it has a size of 371.6 MB
In case you have some Unix derivate at hand: less google-renewals-all-20080624.xml and I had within a second the first record. Though with all the tags around. cat google-renewals-all-20080624.xml | grep 'Tolkien' and I found pretty quick there are indeed some books with copyright still, now I think some simple xml-viewer for this file is needed. a third check in similar way: cat google-renewals-all-20080624.xml | grep 'Puchstein' gave not a single entry, so for sure this is not a complete catalogue (I wonder how big that file would have to be ...) |
06-25-2008, 11:14 AM | #4 |
Grand Sorcerer
Posts: 19,832
Karma: 11844413
Join Date: Jan 2007
Location: Tampa, FL USA
Device: Kindle Touch
|
|
06-25-2008, 11:14 AM | #5 |
Enthusiast
Posts: 39
Karma: 2434999
Join Date: Sep 2007
Location: Scottsdale, Arizona
Device: Samsung Galaxy Tab S3; also Moto G Stylus phone
|
Stanford did this some time ago, and there's a very nice Web interface:
http://collections.stanford.edu/copy...1A53459D8CD4A9 Most people don't have much call to view 300+ MB databases, so this approach is probably a lot better. I don't know if this is precisely the same database as Google extracted from government records, but it's still very useful. |
06-25-2008, 11:31 AM | #6 | |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Quote:
|
|
06-25-2008, 11:47 AM | #7 |
Grand Sorcerer
Posts: 8,478
Karma: 5171130
Join Date: Jan 2006
Device: none
|
Now, if only the copyright office would allow me to send my latest book in as a digital file, instead of printed pages...
|
06-25-2008, 01:14 PM | #8 | |||
New York Editor
Posts: 6,384
Karma: 16540415
Join Date: Aug 2007
Device: PalmTX, Pocket eDGe, Alcatel Fierce 4, RCA Viking Pro 10, Nexus 7
|
Quote:
Quote:
A Windows console version of grep is available as part of a set of Gnu utilities for Windows, here: http://gnuwin32.sourceforge.net/packages/grep.htm Quote:
______ Dennis |
|||
06-25-2008, 04:47 PM | #9 |
Wizard
Posts: 3,442
Karma: 300001
Join Date: Sep 2006
Location: Belgium
Device: PRS-500/505/700, Kindle, Cybook Gen3, Words Gear
|
EmEditor can work with large files. But I prefer the viewer built in in my FAR Manager - as it doesn't try to load up the whole file for editing, viewing and searching is extremely quick for any file size.
|
06-25-2008, 04:55 PM | #10 | |
New York Editor
Posts: 6,384
Karma: 16540415
Join Date: Aug 2007
Device: PalmTX, Pocket eDGe, Alcatel Fierce 4, RCA Viking Pro 10, Nexus 7
|
Quote:
Open Office Base 2.4 and 3.0 choked trying to import it. I was able to actually open it with a neat freeware product called Henry's Textplorer, but it took a fair bit of time to do it, and searches in the file were likewise slow. (Textplorer is here: http://www.henrykellner.com/Textplorer/index.html) I suppose I could manage to import it to a MySQL or PostgreSQL database, but it isn't worth the trouble. Less and grep will do for the limited uses I have. ______ Dennis |
|
06-26-2008, 01:02 PM | #11 |
Fanatic
Posts: 509
Karma: 1098204
Join Date: Jun 2008
Location: Earth
Device: iPhone5, iPad Gen3, Kobo, Kindle Fire, Kobo Vox. Samsung Galaxy Tab 7
|
Hummmm..at home I have an 8GB Ram/4TB HD MacPro. I'll give it a try after work.
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Copyright Laws Threaten Our Online Freedom | Daithi | News | 70 | 07-14-2009 08:34 PM |
Can we request books out of copyright? | glenn cornish | Sony Reader | 3 | 02-26-2009 05:15 PM |
Sony eBookstore Renewal | jimhen | Sony Reader | 11 | 11-02-2008 03:33 PM |
In Copyright? - Copyright Renewal Database launched | Alexander Turcic | News | 26 | 07-09-2008 09:36 AM |