View Single Post
Old 06-21-2008, 12:45 AM   #1
TedPark
Zealot
TedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankindTedPark has exceeded all limitations known to mankind
 
TedPark's Avatar
 
Posts: 135
Karma: 17148
Join Date: May 2008
Location: California
Device: Sony 505
Project Gutenberg Master List

Well, are any of you as bummed as I am that there is no simple list of authors and titles for PG? Then your foundest wish has come true. I downloaded their XML/RDF catalog and did some serious data mining. I kept only those things that were in English or might be in English. I also eliminated a lot of duplicates and did some other housecleaning. Here is a simple XLS file that you can sift and sort all you want.

PS - Phew, that was some ugly job. They need a good DBA bad. Overall their database is good and righteous, but there are hundreds of records that are in need of various types of repairs.

{sigh} <--- is there a smiley for that?

PPS - I don't know if this is the best forum for this, but I figure one of you SYSOPs can move it if appropriate.
Attached Files
File Type: zip Gutenberg 08-06-20.zip (887.3 KB, 7664 views)

Last edited by TedPark; 06-21-2008 at 01:16 PM. Reason: Added a second "author" tab to spreadsheet
TedPark is offline   Reply With Quote