|  09-08-2008, 02:16 PM | #61 | 
| Actively passive.            Posts: 2,042 Karma: 478376 Join Date: Feb 2008 Location: US Device: Sony PRS-505/LC | 
				
				Database Design
			 
			
			Something like this, where the user could maintain the "Categories" and a many-to-many relationship, so that a book can fit as many categories as the user sees fit. Same with Formats, so that a book can be listed under multiple formats. I see "Genre" as something the user builds, since Publisher genres are either too lose or too specific. "SF" doesn't cut it for me. | 
|   |   | 
|  09-08-2008, 02:29 PM | #62 | |
| Grand Sorcerer            Posts: 19,832 Karma: 11844413 Join Date: Jan 2007 Location: Tampa, FL USA Device: Kindle Touch | Quote: 
 In your case you can have a tags like "read", "toreview", "ondevice". Of course, some tags could be auto created by the software. Like "ondevice" would be a good example. To me this is the most flexible way of indexing. Folder are "nice" but many times something fits in more than one folder... or the folder taxonomy is specific to something... say "genre" but you also want to see all the sf books you need to review... the folders fall down. But, with tags you just choose tags sf and toreview and viola. Of, btw, delicious doesn't have it, but you should be able to negate a tag too. For example, I want to see everything withOUT a read tag to know what I haven't read yet rather than forcing me to create a read and a toread tag. BOb | |
|   |   | 
|  09-08-2008, 02:40 PM | #63 | 
| fruminous edugeek            Posts: 6,745 Karma: 551260 Join Date: Oct 2006 Location: Northeast US Device: iPad, eBw 1150 | 
			
			Tags are good. And they're also what Librarything uses. Which reminds me that I'd like to be able to sync my list of books with Librarything.    | 
|   |   | 
|  09-08-2008, 02:42 PM | #64 | 
| Actively passive.            Posts: 2,042 Karma: 478376 Join Date: Feb 2008 Location: US Device: Sony PRS-505/LC | 
			
			TAGs, whatever, have to be stored in a database. I'm suggesting a database structure.
		 | 
|   |   | 
|  09-08-2008, 02:49 PM | #65 | 
| Connoisseur       Posts: 99 Karma: 657 Join Date: Sep 2008 Location: Slough, UK Device: Sony Reader | 
			
			Ah, give the man a prize; that's precisely what I was thinking. I'm currently launching a messaging service using avatars in my day job, and we've been implementing tags as a means for searching for avatars. It's fast, intuitive, and as long as you're allowing people to make their own and assign them as they will, they're devastatingly useful. Reading the category/searching requirements, it seems to me that tags are definitely a big win. I also like categorising groups of tags, so that they can be arranged in a column view, one tag category per column. Filter row results based on combinations of positive and negative occurrences of a set of tags, and virtually everything we've been talking about here can be done automatically. | 
|   |   | 
|  09-08-2008, 02:50 PM | #66 | 
| fruminous edugeek            Posts: 6,745 Karma: 551260 Join Date: Oct 2006 Location: Northeast US Device: iPad, eBw 1150 | 
			
			I think tags are usually just concatenated in a single database field, freely, and selected on the fly. But I could be mistaken-- perhaps the software parses them and stores them in a separate table, as you suggest. The main thing is to be able to have the many-to-many relationship, as you pointed out. | 
|   |   | 
|  09-08-2008, 02:51 PM | #67 | 
| Connoisseur       Posts: 99 Karma: 657 Join Date: Sep 2008 Location: Slough, UK Device: Sony Reader | |
|   |   | 
|  09-08-2008, 02:55 PM | #68 | 
| Actively passive.            Posts: 2,042 Karma: 478376 Join Date: Feb 2008 Location: US Device: Sony PRS-505/LC | 
			
			http://www.pui.ch/phred/archives/200...e-schemas.html I guess I sketched out the "Toxi" solution referred to in the article. To me, it's basic database normalization. Regardless of the user interface (tag clouds), in the end you want a highly normalized database. Last edited by Taylor514ce; 09-08-2008 at 02:58 PM. | 
|   |   | 
|  09-08-2008, 02:55 PM | #69 | |
| Grand Sorcerer            Posts: 19,832 Karma: 11844413 Join Date: Jan 2007 Location: Tampa, FL USA Device: Kindle Touch | Quote: 
 If you just put them all in one field you have to scan through each record to pull out tags, etc. While indexed records can be retrieved so much faster. BOb | |
|   |   | 
|  09-08-2008, 02:57 PM | #70 | |
| Connoisseur       Posts: 99 Karma: 657 Join Date: Sep 2008 Location: Slough, UK Device: Sony Reader | Quote: 
 This brings up a really good question, btw; what is the general view regarding having a database running to store this metadata, versus having a custom file format, emulating a database? Advantages to the database are speed and ease of development, no proprietary files to decode in case of trouble. But the custom files mean no separate program to run or configure. Which would you all choose? Kovid, I recall you mentioned moving from a database to a file; can I ask what your reasoning was behind that, and how you feel it's worked out for you? | |
|   |   | 
|  09-08-2008, 03:00 PM | #71 | 
| Actively passive.            Posts: 2,042 Karma: 478376 Join Date: Feb 2008 Location: US Device: Sony PRS-505/LC | 
			
			Right: Books, Cat (Tags), and the cross-reference table between them. Just like I sketched above. Grrr.  I've always been a corporate developer, so the question of "use a database or not" never comes up. We always use a database. For an end-user program, I see the dilemma. | 
|   |   | 
|  09-08-2008, 03:01 PM | #72 | |
| fruminous edugeek            Posts: 6,745 Karma: 551260 Join Date: Oct 2006 Location: Northeast US Device: iPad, eBw 1150 | Quote: 
 I remember trying to figure out how Otakuworld had done their "similar titles" search at one point and coming up with this convoluted solution involving bitwise comparisons (because they seemed to have a fixed number of tags). In retrospect, I'm sure they had something much simpler going on than that.   | |
|   |   | 
|  09-08-2008, 03:02 PM | #73 | 
| creator of calibre            Posts: 45,598 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Keep in mind that while user specified tags are great, you also need automatically specified tags from book metadata. This is to cater to users that don't want to go to the trouble of manually tagging every book (title, author,series,isbn etc are examples of "automatic tags"). And in order to do that, you're going to have to write plugins to read metadata from all the ebook formats you want your app to support.
		 | 
|   |   | 
|  09-08-2008, 03:06 PM | #74 | 
| fruminous edugeek            Posts: 6,745 Karma: 551260 Join Date: Oct 2006 Location: Northeast US Device: iPad, eBw 1150 | 
			
			Hm... how about a "statistically improbable phrase" plugin?    | 
|   |   | 
|  09-08-2008, 03:10 PM | #75 | |
| Connoisseur       Posts: 99 Karma: 657 Join Date: Sep 2008 Location: Slough, UK Device: Sony Reader | Quote: 
 If you specify that your books are stored in directory structure platform/author/series/isbn/title, then I could create tag categories for each of those, and automatically populate the specific tags from the specific directories. You're of course absolutely right that I'll need to read metadata from the ebooks themselves in order to more accurately populate certain categories (specifically title, I think, based on what I've seen so far), and of course for those ebooks that aren't in directory hierarchies at all. I was thinking that maybe I should write the metadata readers as separate entities, and put those in the open source community directly; so then we might get to a point where they can be maintained and added to by many people for many different programs, as long as the interface is consistent. | |
|   |   |