View Single Post
Old 08-05-2016, 07:16 PM   #15
DaltonST
Deviser
DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.
 
DaltonST's Avatar
 
Posts: 2,265
Karma: 2091183
Join Date: Aug 2013
Location: Texas
Device: none
@petzi:

What people do often is a function of the total size of their libraries, both individually and collectively, among the other variables people have alluded to.

Attached are examples of Tags from my huge English-language library. I also have 2 Spanish-language libraries. My English Tags were influenced by BISAC codes.

My main English library happens to also be a QuarantineAndScrub library, so I use Q&S to clean all of my books (including their tags, but the other metadata too) that I acquired prior to creating Q&S (which was the majority of them). I also use Q&S on new books in a transient Q&S library before moving them to my main library. 2 years after building Q&S, I am not yet finished with my backlog of legacy books needing attention. New ones go in clean, though.

For my Spanish libraries, I use CALM since the Tag tools in CALM are a port of the most bang-for-the-buck Tag tools in Q&S (I did not port the "minimize Tags using Priorities" tool, for example). I consolidate the 2 Spanish libraries in CALM, standardize their Tags in CALM, and then flush all of the finalized Tags back to their Source Libraries for a complete refresh.

I also use my Library Codes plug-in to extract (for Factual, anyway) the Dewey Decimal Codes and the Library of Congress Codes. From those codes I get the DDC 'genre' if it is available. Otherwise, I get the LCC 'genre' if it is available.
Collectively, DDC, LCC and the 'genres' derived from one of them are another "tag" of sorts.

If I had only a few thousand books, I would never have built Q&S or CALM.

With a huge number of books, a table-driven approach using Regular Expressions, substitutions, deletions, and other automated techniques was the most efficient approach for me to take. I built those tools for me, and just shared them to save a lot of people a lot of their lifespan cleaning book metadata. Life is too short.


Good luck.



DaltonST
Attached Thumbnails
Click image for larger version

Name:	tags_factual.jpg
Views:	256
Size:	199.6 KB
ID:	150704   Click image for larger version

Name:	tags_fiction.jpg
Views:	254
Size:	196.0 KB
ID:	150705  
DaltonST is offline   Reply With Quote