MobileRead Forums - View Single Post

DaltonST · 08-05-2016, 07:16 PM

@petzi:

What people do often is a function of the total size of their libraries, both individually and collectively, among the other variables people have alluded to.

Attached are examples of Tags from my huge English-language library. I also have 2 Spanish-language libraries. My English Tags were influenced by BISAC codes.

My main English library happens to also be a QuarantineAndScrub library, so I use Q&S to clean all of my books (including their tags, but the other metadata too) that I acquired prior to creating Q&S (which was the majority of them). I also use Q&S on new books in a transient Q&S library before moving them to my main library. 2 years after building Q&S, I am not yet finished with my backlog of legacy books needing attention. New ones go in clean, though.

For my Spanish libraries, I use CALM since the Tag tools in CALM are a port of the most bang-for-the-buck Tag tools in Q&S (I did not port the "minimize Tags using Priorities" tool, for example). I consolidate the 2 Spanish libraries in CALM, standardize their Tags in CALM, and then flush all of the finalized Tags back to their Source Libraries for a complete refresh.

I also use my Library Codes plug-in to extract (for Factual, anyway) the Dewey Decimal Codes and the Library of Congress Codes. From those codes I get the DDC 'genre' if it is available. Otherwise, I get the LCC 'genre' if it is available.
Collectively, DDC, LCC and the 'genres' derived from one of them are another "tag" of sorts.

If I had only a few thousand books, I would never have built Q&S or CALM.

With a huge number of books, a table-driven approach using Regular Expressions, substitutions, deletions, and other automated techniques was the most efficient approach for me to take. I built those tools for me, and just shared them to save a lot of people a lot of their lifespan cleaning book metadata. Life is too short.

Good luck.

DaltonST

08-05-2016, 07:16 PM	#15
DaltonST Deviser Posts: 2,265 Karma: 2091183 Join Date: Aug 2013 Location: Texas Device: none	@petzi: What people do often is a function of the total size of their libraries, both individually and collectively, among the other variables people have alluded to. Attached are examples of Tags from my huge English-language library. I also have 2 Spanish-language libraries. My English Tags were influenced by BISAC codes. My main English library happens to also be a QuarantineAndScrub library, so I use Q&S to clean all of my books (including their tags, but the other metadata too) that I acquired prior to creating Q&S (which was the majority of them). I also use Q&S on new books in a transient Q&S library before moving them to my main library. 2 years after building Q&S, I am not yet finished with my backlog of legacy books needing attention. New ones go in clean, though. For my Spanish libraries, I use CALM since the Tag tools in CALM are a port of the most bang-for-the-buck Tag tools in Q&S (I did not port the "minimize Tags using Priorities" tool, for example). I consolidate the 2 Spanish libraries in CALM, standardize their Tags in CALM, and then flush all of the finalized Tags back to their Source Libraries for a complete refresh. I also use my Library Codes plug-in to extract (for Factual, anyway) the Dewey Decimal Codes and the Library of Congress Codes. From those codes I get the DDC 'genre' if it is available. Otherwise, I get the LCC 'genre' if it is available. Collectively, DDC, LCC and the 'genres' derived from one of them are another "tag" of sorts. If I had only a few thousand books, I would never have built Q&S or CALM. With a huge number of books, a table-driven approach using Regular Expressions, substitutions, deletions, and other automated techniques was the most efficient approach for me to take. I built those tools for me, and just shared them to save a lot of people a lot of their lifespan cleaning book metadata. Life is too short. Good luck. DaltonST Attached Thumbnails