For my personal AO3 library, after I stripped all the rating, warning, category, and character tags, there are more than 6000 distinct tags. Canonicalizing them reduces this to more than 5000. Dropping tags that only appear once reduces this to 1277 tags, which is still probably too many collections for a Kobo to handle, I'm assuming? I'm going to look at how much I lose by starting to less-frequent tags, though I'm curious if anyone has other ideas for reducing the number of tags with minimal loss of information.
|