Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Library Management


Thread Tools Search this Thread
Old 03-01-2013, 06:05 PM   #1
Sidetrack began at the beginning.
Posts: 39
Karma: 10
Join Date: Jan 2009
Location: South Pacific
Device: Kindle DX
Pruning redundant and partially redundant tags

I've been using the goodreads metadata download plugin to map tags to a hierarchy I like, and now I'd like to prune some of the redundant information out of the rest of my tags. So I'm looking for an elegant solution. I'm getting there with the regex replacement, but as stated, any more elegant solutions would be appreciated. I'm a little stumped on how to search for books that have redundant info on something better than a case-by-case basis.


foo.fie, foo.fie.fum, foo, fum fie would become simply: foo.fie.fum
fiction, genre.crime, genre.mystery, genre.mystery.hard-boiled, crime, mystery, mystery & detective, hardboiled mystery
would become
genre.crime, genre.mystery.hardboiled

my regex is similar to this, though I've got a bit of a mishmash going with special cases:
template {tags} (\.[^\.,]+)(.*, )?([^,\.]*)\1; \1\2

I have to use separate search terms if the offending tags sort alphabetically before the genre tags

Any ideas on how to search for or otherwise identify books with partially redundant tags? Maybe a calculated column? How about some cleaner more robust replacement terms?

One other thing that bugs me is when I get info like the author's name or publisher mixed in as a tag when I've already got that information in it's appropriate column.
Sidetrack is offline   Reply With Quote

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Redundant topic line Steven630 Recipes 6 06-22-2012 01:43 PM
bad / redundant html ? cybmole Calibre 0 12-29-2010 12:49 PM
Redundant/Invalid TOC entries Stinger Kobo Reader 4 06-26-2010 10:02 PM
Not to be obnoxiously redundant but can we have a jetBook forum? wodin Feedback 7 05-25-2009 04:41 PM
Redundant collections after using calibre Yarrow Calibre 0 12-25-2008 05:30 PM

All times are GMT -4. The time now is 04:12 AM. is a privately owned, operated and funded community.