A SpamAssassin-style, score-based system may be more successful than the current best-match algorithm, but getting the scores right would take a lot of testing and tweaking.
Changing the algorithm to only check for covers of a subset of matched books is a good idea.
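As a rough illustration of how the two ideas could fit together, here is a minimal sketch: each candidate book gets points from a few weighted rules, and only a short list of the highest-scoring candidates would go on to a cover check. Everything in it is an assumption made for illustration, not the existing code: the `Candidate` fields, the rule weights, the `threshold` and `limit` values, and the function names are all hypothetical, and the weights are exactly the part that would need the testing and tweaking.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    # Hypothetical record for a book; real data would have more fields.
    title: str
    author: str
    isbn: str = ""


# Hypothetical rule weights; these are the values that would need tuning.
WEIGHTS = {"isbn": 5.0, "title": 3.0, "author": 2.0}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def score(query: Candidate, candidate: Candidate) -> float:
    """Add up the weights of every rule that fires for this candidate."""
    total = 0.0
    if query.isbn and query.isbn == candidate.isbn:
        total += WEIGHTS["isbn"]
    if normalize(query.title) == normalize(candidate.title):
        total += WEIGHTS["title"]
    if normalize(query.author) == normalize(candidate.author):
        total += WEIGHTS["author"]
    return total


def shortlist(query, candidates, threshold=3.0, limit=3):
    """Return up to `limit` (score, candidate) pairs scoring at least `threshold`.

    Only these would be passed on to the (more expensive) cover check.
    """
    scored = [(score(query, c), c) for c in candidates]
    kept = [pair for pair in scored if pair[0] >= threshold]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return kept[:limit]
```

Unlike a single best match, the scores also show how close the runners-up were, which helps when deciding where to put the threshold.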