@Krogenar - welcome to MobileRead.
I vaguely recall something similar being asked on these forums six months or so ago. The problem is that it is virtually impossible to define the "rules" for a "score" that has any meaning. Suggestions for any rules were conspicuously absent from your post
Personally I think that deciding which book to keep is an incredibly subjective process that only manual eyeballing can resolve. There are perhaps some "simple" rules that could be automated, but the amount of effort to do so in proportion to the amount of other factors that cannot makes it not worthwhile imho. For instance, some of the factors I use include:
- Have quotes been lost from a conversion due to an encoding issue (but defining what constitutes a quote is itself non-trivial)
- Emphasis present in text like italics (which not all books use)
- Paragraph structure (obvious PDF conversions, spacing missing near italics etc)
- Presence of images like maps, chapter headings etc
Then you have the issue of different formats etc - you might have a good quality LIT from a different source than the EPUB version. It all just gets too hard too darn quickly.