View Single Post
Old 03-09-2015, 04:20 AM   #522
odinokij
Enthusiast
odinokij began at the beginning.
 
Posts: 29
Karma: 10
Join Date: Jul 2012
Device: Kindle 3
Quote:
Originally Posted by BetterRed View Post
@Odinokij - search this thread for 'articles', I get the impression the PI will detect articles if tweaks and language settings are set appropriately. The tweak - "Set the list of words considered to be "articles" for sort strings" appears to be the 'key'.

BR
Thanks for your answer BetterRed, but...

In the calibre configuration i've got the default value for "per_language_title_sort_articles" (for spanish: 'spa': ('El\\s+', 'La\\s+', 'Lo\\s+', 'Los\\s+', 'Las\\s+', 'Un\\s+', 'Una\\s+', 'Unos\\s+', 'Unas\\s+') ) that may be considered correct (more or less)

But the Find Duplicate plugin doesn't detect duplicity for "El tercer hombre" vs "Tercer hombre" (both with the same author, and language:Español (spanish)) using "fuzzy-fuzzy" nor soundex(6)-soundex(8).

If I test with the books "The third man" and "Third man" (both with same author and language:Inglés (english) the plugin detects the duplicity in fuzzy-fuzzy mode.

Thank you for your help,
Odinokij
odinokij is offline   Reply With Quote