Hmmm... looking at the djvu1txt.txt file, there are 43 instances of "GRAND LAROUSSE DE LA LANGUE FRANÇAISE", looking at the newTSV.txt file, there are 42 instances of that phrase. This strongly suggests that this is an attempt to pirate the Grand Larousse de la Langue Française dictionary.
Sorry but I'm out of this discussion.
|