It seems if they can make applications that can listen to songs and identify a song based on wave frequencies we should have an easier time doing it for books since we are working with data that we can logically see and read. Punctuation would be used in the identification of a sentence or paragraph not in the hash code. While it is true that if you have two books containing the same sentence if an anthology or story in your example we might have an issue. Some books might not have a TOC like a text file.
|