Yes, it should be rare that the same word would appear in two-different languages side by side.
What most people do not recognize is that once parsed, the text sits at the end of the node chain and so search must be used. Character offsets to specify a file position are simply messed up as soon as a replacement or change is performed.
Even a epub3 cfi needs and uses search/pattern matching to identify text segments.
|