The problem is that I am currently working with a pdf in a foreign language. Converting to text turns the accents and foreign characters into garbledy-gook!
Plus, my
real interest is to start generating a Table of Contents for these rather long documents I have to carry around with me. I use the Table a lot to hop back and forth through what can be hundreds of pages.
And the whole "
structure detection" language of XPath is really very daunting!
For now, I can search for the exact word-string in the chapter title and leave a bookmark ("table-as-I-go")...