Yep - I'd read those pages, I also understand HTML and, to a lesser extent, XML. Problem is I'm trying to write small bits of code in Calibre using a language I don't know (XPATH) to process the contents of a file I can't see the contents of (PDF) and for some strange reason I seem to be having problems
If I could just view the text then I could write a little program to stitch things back together. Are there converters that perhaps perform OCR on the PDF and just output the text?
Mike