@ babarous
Try exporting to txt or rtf, the output might need more touching up/proof reading, but it's worth the try...
(For what I know, Acrobat does some sort of OCR on the scans in order to generate TXT, so text recognisation depends heavily on the quality of the scan)
I found both PDF to DOC to HTML as well as PDF to HTML to "messy", cleaning the REFLOW-Code to me is much easier.
|