Manichean,
Thanks for putting me on the right path....adding
<a href.*</a>
to the header/footer removal sections ripped out all the HTML links and the documents only suffered a little - table of contents is missing in action but I can live with that. As a former software engineer I am surprised I had forgotten how truly ugly regex syntax was until I started doing it again tonight!
|