re pdftohtml - does this extact embedded images? Last time I tried the 0.39 Windows command line tool it only extracted text (in simple mode). Complex mode converted to png but for final conversion to lrf that wasn't too useful for me. All formatting, headings, document structure was lost as well.
Darren
|