Thanks Kovid.
I had started doing some testing with pyPdf (and now podofo as well; for the latter I am obviously doing nothing but calling open at this point). Interestingly, unlike pdftohtml & pdfreflow, both pyPdf and podofo choke on an epub which has security applied to it. pyPdf throws a DRMError. podofo spits out an *enormous* number of console lines, all looking like this:
3 m_nPredictor=12 m_nCurPredictor=12
before throwing an exception of: ePdfError_UnsupportedFilter
Any way of preventing the console output?
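One generic workaround I could try in the meantime is to silence the output at the file-descriptor level. This is only a sketch, and it assumes the messages are written by the underlying C++ library straight to the process's stdout/stderr (which I have not confirmed). The helper name `suppress_fd_output` is my own invention, not something from calibre or podofo.

```python
import os
import sys
from contextlib import contextmanager


@contextmanager
def suppress_fd_output(fds=(1, 2)):
    """Temporarily point the given OS-level file descriptors at os.devnull.

    Defaults to 1 (stdout) and 2 (stderr). Redirecting sys.stdout alone
    would not help if the output comes from native code, hence os.dup2.
    """
    # Flush Python-level buffers so earlier output is not lost or reordered.
    sys.stdout.flush()
    sys.stderr.flush()
    saved = [os.dup(fd) for fd in fds]  # copies of the original targets
    devnull = os.open(os.devnull, os.O_WRONLY)
    try:
        for fd in fds:
            os.dup2(devnull, fd)  # fd now writes into the void
        yield
    finally:
        for fd, copy in zip(fds, saved):
            os.dup2(copy, fd)  # restore the original descriptor
            os.close(copy)
        os.close(devnull)
```

The idea would be to wrap just the noisy call, e.g. `with suppress_fd_output(): ...call open here...`, so the exception still propagates but the console stays quiet. The obvious downside is that anything else printed inside the block is hidden as well.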
Still working out which combination is the least evil.