Thanks for all the assistance and comments.
I ran in debug mode and found that the parser was throwing an error while processing the header information. The files have a large embedded CSS block along with a lot of other code that isn't needed for this conversion (approximately 1200 lines of code/CSS).
So I found a nice basic data parsing tool and wrote a simple script to extract just the text of the page.
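For anyone hitting the same problem, here is a minimal sketch of that kind of extraction script. It assumes Python and its standard-library `html.parser` module, which is not necessarily the tool mentioned above; the names `TextExtractor` and `extract_text` are made up for illustration. It drops everything inside `<style>` and `<script>` and keeps only the text.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collect page text, skipping <style>/<script> bodies."""

    SKIPPED_TAGS = {"style", "script"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0   # > 0 while inside a skipped element
        self._chunks = []      # non-empty text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside embedded CSS or script code.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def extract_text(html):
    """Return the text of an HTML string, one fragment per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Calling `extract_text()` on the contents of each file and writing the result to a plain-text file should give the converter a much smaller input to work with.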
Everything looks good now.
Thanks again