dwangthny:
I tried your suggestion. The debug input folder had two HTML files in it. One, debug-raw.html, has the entire file and is 388k, but the other, testfile.html, is only 82k and has the text chopped short.
As I said earlier, I'm HTML challenged, but I can't see anything in the HTML code that should cause the file to stop importing. At the point the text stops, the raw file looks just like it does everywhere else to me.
I'm including an attachment with two clips from the HTML files. The first is from the raw file, starting at the paragraph before the error and continuing a couple of paragraphs into the missing text. The second clip starts at the same place in the abbreviated file and continues into part of the error area, which is just a continuous series of </span> tags to the end of the file.
Is there something in the code that I'm missing?