Try enabling the preprocess option under structure detection. From your description of the behavior, I think the file has hard line breaks embedded in it rather than page breaks. That option will try to fix some of those issues.
If that doesn't work, please create a bug report and attach the file. I've been testing lately with a variety of LIT files to improve preprocessing on mediocre or bad files, and this sounds like it might be a useful example.