View Single Post
Old 05-12-2011, 02:07 PM   #21
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
I've looked at the new engine - it's got a lot of potential. Vertical and horizontal positional information is retained so paragraphs can be detected through indents and other tests (though none of those tests are done now). Header and footer removal will also become trivial as it can be done based on position on the page. Last time I looked at it though I couldn't quite figure out the logic as the reflow function covers single column and two column unwrapping in the same function.

@kiwidude, the specific problem in your example is that punctuation at the end of a line is a full stop - since the current engine loses all positional information including indents punctuation is all we've got. If a line in the middle of a paragraph ends in with a full stop punctuation element then the paragraphs will be split there.
ldolse is offline   Reply With Quote