06-03-2009, 12:34 PM   #22
DaleDe
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
Quote:
Originally Posted by HarryT
Because people can easily do "pattern recognition" tasks which are extremely difficult for a computer.

You could say "If a line starts in a capital letter then it's probably a new paragraph", I suppose. It wouldn't be 100% reliable, but it would be a good start.
A previous line that is shorter than usual would also be a clue. Unfortunately, line length has become a rather poor indicator, because the original text often assumed a mono-spaced font, which is almost never the case today. But if you assume the text was mono-spaced, you can count characters and determine whether the first word of the next line would have fit on the previous line; if it would have, the break was probably deliberate. That is certainly beyond simple regular expressions.
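
For what it's worth, here is a rough Python sketch of that character-counting check combined with the capital-letter clue. It is only an illustration under some assumptions: the text was hard-wrapped for a mono-spaced font, the wrap width can be guessed from the longest line, and the function name find_paragraph_breaks and its width parameter are made up for this example.

```python
def find_paragraph_breaks(lines, width=None):
    """Return indices of lines that probably start a new paragraph.

    Assumes the text was hard-wrapped for a mono-spaced font. If no
    width is given, the longest line is used as a guess at the wrap
    column (an assumption, not a measured value).
    """
    stripped = [ln.rstrip() for ln in lines]
    if width is None:
        width = max((len(ln) for ln in stripped), default=0)

    breaks = []
    for i in range(1, len(stripped)):
        prev, cur = stripped[i - 1], stripped[i]
        if not cur.strip():
            continue  # blank lines never start a paragraph themselves
        if not prev.strip():
            breaks.append(i)  # a blank line before is an explicit break
            continue
        first_word = cur.split()[0]
        # Would the next word (plus one space) have fit on the previous line?
        would_have_fit = len(prev) + 1 + len(first_word) <= width
        starts_upper = cur.lstrip()[0].isupper()
        # Both clues together: the line ended early AND the next starts capitalised.
        if would_have_fit and starts_upper:
            breaks.append(i)
    return breaks


sample = [
    "The quick brown fox jumps over the",
    "lazy dog and keeps on running far.",
    "It stops.",
    "Then a new paragraph begins right",
    "here and wraps again.",
]
print(find_paragraph_breaks(sample))
```

Note that "It stops." starts with a capital letter but is not flagged, because "It" would not have fit on the full line before it. Like the capital-letter rule on its own, this won't be 100% reliable, and it breaks down when the source was not actually wrapped at a fixed column.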

Dale