06-05-2009, 09:26 AM   #36
ahi
Wizard
Posts: 1,790
Karma: 507333
Join Date: May 2009
Device: none
Quote:
Originally Posted by rogue_ronin
Did I miss something? In another thread?

m a r
I don't think you missed anything, "mjh215".

I should note though that I am 99% certain that my previously described methodology would also have worked on this file.

Since the need ceased, I didn't force myself to figure out that Python script yesterday after all, but when I have it, I will post it. I think it will be remarkably good at removing line breaks while preserving paragraph breaks, (almost) regardless of what format is used.

In the case of Gideon's file, 1001 (space + return) is the second most common whitespace weight, which would have identified it as the line-break sequence, while 1000 (return only), being the third most common, would have been identified as the paragraph-break sequence.
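
To make the ranking idea concrete, here is a minimal Python sketch. It is not the script mentioned above, just an illustration under a few assumptions: each space weighs 1 and each return weighs 1000 (inferred from the 1001 and 1000 figures), returns have already been normalized to "\n", and the most common weight is the ordinary single space between words. The function names `weight`, `rank_whitespace`, and `unwrap` are made up for this example.

```python
import re
from collections import Counter

# Assumed weights, inferred from "1001 (space + return)" and "1000 (return only)".
WEIGHTS = {" ": 1, "\n": 1000}
WHITESPACE_RUN = re.compile(r"[ \n]+")


def weight(run):
    """Sum the assumed weights of every character in one whitespace run."""
    return sum(WEIGHTS[ch] for ch in run)


def rank_whitespace(text):
    """Return the whitespace weights found in text, most common first."""
    counts = Counter(weight(run) for run in WHITESPACE_RUN.findall(text))
    return [w for w, _ in counts.most_common()]


def unwrap(text):
    """Join hard-wrapped lines while leaving paragraph breaks alone.

    The second most common weight is treated as the line-break sequence
    and the third most common as the paragraph-break sequence.
    """
    ranked = rank_whitespace(text)
    if len(ranked) < 3:
        return text  # not enough distinct sequences to tell them apart
    linebreak_weight = ranked[1]

    def replace(match):
        run = match.group(0)
        # Only the line-break sequence is rewritten; paragraph breaks and
        # ordinary word spacing pass through untouched.
        return " " if weight(run) == linebreak_weight else run

    return WHITESPACE_RUN.sub(replace, text)
```

On a file shaped like Gideon's, `rank_whitespace` would return a list starting `[1, 1001, 1000]`, and `unwrap` would turn every space + return into a single space while keeping the bare returns as paragraph breaks. Ties between equally common weights are not handled here, so a real script would need a tie-breaking rule.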

- Ahi