Opinion Artiste
Posts: 301
Karma: 61464
Join Date: Mar 2009
Location: Albany, OR
Device: Nexus 5, Nexus 7, Kindle Touch, Kindle Fire
|
First, a Project Gutenberg TXT file with line breaks:
Quote:
Analyzing text...
[1]: Starting on Mon, 14 Sep 2009 07:07:51
[1]: Simplifying linebreaks...
[1]: Analyzing whitespace patterns...
[1]: 100.0% processed (0.6 MB of 0.6 MB)
[1]: Finished on Mon, 14 Sep 2009 07:08:03
[1]: Filesize:
[1]: 658765
[1]: Whitespace analysis:
[1]: [(104865, 0.0), (8362, 8.0), (2689, 16.0), (41, 24.0), (31, 32.0), (28, 1.0), (20, 9.0), (13, 10.0), (3, 2.0), (3, 17.0), (2, 40.0), (1, 88.0)]
[1]: 1591.84231099
[1]: 126.934491055
[1]: 40.8188048849
[1]: 0.622376720075
[1]: 0.470577520056
[1]: ... appears to be a file with line-breaks.
|
Then, another TXT file without line breaks, entire paragraph on each line:
Quote:
[1]: Analyzing text...
[1]: Starting on Mon, 14 Sep 2009 07:11:57
[1]: Simplifying linebreaks...
[1]: Analyzing whitespace patterns...
[1]: 100.0% processed (0.3 MB of 0.3 MB)
[1]: Finished on Mon, 14 Sep 2009 07:11:57
[1]: Filesize:
[1]: 317441
[1]: Whitespace analysis:
[1]: [(55965, 0.0), (2342, 9.0), (186, 8.0), (19, 17.0), (5, 11.0), (3, 16.0), (1, 12.0), (1, 55.0)]
[1]: 1763.00477884
[1]: 73.7774893602
[1]: 5.85935654185
[1]: 0.598536420941
[1]: 0.157509584458
[1]: ... appears to be a file with paragraph breaks.
|
Then one RTF file:
Quote:
[1]: Analyzing text...
[1]: Starting on Mon, 14 Sep 2009 07:13:23
[1]: Simplifying linebreaks...
[1]: Analyzing whitespace patterns...
[1]: 100.0% processed (1.1 MB of 1.1 MB)
[1]: Finished on Mon, 14 Sep 2009 07:13:25
[1]: Filesize:
[1]: 1174398
[1]: Whitespace analysis:
[1]: [(201159, 0.0), (6815, 16.0), (58, 32.0), (15, 1.0), (5, 8.0), (3, 80.0), (2, 48.0), (2, 40.0), (1, 64.0), (1, 24.0)]
[1]: 1712.86906143
[1]: 58.0297309771
[1]: 0.493870050869
[1]: 0.127725013156
[1]: 0.0425750043852
[1]: ... appears to be a file with paragraph breaks.
|
And then a second RTF file:
Quote:
[1]: Analyzing text...
[1]: Starting on Mon, 14 Sep 2009 07:14:20
[1]: Simplifying linebreaks...
[1]: Analyzing whitespace patterns...
[1]: 100.0% processed (1.2 MB of 1.2 MB)
[1]: Finished on Mon, 14 Sep 2009 07:14:22
[1]: Filesize:
[1]: 1209776
[1]: Whitespace analysis:
[1]: [(187376, 0.0), (16677, 16.0), (128, 65.0), (85, 32.0), (39, 49.0), (31, 81.0), (12, 48.0), (3, 114.0), (3, 97.0)]
[1]: 1548.84871249
[1]: 137.851965984
[1]: 1.05804710955
[1]: 0.702609408684
[1]: 0.32237372869
[1]: ... appears to be a file with paragraph breaks.
|
|