View Single Post
Old 07-19-2011, 01:21 AM   #75
therealjoeblow
Zealot
therealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfoldedtherealjoeblow reads XML... blindfolded
 
Posts: 106
Karma: 52102
Join Date: Jun 2010
Device: Samsung Android Tablet w/Moon+ Pro Reader
Quote:
Originally Posted by burbleburble View Post
Ebook Cleaner

About:
Many ebooks have messy and inconsistent formatting.
  • <snip>
  • Broken paragraphs/sentences, missing punctuation...


Plans:
  • <snip>
  • a spell checker using heuristics to avoid wasting time on names and places created for that book
  • a punctuation checker finding broken paragraphs/sentences/punctuation - (the ones guarenteed needing you attention, not every possible grammer...)
I am *REALLY* looking forward to trying this out when the features noted above are working. Personally, I could care less about the rest of the features as I've figured out how to manually fix most of them relatively easy with notepad++, but the punctuation, broken paragraphs and general spelling mistakes from bad OCR are killing me!

Cheers,
The REAL Joe
therealjoeblow is offline   Reply With Quote