01-03-2010, 05:37 AM   #32
barnacle
Enthusiast
Posts: 28
Karma: 54
Join Date: Dec 2009
Device: Sony Pocket
Quote:
Originally Posted by HarryT
Fair enough; my standard of proof-reading (for the things that are important to me, like my Dickens novels) is "every comma in the right place".
And why would you want anything less?

Mind you, it gets complicated: I, for example, am jumping through some serious hoops simply to try and automatically remove hyphens... it's not as straightforward as it seems, and it's arguable (from the point of view of an OCR program) that they should remain in their original position. But I want the little devils out!
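To show why it isn't straightforward, here's a rough sketch of the sort of thing I mean, not a finished tool. It assumes the hyphens in question are the ones splitting a word across a line end, and it leans on a hypothetical word list (`known_words`, which you'd have to supply yourself) to decide whether the joined word is a real one. Both the function name and that word list are made up for illustration.

```python
import re

# Matches a word fragment, a hyphen at the very end of a line,
# and the fragment that starts the next line.
LINE_END_HYPHEN = re.compile(r"(\w+)-\n(\w+)")


def dehyphenate(text, known_words):
    """Rejoin words split by a hyphen at a line end.

    known_words is an assumed, caller-supplied set of lowercase words.
    If the joined form is in the set, the hyphen is dropped; otherwise
    the hyphen is kept on the guess that it is a genuine compound.
    Either way the line break at that point is removed.
    """
    def join(match):
        left, right = match.group(1), match.group(2)
        if (left + right).lower() in known_words:
            return left + right
        return left + "-" + right

    return LINE_END_HYPHEN.sub(join, text)


if __name__ == "__main__":
    sample = "not as straight-\nforward as the well-\nknown case"
    print(dehyphenate(sample, {"straightforward"}))
```

The catch is obvious once you write it down: the result is only as good as the word list, and a compound that happens to break at its own hyphen looks exactly like a split word.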

What I want:
- All words the right word
- All words spelled correctly
- All words in the original case and properly capitalised
- All punctuation marks (English style - I'll worry about other scripts some other time!) present and correct
- All hyphens removed

What I don't want:
- Font size or style information (that's a job for the display device)
- Layout specifications (so's that)
- Blank lines between each carriage return (save as a scene break marker)
- Carriage returns at the end of each line (what were you *thinking*, Gutenberg?)
- Page numbers
- Titles, periodical name, author's name, or other page matter
- Illustrations

In brief, I want a structured document, not a formatted one.
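For the carriage return and page number items, a minimal sketch of the clean-up might look like the following. The assumptions are mine: a blank line marks a paragraph boundary (as in the usual plain-text e-book layout), and any line holding nothing but digits is a page number. The function name is made up, and scene breaks, running titles and the like are not handled at all.

```python
def unwrap(text):
    """Turn hard-wrapped text into one line per paragraph.

    Assumptions (not universal): a blank line ends a paragraph, and a
    line containing only digits is a page number to be discarded.
    """
    paragraphs = []
    current = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.isdigit():
            # Bare page number: drop it and carry on.
            continue
        if not stripped:
            # Blank line: close the paragraph collected so far, if any.
            if current:
                paragraphs.append(" ".join(current))
                current = []
            continue
        current.append(stripped)
    if current:
        paragraphs.append(" ".join(current))
    # One carriage return per paragraph, no blank lines in between.
    return "\n".join(paragraphs)


if __name__ == "__main__":
    wrapped = "It was the best\nof times.\n\n12\n\nIt was the worst\nof times.\n"
    print(unwrap(wrapped))
```

What comes out is just paragraphs, one per line, with how they get displayed left to the device.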

Neil