Quote:
Originally Posted by HarryT
Fair enough; my standard of proof-reading (for the things that are important to me, like my Dickens novels) is "every comma in the right place".
And why would you want anything less?
Mind you, it gets complicated: I, for example, am jumping through some serious hoops simply to try and automatically remove hyphens... it's not as straightforward as it seems, and it's arguable (from the point of view of an OCR program) that they should remain in their original position. But I want the little devils out!
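To give a feel for why the hyphens are awkward, here is a rough sketch of one way to do it, not the method I'm actually using. It assumes the hyphens in question are the ones that split a word across a line break, and it uses a tiny made-up word list (`KNOWN_WORDS`, purely hypothetical) where a real dictionary would go. The function name is my own invention too.

```python
import re

# Hypothetical stand-in for a real dictionary of known words.
KNOWN_WORDS = {"original", "position", "straightforward"}

# A word fragment, a hyphen, a line break, then the rest of the word.
_LINE_BREAK_HYPHEN = re.compile(r"(\w+)-\n(\w+)")


def join_line_break_hyphens(text, known_words=KNOWN_WORDS):
    """Rejoin words split across a line break.

    If the joined form is a known word, the hyphen is dropped.
    Otherwise the hyphen is kept, on the assumption that it belongs
    to a genuine compound such as "well-known".
    """
    def repl(match):
        head, tail = match.group(1), match.group(2)
        joined = head + tail
        if joined.lower() in known_words:
            return joined
        return head + "-" + tail

    return _LINE_BREAK_HYPHEN.sub(repl, text)


sample = "the origi-\nnal posi-\ntion of a well-\nknown word"
result = join_line_break_hyphens(sample)
```

The catch is the dictionary lookup: any word the list doesn't know keeps its hyphen, and a compound whose joined form happens to be a real word loses it, which is exactly why this can't be fully automatic.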
What I want:
- All words the right word
- All words spelled correctly
- All words in the original case and properly capitalised
- All punctuation marks (English style - I'll worry about other scripts some other time!) present and correct
- All hyphens removed
What I don't want:
- Font size or style information (that's a job for the display device)
- Layout specifications (so's that)
- Blank lines after each carriage return (save as a scene break marker)
- Carriage returns at the end of each line (what were you *thinking*, Gutenberg?)
- Page numbers
- Titles, periodical name, author's name, or other page matter
- Illustrations
In brief, I want a structured document, not a formatted one.
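As an illustration of the carriage-return complaint, here is a minimal sketch that undoes hard line wrapping. It assumes a blank line marks a paragraph break and that every other line break is just wrapping; it makes no attempt to preserve scene break markers, and the function name is my own.

```python
import re


def unwrap_paragraphs(text):
    """Join hard-wrapped lines so each paragraph ends up on one line."""
    # Normalise Windows and old Mac line endings first.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # One or more blank lines separate paragraphs (an assumption).
    paragraphs = re.split(r"\n\s*\n", text.strip())
    unwrapped = []
    for paragraph in paragraphs:
        lines = [line.strip() for line in paragraph.split("\n")]
        unwrapped.append(" ".join(line for line in lines if line))
    return "\n".join(unwrapped)


wrapped = "It was the best\nof times.\n\nIt was the worst\nof times.\n"
flat = unwrap_paragraphs(wrapped)
```

What comes out is one paragraph per line, with no layout baked in, which is the sort of thing I mean by structured rather than formatted.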
Neil