A manual way is:
1 - do a “find and replace” as “every paragraph mark with 2 paragraph marks”;
2 - look into your text, increasing the size of the font helps showing better the oddities in the text, and it’s going to be quite easy to find the paragraph’s broken. Correct them and go on with looking into the text;
3 - in the end, do a “find and replace” to reverse the original one, “every 2 paragraph marks with just 1”.
Proof reading is a long, costly and tedious process… see Project Gutenberg efford with collective proofreading!
In Digitization projects is by far the most costly part of the project and one of the main reasons the PDF format “image with OCRed text under it” is so popular in these projects and also in the Enterprise world.
Best regards,
Last edited by DDHarriman; 09-10-2008 at 02:07 PM.
|