sad that publishers don't use software that detects common problems with text, there's one PG uses that works pretty well called GutCheck
http://gutcheck.sourceforge.net/
It can grab most of those stupid OCR errors, like the ii and arid, along with missing quotes and whatnot. OCR is good today, but still not quite 100% ... more like 99.8%