View Single Post
Old 01-09-2013, 12:53 PM   #6
Paxman53
Connoisseur
Paxman53 began at the beginning.
 
Posts: 55
Karma: 10
Join Date: Jan 2011
Device: 7" Tablet - Aldiko Reader Premium
Quote:
Originally Posted by grannyGrumpy View Post
@Paxman53 --- Another thought --- are you comparing this to an original text and know for sure that the missing punctuation are commas and full stops?

I've worked with numerous OCR scans that dropped the EMDASH. From reading the text it looked like missing commas/full-stops, but the PDF of the original book revealed the missing emdashes.
Sorry for the late reply, I am having major problems with Calibre at the moment.

Yes I am comparing to an original text and there are definite ommissions, but the Regex provided by Perkin has definitely got around the problem.

Thanks for the input though.
Paxman53 is offline   Reply With Quote