Quote:
Originally Posted by HarryT
A PDF containing scanned pages obviously can't be reflowed - it's just a picture.
|
Not to beat a dead horse, but this does seem to be a common misconception. As Markom pointed out, if the text is regular enough, and there are not too many defects, text re-flow on scanned pages can be done reliably using graphical methods to find the text rows and words within the scanned bitmaps (not OCR). The links below are from
willus.com/k2pdfopt (in the middle of the page where the examples are):
Scanned book pages (no OCR layer)
Scanned pages as processed by k2pdfopt (no OCR performed)