View Single Post
Old 06-04-2020, 11:43 PM   #167
Marinolino
Groupie
Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.Marinolino ought to be getting tired of karma fortunes by now.
 
Posts: 184
Karma: 2019866
Join Date: Feb 2018
Device: Kobo Aura-One (using KOReader app), Boox Note-3, iPad(s)
Quote:
Originally Posted by ottischwenk View Post
A scan always results in an image and an image never contains any text that can be further processed.
You can only try to extract this with OCR.
We both have mentioned pdf scans, not just scanned jpeg, bmp etc. image files.

Pdf scans before OCR application are indeed non-searchable (and non-highlightable) scans, but after OCR has been applied thereon they become searchable pdf scans, if OCR layer has been saved within pdf scan.

Although nowadays we can search and highlight even non-searchable pdf scans e.g. using iOS/Android apps that would automatically apply OCR on the currently opened pdf page as we read it (without saving it to pdf thereafter), or we can search a full folder of non-searchable pdf scans in this way without opening any file.

Last edited by Marinolino; 06-05-2020 at 12:20 AM.
Marinolino is offline   Reply With Quote