View Single Post
Old 02-27-2017, 05:34 PM   #4
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,767
Karma: 30237628
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by dwig View Post
It should be noted that as good as they are, none of these search tools incorporate their own OCR (Optical Character Recognition) functions.
Not necessarily true - Windows Search is extensible with 3rd party IFilters

FX : ABBYY OCR IFilter and that's not the only OCR enabled PDF IFilter. I'm unsure if it's packaged in Fine Reader, can't see why not - apart from marketing.

Recent MS PDF IFilters may have it too, given that OneNote does a rather good job of doing OCR on images.

OCR enabled IFilters will use a lot of processor time when they index an image PDF, but that only happens once per PDF. And Windows Search Indexing is normally set so that it only executes when there's nothing else wanting the CPU - you have to go out of your way to make it otherwise.

OCR enabled IFilters will inherit the well know problems of OCR in general, but not all image PDFs originate from scans of 16th century Blackletter on vellum

And I wouldn't be at all surprised if Spotlight didn't have OCR enabled PDF search.

BR

Last edited by BetterRed; 02-27-2017 at 06:27 PM. Reason: add last 2 paras
BetterRed is offline   Reply With Quote