Hi
In order to pre-process image files with scantailor, I may have to convert some source PDFs to PNG files.
There are some online services that do this, but I prefer doing it with
ImageMagick.
My second try, on a 14-page PDF extract from a bigger book, gave this:
Code:
convert garnier.pdf garnier.png
convert: profile 'icc': 'RGB ': RGB color space not permitted on grayscale PNG `garnier.png' @ warning/png.c/MagickPNGWarningHandler/1748.
[roger@lenovo roger]$
It converted all the pages nearly instantly, which is pretty good, but I am not sure I understand the message above. Does anybody know what it means?
Even when I add parameters like -quality 100 or -density 300, one such image is only 27k, while the same image processed with, say, the pdfcandy online service at medium resolution is 55k (see screenshot). Could this difference hinder the OCR process later?
The second image (001) is the one from pdfcandy.
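
In case the option order matters: as far as I know, ImageMagick only applies -density to the PDF rasterization when it is given before the input file, and on PNG output -quality only sets the compression level (PNG is lossless either way). So here is a minimal sketch of what I would try next. The function name pdf_to_png and the output name pattern are just my own placeholders, and I have not checked whether this changes the file sizes.

```shell
# Sketch: render every page of a PDF to a numbered PNG at a given resolution.
# pdf_to_png and the "-%03d.png" naming are hypothetical choices, not from any tool.
pdf_to_png() {
    pdf=$1              # input PDF, e.g. garnier.pdf
    dpi=${2:-300}       # resolution in dpi, defaults to 300
    base=${pdf%.pdf}    # strip the .pdf suffix for the output name
    # -density comes BEFORE the input so it is used when the PDF is rasterized;
    # %03d is replaced by the page index (000, 001, ...).
    convert -density "$dpi" "$pdf" "${base}-%03d.png"
}
# usage: pdf_to_png garnier.pdf 300
```

With garnier.pdf this should give garnier-000.png, garnier-001.png and so on, one file per page.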