View Single Post
Old 12-08-2021, 03:27 PM   #3
Shohreh
Addict
Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.
 
Posts: 207
Karma: 304158
Join Date: Jan 2016
Location: France
Device: none
Thanks, I forgot about it.

This does the trick…
Code:
cpdf.exe -crop "0 70 340.2 462.12" input.pdf -o output.pdf
… but it obviously also removes the bottom of each page.

From what I read, a PDF is a list of objects, with an index at the end.

Is there no way for a script/application to go through that list of objects, find those that contain a given string, and remove them from the list?

The string I want to remove probably lives in the text layer that was added after running the scanned document through an OCR so the user can select/copy instead of having just a bitmap.
Shohreh is offline   Reply With Quote