Quote:
Originally Posted by scotty1024
Even "text" PDF's need something close to a tesseract to figure out what text is where.
|
PDF sometimes seems to be synonymous with "fear the worst".
Preview.app somehow does this job, however. But Apple has legions of programmers to implement something like that and doesn't give us the sourcecode...
I hoped that it was relatively "easy" to figure out which letters are located in a given rectangle and to just order them up-to-down-left-to-right and then send the resulting word to whatever helper application.
edit: I found some very nice links that explain basic why-PDF-can-be-difficult-to-handle-stuff:
http://www.dclab.com/converting_from_pdf.asp
http://www.dclab.com/converting_from_pdf2.asp
(have a look at the links on the right in the second article)
#!chris