Hi
I just continued the work on this project a little bit.
To figure out whether the pdfloc coordinates refer to actual pixel localizations in a certain document (independent of the font, letter,....) I created another similar documents with slight changes.
New settings :
Font size: 18 instead of 12
again adding one x per line.
Results:
The second coordinate indeed represents the page
Surprisingly the lines are still increasing every line by a value +8
--> either it is font size independent (would be much easier), or it actually describes the spacing between paragraphs
Edit: I made another test with increased spacing and the numbers still increased by 8!!
The number per letter is again increasing by (x-1)+1
--> Since I found the same for a much smaller font size, it indicates that the pdfloc indeed describe the number of letters independent of their size !
Summary:
We know for the coordinates, that the
2nd is the page
3rd is the line increasing by 8 (1st line is 9)
5th is the number of letters increasing by 1 (beginning from 2 letter =1)
last indicating end of the mark? at least it is 0 for the start coordinates and 1 for the end
Note: I tried as well to mark over two lines (same system: increasing the x`s per line) and found that it does NOT add any additional information, just the start and end letter coordinate
Last edited by ebioman; 05-13-2010 at 05:21 AM.
|