Thread: PRS-600 prs-600 notes to printing
10-23-2009, 01:40 PM   #24
NightWhistler
Developer of PageTurner
NightWhistler began at the beginning.
 
Posts: 34
Karma: 10
Join Date: Jun 2009
Location: Voorburg, Netherlands
Device: Samsung Galaxy S3, Asus Transformer Prime
I did some testing with this earlier: created the same document as ePub and as PDF and selected the same text in both.

For PDF it uses the same pdfloc style to indicate the start and end points. I couldn't find a way to reliably trace a pdfloc back to a specific point in the text, though.

For the ePub it used a different format, indicating the line and the position on that line. Again, a start and an end point. This makes me think we'd need a separate decoder per format, though with ePub, PDF and LRF handled we should have the most important ones covered.

If you want I could upload my sample files including annotations. From what I've seen so far the process would be:
  1. Find the annotation for a specific document
  2. Decode the start- and end-positions
  3. Extract the text between those positions from the original document
  4. Add any annotations entered by the user
  5. Output it all in some kind of readable format.
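
To make the five steps a bit more concrete, here is a rough Python sketch of how such a tool could be structured. Everything in it is an assumption for illustration: the `Annotation` fields, the `DECODERS` registry and the "line:column" position format are made up, and the real ePub and pdfloc encodings would still need to be worked out and plugged in as separate decoders.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical record; the reader's real annotation fields may differ.
@dataclass
class Annotation:
    document: str   # id or path of the annotated document
    fmt: str        # "epub", "pdf" or "lrf"
    start: str      # raw, still-encoded start position
    end: str        # raw, still-encoded end position
    note: str = ""  # optional comment entered by the user

# A decoder maps a raw position to a character offset in the extracted text.
Decoder = Callable[[str, str], int]

def decode_line_col(raw: str, text: str) -> int:
    """Toy decoder for a made-up 0-based 'line:column' position."""
    line, col = (int(part) for part in raw.split(":"))
    lines = text.split("\n")
    # Length of all preceding lines, plus one newline per line.
    return sum(len(l) + 1 for l in lines[:line]) + col

# One decoder per format; PDF and LRF are deliberately left unimplemented.
DECODERS: Dict[str, Decoder] = {"epub": decode_line_col}

def export(annotations: List[Annotation], texts: Dict[str, str], document: str) -> str:
    entries = []
    for ann in annotations:
        if ann.document != document:        # step 1: find annotations for the document
            continue
        decoder = DECODERS.get(ann.fmt)
        if decoder is None:
            raise NotImplementedError(f"no decoder for format {ann.fmt!r}")
        text = texts[ann.document]
        start = decoder(ann.start, text)    # step 2: decode start and end positions
        end = decoder(ann.end, text)
        entry = f'"{text[start:end]}"'      # step 3: extract the selected text
        if ann.note:                        # step 4: add the user's own note
            entry += f"\n  Note: {ann.note}"
        entries.append(entry)
    return "\n\n".join(entries)             # step 5: readable output

texts = {"book": "Hello world\nSecond line here"}
anns = [Annotation("book", "epub", "1:0", "1:6", note="check this")]
print(export(anns, texts, "book"))
```

Keeping the decoders in a lookup table means supporting another format only requires adding one function, and steps 1, 3, 4 and 5 stay the same for all formats.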