Old 04-14-2008, 08:12 AM   #24
WillAdams
BlackVoid, the problem is that the formatting in a .pdf is encoded as positional information (place this character in this font at these x,y coordinates), so one needs to analyse those positions to determine where paragraphs begin and end, etc.
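
To make that concrete, here is a rough Python sketch of the sort of analysis involved. It is not how TextLightning or any other particular tool works; it just assumes you already have text fragments as (x, y, text) tuples in PDF coordinates (y grows upward), and the function names, tolerances and sample data are all made up for illustration. It groups fragments into lines by their y position, then starts a new paragraph wherever the vertical gap is noticeably larger than the typical line spacing or the line is indented.

```python
from statistics import median

def group_into_lines(fragments, y_tol=2.0):
    """Merge (x, y, text) fragments sharing a baseline into (x_start, y, text) lines."""
    lines = []
    # PDF y grows upward, so descending y gives top-to-bottom reading order.
    for x, y, text in sorted(fragments, key=lambda f: (-f[1], f[0])):
        if lines and abs(lines[-1]["y"] - y) <= y_tol:
            lines[-1]["parts"].append((x, text))
        else:
            lines.append({"y": y, "parts": [(x, text)]})
    result = []
    for line in lines:
        parts = sorted(line["parts"])  # left to right within the line
        result.append((parts[0][0], line["y"], " ".join(t for _, t in parts)))
    return result

def split_paragraphs(lines, gap_factor=1.5, indent_tol=5.0):
    """Guess paragraph breaks from vertical gaps and first-line indents."""
    if not lines:
        return []
    gaps = [a[1] - b[1] for a, b in zip(lines, lines[1:])]
    leading = median(gaps) if gaps else 0.0      # typical line spacing
    left_margin = min(x for x, _, _ in lines)
    paragraphs = [[lines[0][2]]]
    for prev, cur in zip(lines, lines[1:]):
        big_gap = (prev[1] - cur[1]) > gap_factor * leading
        indented = (cur[0] - left_margin) > indent_tol
        if big_gap or indented:
            paragraphs.append([cur[2]])
        else:
            paragraphs[-1].append(cur[2])
    return [" ".join(p) for p in paragraphs]

# Hypothetical fragments, as a PDF text extractor might report them.
sample = [
    (72, 700, "The quick"), (130, 700, "brown fox"),
    (72, 688, "jumps over"), (72, 676, "the lazy dog."),
    (72, 652, "A new paragraph"), (72, 640, "starts here."),
]
for paragraph in split_paragraphs(group_into_lines(sample)):
    print(paragraph)
```

Real documents need a good deal more than this (multiple columns, headers and footers, hyphenation, changes of font size), which is why dedicated tools exist.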

Marcel Weiher wrote a utility for this, TextLightning for Mac OS X (ob. discl.: it's shareware and I was a beta tester), and there are a few other tools which do the same, e.g., SolidPDF for Windows.

The exception is a ``tagged'' .pdf, where the paragraph structure is embedded in the file structure, so it doesn't have to be worked out from the character positions.

William