MobileRead Forums - View Single Post

mike_bike_kite · 04-30-2010, 04:09 AM

Yep - I'd read those pages, I also understand HTML and, to a lesser extent, XML. Problem is I'm trying to write small bits of code in Calibre using a language I don't know (XPATH) to process the contents of a file I can't see the contents of (PDF) and for some strange reason I seem to be having problems

If I could just view the text then I could write a little program to stitch things back together. Are there converters that perhaps perform OCR on the PDF and just output the text?

Mike

04-30-2010, 04:09 AM	#4
mike_bike_kite Digitally confused Posts: 500 Karma: 1500000 Join Date: Mar 2010 Location: London, UK Device: KPW, K2i, Nexus 7 32gb, Kobo Mini	Yep - I'd read those pages, I also understand HTML and, to a lesser extent, XML. Problem is I'm trying to write small bits of code in Calibre using a language I don't know (XPATH) to process the contents of a file I can't see the contents of (PDF) and for some strange reason I seem to be having problems If I could just view the text then I could write a little program to stitch things back together. Are there converters that perhaps perform OCR on the PDF and just output the text? Mike