Old 05-06-2011, 01:29 AM   #111
saddan
Junior Member
saddan began at the beginning.
 
Posts: 1
Karma: 10
Join Date: May 2011
Device: none
Thanks for this plugin!

I had problems with some books. With one of them, I got this exception:

XMLSyntaxError: PCDATA invalid Char value 24, line 159, column 54

After adding some print statements, I noticed that the XML generated by the function _read_pdf_text in scan.py contained some invalid characters.

So I modified it to replace most non-printable characters with an underscore ('_').
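
For anyone who wants the general idea without applying the patch, here is a rough sketch of that kind of replacement. It is not the attached diff: the function name sanitize_xml_text and the regex are made up for illustration, and it assumes Python 3 and that the text is cleaned before it is handed to the XML parser. Char value 24 in the error is the control character 0x18, which XML 1.0 does not allow.

```python
import re

# Hypothetical helper, not the code from the attached patch.
# Matches characters that XML 1.0 forbids in character data:
# the C0 control characters other than tab (0x09), newline (0x0A)
# and carriage return (0x0D), plus the noncharacters U+FFFE and U+FFFF.
_INVALID_XML_CHARS = re.compile(r'[\x00-\x08\x0b\x0c\x0e-\x1f\ufffe\uffff]')


def sanitize_xml_text(text, replacement='_'):
    """Return text with XML-invalid characters swapped for replacement."""
    return _INVALID_XML_CHARS.sub(replacement, text)


if __name__ == '__main__':
    # chr(24) is the character reported in the XMLSyntaxError above.
    broken = 'some' + chr(24) + 'text'
    print(sanitize_xml_text(broken))
```

This only covers the control characters; lone surrogates are also invalid in XML and would need an extra range if they show up in the extracted text.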

I'm attaching a diff of the modifications I made.

patch.scan.py.txt