View Single Post
Old 01-03-2013, 12:33 PM   #5
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 5,789
Karma: 4027751
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon
Quote:
Originally Posted by anandudapudi View Post
but i want to know how i can copy text from pdf to HTML page exactly without this encoding hassle.
There's no solution guaranteed to work, other than OCR, because a PDF is more concerned about its looks than about the underlying meaning. In some cases, if the PDF font uses some known/standard encoding, you can maybe copy and paste with the right settings, or do a conversion afterwards. You may be lucky, and maybe the PDF encoding is ISCII (in that case it should be possible to find a converter), but it could be some ad-hoc encoding used only in that particular document.

Last edited by Jellby; 01-03-2013 at 12:35 PM.
Jellby is offline   Reply With Quote