View Single Post
Old 08-20-2014, 11:47 PM   #5
knowledgecrawler
Member
knowledgecrawler began at the beginning.
 
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
Quote:
Originally Posted by kovidgoyal View Post
You need to fiure out what the encoding for the html pages you are scraping is. Common choices, latin1, cp1252, utf-8
Found the charset
PHP Code:
<meta http-equiv=Content-Type content="text/html; charset=windows-1252"
Tried with
PHP Code:
 encoding 'cp1252' 
This fixed the issue
PHP Code:
encoding 'utf-8' 
Kudos!..

Last edited by knowledgecrawler; 08-20-2014 at 11:59 PM.
knowledgecrawler is offline   Reply With Quote