|05-04-2008, 12:42 PM||#1|
Join Date: May 2008
i've just joined this forum and am quite new to plucker.
basically, i cant figure out why the following website yields a "Parsing failed" error when using plucker.
the website i am after is: http://www.dailyreckoning.com.au
the error i get is:
Error: Runtime error parsing document http://www.dailyreckoning.com.au/: unexpected char in declaration: '<'
---- all 0 pages retrieved and parsed ----
Last edited by DaleDe; 05-04-2008 at 07:39 PM. Reason: spelling
|05-04-2008, 09:10 PM||#2|
Join Date: Feb 2003
Location: New London, CT
Device: Direct Neural Implant
Plucker deals with real HTML, not fake HTML
This is especially important when dealing with XML, because the spec itself says that ANY error in XML should immediately throw a fatal error in the parser... as it does with Plucker.
The result is that you'll either have to tell them to clean up their HTML, or clean it up yourself in an inline filter or parse the pages locally with something like tidy or similar tools.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|plucker to something||eksor||Other formats||2||09-18-2009 03:52 AM|
|Plucker and TX||tsej||Reading and Management||4||02-28-2006 09:07 AM|
|why plucker?||mutsuenmei||Reading and Management||1||05-29-2005 07:19 AM|
|Plucker V1.6.2 out||TadW||Reading and Management||2||01-11-2004 02:02 PM|
|Plucker V1.6.1 out!||Alexander Turcic||Reading and Management||0||11-09-2003 11:14 AM|