View Single Post
Old 07-16-2012, 07:43 AM   #1
Steven630
Groupie
Steven630 began at the beginning.
 
Posts: 154
Karma: 10
Join Date: May 2012
Device: Kindle Paperwhite2
captureing all articles but all under the first section

Here is how the webpage looks like

Code:
<div class='module'>

<h3>Section1</h3>

<ul>
<li>
<h4>articles links and article titles</h4>
</li>
……(other articles)
</ul>

<h3>Section2</h3>

<ul>
<li>
<h4>articles links and article titles</h4>
</li>
…… (other articles)
</ul>
So I tried to parse it, setting 'module' as the section.:

But it turned out that though all the articles were fetched correctly, they all end up in the first section.But I am at a loss what to do, since all section names are included in "h3", and unlike webpage of built-in recipes, <div class='module'> appears only before the first section, not every section (which I think explains the failure). Can anyone help me out? Just a quick answer is appreciated.

Last edited by Steven630; 07-19-2012 at 03:52 AM.
Steven630 is offline   Reply With Quote