Quote:
Originally Posted by nickredding
Code:
for div in soup.findAll('div', 'module bodytext'):
    div['class'] = 'module'
|
Thanks for the reply, but it doesn't work. Here is what I tried:
Code:
def preprocess_html(self, soup):
    for div in soup.findAll('div', attrs={'class': 'module bodytext'}):
        div['class'] = 'module'
    return soup
Quote:
<div class="bodytext">
<div class="module bodytext">
<img alt="伊朗地震" class="calibre8" src="../Images/img1_u6.jpg" />
<p class="caption">夜幕降临给救援工作增加了难度</p>
</div>
|
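For comparison, here is a minimal standalone sketch of the same class rewrite done with a regular expression instead of BeautifulSoup, run against the sample markup quoted above. It uses only Python's `re` module. The names `SAMPLE_HTML`, `CLASS_PATTERN` and `strip_bodytext` are made up for illustration, and this is a swapped-in technique, not the approach suggested earlier in the thread.

```python
import re

# Sample markup copied from the quoted excerpt (illustrative test input only).
SAMPLE_HTML = (
    '<div class="bodytext">\n'
    '<div class="module bodytext">\n'
    '<img alt="伊朗地震" class="calibre8" src="../Images/img1_u6.jpg" />\n'
    '<p class="caption">夜幕降临给救援工作增加了难度</p>\n'
    '</div>\n'
)

# Hypothetical pattern: matches only a div whose class attribute is
# exactly "module bodytext" (assumes double quotes and no other attributes).
CLASS_PATTERN = re.compile(r'<div\s+class="module bodytext"\s*>')


def strip_bodytext(html):
    """Rewrite class="module bodytext" to class="module" on matching div tags."""
    return CLASS_PATTERN.sub('<div class="module">', html)


result = strip_bodytext(SAMPLE_HTML)
print(result)
```

If this gives the expected markup, the same pattern and replacement could presumably be adapted to the recipe's `preprocess_regexps` list, which takes (compiled pattern, function) pairs. That part is untested here.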