04-18-2010, 11:03 AM   #1797
kiklop74
Guru
Quote:
Originally Posted by gambarini
What is the correct way to remove all unnecessary tags from this article?
http://www.lastampa.it/_web/cmstp/tm...ne=29&sezione=

The article is formatted with a table, without attributes...
I would do this (add it to your recipe):

Code:
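    # Styling for the classes that are kept below
    extra_css = ' .boxocchiello2{font-size: small} .titoloRub{font-size: xx-large; font-weight: bold } .sezione{font-weight: bold} '

    # Flatten the layout table into linear content during conversion
    conversion_options = {'linearize_tables':True}

    # Strip the colspan attribute from every tag
    remove_attributes = ['colspan']

    # Keep only elements whose class is one of these; everything else is dropped
    keep_only_tags = [dict(attrs={'class':['boxocchiello2','titoloRub','sezione','articologirata']})]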
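
In case it is not obvious where those lines go, here is a rough sketch of how they might sit inside a recipe class. The class name, the title and the import fallback are my own placeholders, not part of the original recipe, and the feed definitions are left out because they are not given in this thread. Treat it as an illustration rather than a finished recipe.

```python
# Sketch only: LaStampaSketch and its title are hypothetical placeholders.
try:
    # Available when the file is run by calibre itself
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Assumption: plain fallback so the file can be inspected outside calibre
    BasicNewsRecipe = object


class LaStampaSketch(BasicNewsRecipe):
    title = 'La Stampa (sketch)'  # placeholder title

    # feeds / parse_index would go here; omitted because the thread does not give them

    # Styling for the classes that are kept below
    extra_css = (' .boxocchiello2{font-size: small}'
                 ' .titoloRub{font-size: xx-large; font-weight: bold }'
                 ' .sezione{font-weight: bold} ')

    # Flatten the layout table into linear content during conversion
    conversion_options = {'linearize_tables': True}

    # Strip the colspan attribute from every tag
    remove_attributes = ['colspan']

    # Keep only elements whose class is one of these
    keep_only_tags = [dict(attrs={'class': ['boxocchiello2', 'titoloRub',
                                            'sezione', 'articologirata']})]
```

All four settings are plain class attributes, so they can simply be pasted into the body of your existing recipe class next to whatever you already have there.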