View Single Post
Old 02-09-2011, 06:57 AM   #2
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.
 
DoctorOhh's Avatar
 
Posts: 9,897
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen
Quote:
Originally Posted by tomsem View Post
But one of the news sources I picked at random ('Bill O'Reilly' in fact..) generated an azw file where several of the articles truncate text at the bottom of a page. When the 5way cursor is moved into the text, it goes into 'table pan' mode. I unpacked the azw and sure enough, the article text is all packed inside a <td> tag. The table cell has too much text to fit on one page, even at the smallest text size.
Most folks building recipes go out of their way to eliminate the tables so the Bill O'Reilly recipe should be the rare exception and not the rule.

Adding the conversion_options after remove_javascript = True removed the tables and the recipe should work fine for you after this customization.

Code:
    remove_javascript     = True
    conversion_options = {
                          'linearize_tables' : True 
                        }
Moved to recipe sub-forum in case my advice is off the mark.
DoctorOhh is offline   Reply With Quote