Old 02-09-2011, 02:21 AM   #1
tomsem
Wizard
Posts: 2,410
Karma: 2519673
Join Date: Apr 2009
Location: USA
Device: iPod Touch, Xoom, Kindle PW, iPad3, Fire HD2
Truncated page with some 'Fetch news' source

I was experimenting with the new Kindle software Preview (3.1) and wanted to confirm that the new subscription layout worked with calibre's Fetch News feature. For the most part it works great. I had totally switched over to reading RSS with the 'Reeder' app on my iPod Touch, but with the new layout I'm tempted to use calibre again for a few feeds so I can read them on my Kindle instead. It is just SO much better.

But one of the news sources I picked at random ('Bill O'Reilly', in fact) generated an azw file in which several of the articles are truncated at the bottom of a page. When the 5-way cursor is moved into the text, it goes into 'table pan' mode. I unpacked the azw and, sure enough, the article text is all packed inside a <td> tag. The table cell has too much text to fit on one page, even at the smallest text size.

Kindle has known issues with <table> and this must certainly be one of them.
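
The manual check described above (unpacking the book and looking for article text sitting inside a <td>) could be automated along these lines. This is only a rough sketch using Python's standard html.parser; the names CellTextCounter and oversized_cells and the 2000-character limit are made up for illustration and are not part of calibre or the Kindle tools.

```python
from html.parser import HTMLParser


class CellTextCounter(HTMLParser):
    """Record how much text sits inside each <td>/<th> cell."""

    def __init__(self):
        super().__init__()
        self._open_cells = []   # running text lengths of cells still open
        self.cell_lengths = []  # final text length of every closed cell

    def handle_starttag(self, tag, attrs):
        if tag in ("td", "th"):
            self._open_cells.append(0)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._open_cells:
            self.cell_lengths.append(self._open_cells.pop())

    def handle_data(self, data):
        # Text counts toward every open cell, so an outer cell that wraps
        # a nested table is charged for the nested text as well.
        n = len(data.strip())
        for i in range(len(self._open_cells)):
            self._open_cells[i] += n


def oversized_cells(html, limit=2000):
    """Return the text lengths of cells longer than `limit` characters.

    The default limit is an arbitrary assumption, not a known Kindle page size.
    """
    parser = CellTextCounter()
    parser.feed(html)
    parser.close()
    return [n for n in parser.cell_lengths if n > limit]
```

Running oversized_cells over each HTML file from the unpacked book and getting a non-empty list would point at articles likely to show the truncation described here.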

Another news source I picked, 'Chicago Tribune', seemed to work okay, though I didn't check everything.

What I'm wondering is whether this is an issue with calibre recipes in general or just with this one. I did not see the problem before the 3.1 update, but then I had never tried 'Bill O'Reilly' before. I really don't think the 3.1 update is the cause, but I haven't had a chance to try these on my K2 yet (I will update this thread when I have).

The offending <table> usage is surely inherited from the source HTML, so the HTML needs to be sanitized before it is converted to azw. Does calibre have a facility for doing this?
Old 02-09-2011, 06:57 AM   #2
DoctorOhh
US Navy, Retired
Posts: 8,811
Karma: 12535517
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
Quote:
Originally Posted by tomsem View Post
But one of the news sources I picked at random ('Bill O'Reilly', in fact) generated an azw file in which several of the articles are truncated at the bottom of a page. When the 5-way cursor is moved into the text, it goes into 'table pan' mode. I unpacked the azw and, sure enough, the article text is all packed inside a <td> tag. The table cell has too much text to fit on one page, even at the smallest text size.
Most folks building recipes go out of their way to eliminate tables, so the Bill O'Reilly recipe should be the rare exception and not the rule.

Adding the conversion_options below, right after remove_javascript = True, removed the tables, so the recipe should work fine for you with this customization.

Code:
    remove_javascript  = True
    # Ask the conversion step to flatten <table> markup
    conversion_options = {
        'linearize_tables': True
    }
Moved to recipe sub-forum in case my advice is off the mark.
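
To show where those lines sit, here is a minimal sketch of a whole recipe with the option in place. It is not the actual Bill O'Reilly recipe: the class name, title and feed URL are placeholders, and the try/except stand-in is only an assumption made so the file can be loaded outside calibre for a quick check.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Outside calibre the import is unavailable; this stand-in only lets
    # the sketch be loaded for a sanity check and does nothing else.
    class BasicNewsRecipe(object):
        pass


class ExampleNews(BasicNewsRecipe):
    # Hypothetical recipe: the title and feed URL are placeholders.
    title = 'Example News'
    oldest_article = 7
    max_articles_per_feed = 25
    remove_javascript = True

    # Options handed to the conversion step; linearize_tables asks it to
    # flatten <table> markup so long cells no longer overflow a page.
    conversion_options = {
        'linearize_tables': True,
    }

    feeds = [('Top Stories', 'https://example.com/rss.xml')]
```

In an existing recipe, only the conversion_options block needs to be added; the other attributes are there just to make the sketch complete.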
Old 02-09-2011, 11:48 AM   #3
tomsem
Wizard
Thanks, that answers my question!