Truncated page with some 'Fetch news' source
I was experimenting with the new Kindle software Preview (3.1) and wanted to confirm that the new subscription layout worked with calibre's Fetch News feature. For the most part, it works great, and while I had totally switched over to reading RSS with 'Reeder' app on my iPod Touch, with the new layout, I'm tempted to use calibre again for a few of them so I can read on my Kindle instead. It is just SO much better.
But one of the news sources I picked at random ('Bill O'Reilly' in fact..) generated an azw file where several of the articles truncate text at the bottom of a page. When the 5way cursor is moved into the text, it goes into 'table pan' mode. I unpacked the azw and sure enough, the article text is all packed inside a <td> tag. The table cell has too much text to fit on one page, even at the smallest text size.
Kindle has known issues with <table> and this must certainly be one of them.
Another news source I picked, 'Chicago Tribune', seemed to work okay, though I didn't check everything.
What I'm wondering is if this is an issue with calibre recipes in general, or just the one with the problem? I did not see this problem before the 3.1 update, but then, I never tried 'Bill OReilly' before... I really don't think it is the 3.1 update, but haven't had a chance to try these on my K2 yet. (will update this thread when I have)
Surely the offending <table> usage is inherited from the source HTML, but to avoid this problem, the HTML needs to be sanitized to avoid causing problems when converted to azw. Does calibre have a facility for doing this?
|