11-28-2010, 09:16 AM | #1 |
Connoisseur
Posts: 57
Karma: 10
Join Date: Feb 2010
Device: Kindle Paperwhite 1
|
Recipe for Ukrainian Economic / Legal news sites.
Hi folks,
I have to tell that I'm a total rookie here. For half a year I've been using this shamefully primitive self-made recipe to download news from Ukrainian economic sites. Here is the recipe itself: Code:
class AdvancedUserRecipe1268599504(BasicNewsRecipe): title = u'Economics UA' oldest_article = 7 max_articles_per_feed = 100 feeds = [(u'Publications', u'http://www.epravda.com.ua/rss/id_433/'), (u'Columnists', u'http://www.epravda.com.ua/rss/id_432/'), (u'Finance.UA articles', u'http://feed43.com/6441846012758810.xml'), (u'Liga:News', u'http://news.ligazakon.ua/news_rss/tape_clauses.xml')] def print_version(self, url): if url.startswith('http://www.epravda.com.ua'): return url + 'view_print/' else: return url But since inclusion of periodics script for Sony Readers this recipe started to act funny. When opened on my PRS-650, it either freezes, slows or crashes the device, even though random pages, I've manged to load seem to look ok. This problems seems to be specific only to this feed - other standard calibre recipes or self-made recipes are handled perfectly by the reader. I do realize that throwing in several feeds into one recipe is not a good thing, so don't object splitting it up, just don't know how to do it properly. And yeah, the funny RSS-feed address at feed43.com is actually a service I use, which allows you to create an RSS-feeds from almost any site, since that site don't have specific RSS-feeds for the column I want to read. http://news.finance.ua/ua/~/2 - that's the page it fetches news from, if it's of any importance. Calibre itself seems to handle this issue fine and fetches the full article, but maybe it's the reason for reader crashes. Thanks in advance for any help! |
11-28-2010, 11:15 AM | #2 |
creator of calibre
Posts: 43,916
Karma: 22669818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
generally speaking, you should remove as much extra guff from the downloaded articles as possible. Use the remove_tags and keep_only_tags features for this.
|
Advert | |
|
11-28-2010, 06:10 PM | #3 |
Connoisseur
Posts: 57
Karma: 10
Join Date: Feb 2010
Device: Kindle Paperwhite 1
|
ok. At least I figured where the problem lies with glitches. It now seems to load quite well, but the other thing poped up.
Since it's Ukrainian, the alphabet is Cyrillic and when loaded to the Reader or Reader Library all the Cyrillic letters are replaced by question marks. My Reader uses custom firmware which supports Cyrillic and normally all the lrf or epub files are displayed correctly. So I don't really know where the problem may lie? I have a slight suspicion, that it may have something to do with the periodicals, because before that, epub or lrf recipe files as simple ebooks where displayed properly. |
11-28-2010, 06:23 PM | #4 |
creator of calibre
Posts: 43,916
Karma: 22669818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
well if you change your output profile to say cybook opus then the epubs will no longer be periodicals, see if that helps.
|
11-28-2010, 06:31 PM | #5 |
Connoisseur
Posts: 57
Karma: 10
Join Date: Feb 2010
Device: Kindle Paperwhite 1
|
It does help - changed to LRF and everything worked perfect.
But still I find Periodicals menu quite useful and don't like all the book folder messed up by news recipes... also, just notices that it didn't create a proper TOC for the Periodical EPUB. It's just Front Page there and that's it... Update: just checked with epub file - non-periodical outlaying doesn't display fon't properly either. So it's something about the encoding (in the calibre viewer everything is fine). I will check with the author of the firmware, but I think it uses Unicode. Can it be the case, that calibre converter uses something different? Will also try to convert some regular books into epub in Calibre and will tell the result... Final Update: Yep, it's definitely some problem with Calibre Epub converter (maybe not a problem, just me being lame with customizations) - all calibre-created epubs show all cyrcillic letters as question marks, while external load on reader properly. Last edited by Dereks; 11-28-2010 at 07:07 PM. |
Advert | |
|
Tags |
periodics, prs-650, request |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
New recipe request - BBC News Ukrainian | storkozos | Introduce Yourself | 7 | 10-25-2010 11:36 AM |
Recipe for BBC Ukrainian | storkozos | Recipes | 1 | 10-21-2010 07:01 AM |
Log-in to news sites? | JDługosz | Calibre | 1 | 07-03-2010 10:06 PM |
NewsRaider scrapes sites for news | Alexander Turcic | Lounge | 6 | 08-12-2005 04:50 AM |
Thoughts on Mobile News Sites | Bob Russell | Lounge | 0 | 05-17-2005 08:38 AM |