Old 01-01-2023, 09:13 PM   #1
rasteps
NHK News (Japanese) recipe for submission

@Kovid, I know that it's still rough, and you don't like to play around with non-English websites, but thank you for helping out with this. It has been downloading more consistently lately.

There are very few Japanese-language sites available, though, so I would like to submit this recipe. Hopefully, someone more knowledgeable can clean it up and make it better.


Code:
from calibre.web.feeds.news import BasicNewsRecipe


class NHKNews(BasicNewsRecipe):
    # Feed source: https://www.nhk.or.jp/toppage/rss/index.html

    title = 'NHK News'
    description = 'NHK News in Japanese'
    __author__ = 'Richard A. Steps'
    use_embedded_content = False
    language = 'ja'
    max_articles_per_feed = 30
    remove_javascript = True
    auto_cleanup = True
    
# This line added to deal with bots on site
    def get_browser(self):
        return BasicNewsRecipe.get_browser(self, user_agent='common_words/based')

    feeds = [
        ('主要ニュース', 'https://www.nhk.or.jp/rss/news/cat0.xml?format=xml'),
        ('社会', 'https://www.nhk.or.jp/rss/news/cat1.xml?format=xml'),
        ('科学・医療', 'https://www.nhk.or.jp/rss/news/cat3.xml?format=xml'),
        ('政治', 'https://www.nhk.or.jp/rss/news/cat4.xml?format=xml'),
        ('経済', 'https://www.nhk.or.jp/rss/news/cat5.xml?format=xml'),
        ('国際', 'https://www.nhk.or.jp/rss/news/cat6.xml?format=xml'),
        ('スポーツ', 'https://www.nhk.or.jp/rss/news/cat7.xml?format=xml'),
        ('文化・エンタメ', 'https://www.nhk.or.jp/rss/news/cat2.xml?format=xml'),
    ]
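
For anyone who wants to tidy this up: all eight feed URLs differ only in the category number, so the list could probably be generated instead of typed out. This is only a rough sketch. NHK_FEED_URL and CATEGORIES are names I made up for illustration, not part of calibre, and the category numbers are simply the ones from the URLs in the recipe.

```python
# Sketch only: NHK_FEED_URL and CATEGORIES are hypothetical helper names,
# not calibre API. The numbers come from the cat<N>.xml URLs in the recipe.
NHK_FEED_URL = 'https://www.nhk.or.jp/rss/news/cat{}.xml?format=xml'

# (section title, category number), in the order the sections should appear
CATEGORIES = [
    ('主要ニュース', 0),
    ('社会', 1),
    ('科学・医療', 3),
    ('政治', 4),
    ('経済', 5),
    ('国際', 6),
    ('スポーツ', 7),
    ('文化・エンタメ', 2),
]

# Same (title, url) pairs as the hand-written feeds list in the recipe
feeds = [(title, NHK_FEED_URL.format(number)) for title, number in CATEGORIES]
```

If this were used in the recipe, the two constants would need to sit at module level above the class, with only the feeds line inside the class body, because a list comprehension in a class body cannot see other class-level names.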