Old 10-12-2014, 09:19 PM   #1
rajil.s
Junior Member
rajil.s began at the beginning.
 
Posts: 9
Karma: 10
Join Date: May 2012
Device: Kindle Touch
Hindi Newspaper, Amar Ujala

Hello,

I have a Kindle Touch and was wondering whether it is possible to read a Hindi newspaper on it. I couldn't find any built-in recipe in Calibre for a Hindi-language source. Has anybody written a recipe for a Hindi newspaper before?

The newspaper I am interested in is Amar Ujala, which lists its RSS feeds at http://www.amarujala.com/channels/rss/

Cheers,
Rajil
Old 10-12-2014, 11:30 PM   #2
rajil.s
I had a first stab at the recipe. There are a few problems with it:
1. Some articles are spread over multiple pages. How do I fetch the text from all the pages and merge it into one article?
2. Each feed in the RSS section carries a fixed description, e.g. "Amarujala News : A Hindi News Website covers Breaking India news samachar in hindi, News Headlines in hindi from every State of India, news on business, sports, bollywood, political and more only at Amarujala.com". How do I delete this?

Any pointers will be appreciated.
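
On the first problem, the usual idea is to follow each article's "next page" link and collect the pages in order before merging them. The snippet below is only a sketch of that loop in plain Python so it can be tried outside Calibre. The `class="next"` link pattern is a guess, since the site's actual pagination markup is not shown here, and `collect_pages`, `find_next` and `fake_site` are made-up names. In a recipe, the same loop would typically sit in `preprocess_html`, with `self.index_to_soup(url)` doing the fetching.

```python
import re


def collect_pages(first_url, fetch, find_next, max_pages=10):
    """Follow "next page" links from first_url; return each page's HTML in order."""
    pages, seen, url = [], set(), first_url
    # Stop on a missing link, a repeated URL (loop), or the page cap.
    while url and url not in seen and len(pages) < max_pages:
        seen.add(url)
        html = fetch(url)
        pages.append(html)
        url = find_next(html)
    return pages


# Hypothetical pagination markup: <a class="next" href="...">
NEXT_LINK = re.compile(r'<a[^>]*class="next"[^>]*href="([^"]+)"')


def find_next(html):
    """Return the href of the next-page link, or None on the last page."""
    match = NEXT_LINK.search(html)
    return match.group(1) if match else None


# In-memory stand-in for the website, for demonstration only.
fake_site = {
    "/story?page=1": '<p>Part one.</p><a class="next" href="/story?page=2">Next</a>',
    "/story?page=2": "<p>Part two.</p>",
}
pages = collect_pages("/story?page=1", fake_site.__getitem__, find_next)
merged = "\n".join(pages)
```

The `seen` set and `max_pages` cap are there so that a page linking back to itself cannot make the download loop forever.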

Code:
from calibre.web.feeds.news import BasicNewsRecipe


class AmarUjala(BasicNewsRecipe):
    title = u'Amar Ujala'
    language = 'hi_IN'
    publication_type = 'newspaper'
    masthead_url = 'http://epaper.amarujala.com/images/header_logo.gif'

    oldest_article = 2.0  # days
    use_embedded_content = False  # download the linked article pages
    remove_empty_feeds = True
    no_stylesheets = True
    auto_cleanup = True  # let Calibre extract the article body heuristically

    # (section title, RSS URL) pairs
    feeds = [
        (u'National News',
         u'http://www.amarujala.com/rss/national-news.xml'),
        (u'International news',
         u'http://www.amarujala.com/rss/international-news.xml'),
        (u'Sports news',
         u'http://www.amarujala.com/rss/sports-news.xml'),
        (u'Business News',
         u'http://www.amarujala.com/rss/business-news.xml'),
        (u'Technology News',
         u'http://www.amarujala.com/rss/technology-news.xml'),
    ]
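
On the second problem, one possible approach is to blank the fixed text after the feeds have been parsed. This is a sketch under assumptions: it guesses that the blurb ends up in the feed's `description` and/or in each article's `summary` / `text_summary`, which has not been checked against the actual feeds. `strip_blurb` and `BLURB_PREFIX` are made-up names, and the demo uses stand-in objects rather than real Calibre feeds.

```python
from types import SimpleNamespace

# Start of the fixed text quoted above; matching on a prefix avoids
# depending on the exact wording of the rest.
BLURB_PREFIX = "Amarujala News : A Hindi News Website"


def strip_blurb(feeds, prefix=BLURB_PREFIX):
    """Blank any feed description or article summary that starts with the blurb."""
    for feed in feeds:
        if (getattr(feed, "description", None) or "").strip().startswith(prefix):
            feed.description = ""
        for article in getattr(feed, "articles", None) or []:
            for attr in ("summary", "text_summary"):
                if (getattr(article, attr, None) or "").strip().startswith(prefix):
                    setattr(article, attr, "")
    return feeds


# Inside the recipe class, the hook would look roughly like this:
#
#     def parse_feeds(self):
#         feeds = BasicNewsRecipe.parse_feeds(self)
#         return strip_blurb(feeds)

# Stand-in objects for demonstration only.
demo_feed = SimpleNamespace(
    description=BLURB_PREFIX + " covers Breaking India news",
    articles=[
        SimpleNamespace(summary="A real summary.",
                        text_summary=BLURB_PREFIX + " covers Breaking India news"),
    ],
)
strip_blurb([demo_feed])
```

Summaries that do not start with the blurb are left alone, so genuine article descriptions survive.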
All times are GMT -4.