Old 10-04-2011, 01:44 PM   #1
Dizzley
Junior Member
Posts: 3
Karma: 10
Join Date: Oct 2011
Device: Amazon Kindle 3
WordLive daily bible reading progress

I'm writing a Calibre recipe to download the daily Bible reading from WordLive (UK). I'm glad to say that there are RSS feeds for each of the different daily outputs.

The basic feed is at http://feeds.feedburner.com/org/ELCH?format=xml.

This seems a good start, and I'm now tweaking the recipe. Later I will add a subscription login so the user can set preferences.

Right now I have a problem with bible verse numbers: Calibre sees the first few as header numbers. They are actually in sup tags, typically:
Code:
<sup class="versenum" id="en-TNIV-25582">1</sup>
<p> Now the tax collectors... </p>
How can I get these verse numbers to pass through untouched by Calibre?
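
One workaround I'm considering, as a rough sketch only: rewrite each versenum sup tag into bracketed plain text before Calibre gets to interpret it. BasicNewsRecipe has a preprocess_regexps attribute that takes (compiled pattern, replacement function) pairs, so the pattern and function below could be plugged in there. The names VERSENUM_SUP, bracket_verse_number and rewrite_verse_numbers are my own, and I'm only assuming that this hook runs on embedded feed content and that it stops the numbers being treated as headers. The snippet just exercises the regex on its own, outside Calibre:

```python
import re

# Matches <sup class="versenum" ...>N</sup>; extra attributes such as id are allowed.
VERSENUM_SUP = re.compile(
    r'<sup\b[^>]*\bclass="versenum"[^>]*>\s*(\d+)\s*</sup>',
    re.IGNORECASE,
)


def bracket_verse_number(match):
    """Replace a matched verse-number tag with plain text such as '[1] '."""
    return '[%s] ' % match.group(1)


def rewrite_verse_numbers(html):
    """Return html with every versenum sup tag turned into bracketed text."""
    return VERSENUM_SUP.sub(bracket_verse_number, html)


# In a recipe this would (hypothetically) become:
#     preprocess_regexps = [(VERSENUM_SUP, bracket_verse_number)]

if __name__ == '__main__':
    sample = ('<sup class="versenum" id="en-TNIV-25582">1</sup>\n'
              '<p> Now the tax collectors... </p>')
    print(rewrite_verse_numbers(sample))
```

This loses the superscript styling, so it would only be a stopgap until I find a way to keep the sup tags themselves.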

Here's my current recipe:
Spoiler:
Code:
from calibre.web.feeds.news import BasicNewsRecipe

class WordLiveClassicRecipe(BasicNewsRecipe):
    title          = u'WordLive'
    # Duplicate settings removed; these are the values that were taking effect.
    oldest_article = 28
    max_articles_per_feed = 100
    auto_cleanup = True
    # The feed items carry the full reading, so no article pages are fetched.
    use_embedded_content = True
    encoding = 'utf8'
    remove_empty_feeds = True
#   no_stylesheets = True
    remove_javascript = True
#     keep_only_tags = [{'class':'regularitem'}]
    feeds          = [(u'WordLive Classic', u'http://feeds.feedburner.com/org/ELCH?format=xml')]

Last edited by Dizzley; 10-04-2011 at 01:44 PM. Reason: minor brainfade in original