It is still centered for some reason. I'm beginning to think that html2lrf may simply do that for Thai by default. Is there a way to force left alignment in the HTML before the converter processes it? I am guessing that might help.
The EPUB and MOBI outputs always crash both calibre's viewer and the reader. I can only get as far as the table of contents for EPUB, and only the first blank page for MOBI.
And thank you for your help, kiklop74.
Here is the current code (the result is still center-aligned).
Code:
from calibre.web.feeds.news import BasicNewsRecipe

class AdvancedUserRecipe1234529365(BasicNewsRecipe):
    title = u'Manager Online'
    oldest_article = 7
    max_articles_per_feed = 100
    encoding = 'cp874'  # Thai (Windows) code page used by the site
    no_stylesheets = True
    use_embedded_content = False
    remove_javascript = True
    html2lrf_options = ['--ignore-tables']
    html2epub_options = 'linearize_tables = True'

    # Keep only the table cell that holds the article body
    keep_only_tags = [dict(name='td', attrs={'class':'body'})]

    feeds = [
        (u'การเมือง', u'http://www.manager.co.th/RSS/Politics/Politics.xml')
    ]

    def print_version(self, url):
        # Fetch the printer-friendly page instead of the normal article view
        return url.replace('http://www.manager.co.th/asp-bin/mgrview.aspx?', 'http://www.manager.co.th/asp-bin/PrintNews.aspx?')

    def preprocess_html(self, soup):
        # Strip inline styles and align attributes before conversion
        for item in soup.findAll(style=True):
            del item['style']
        for item in soup.findAll(align=True):
            del item['align']
        return soup
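
One possible way to force left alignment before conversion, as a rough and untested-in-calibre sketch: the recipe above only deletes `style` and `align` attributes, so any `<center>` elements in the page would survive. Whether the site actually uses `<center>` is an assumption, and `force_left_align` is a hypothetical standalone helper, not part of calibre. It works on the raw HTML string with plain regular expressions.

```python
import re

# Hypothetical helper, not a calibre API. Assumes the centering comes
# from <center> elements or align="center" attributes in the source page.
CENTER_OPEN = re.compile(r'<center\b[^>]*>', re.IGNORECASE)
CENTER_CLOSE = re.compile(r'</center\s*>', re.IGNORECASE)
ALIGN_CENTER = re.compile(r'\balign\s*=\s*(["\']?)center\1', re.IGNORECASE)

def force_left_align(html):
    """Return html with centering markup rewritten to explicit left alignment."""
    # Replace <center>...</center> with a left-aligned <div>
    html = CENTER_OPEN.sub('<div align="left">', html)
    html = CENTER_CLOSE.sub('</div>', html)
    # Turn align=center (quoted or unquoted) into align="left";
    # valign attributes are left alone because of the word boundary
    html = ALIGN_CENTER.sub('align="left"', html)
    return html

if __name__ == '__main__':
    print(force_left_align('<center><p align=center>text</p></center>'))
```

Regex rewriting of HTML is fragile (it would also touch matching text inside article bodies), so this is only meant to test whether explicit left alignment changes the html2lrf output. If it does not, the centering is probably coming from the converter rather than from the page markup.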