*Edited*: I just realized that maybe I shouldn't ask for help in this thread. Should I make a new thread? Or this is fine? I'm very sorry if this is not an appropriate place to ask.
Still centered. I'm out of idea. What confuse me is that the first code, which does nothing to the source except take out ccs and javescript give a properly left-aligned result. Do you have any suggestion? I tried to make a recipe for another thai newspaper site and it does not have this problem at all. By the way, the table of content is properly left-aligned.
I flashed the firmware so that I get the default font that have thai characters. The result is not that good, unsurprisingly. There are four levels in thai writing system, and the reader just put the upper two at the same place. They also don't do a good job on where to begin a new line, but that maybe due to the converter rather than the reader itself since the same thing appear in calibre's viewer. Still readable though. I'm thinking about telling html2lrf to embedded a thai font if the recipe if actually share with others.
EDIT2: Ok, here's my best try. Since I doubt anyone will use it, it'll just post it here. Thanks kiklop74 for your help.
Code:
class AdvancedUserRecipe1234529365(BasicNewsRecipe):
title = u'Manager Online'
oldest_article = 7
max_articles_per_feed = 100
encoding = 'cp874'
no_stylesheets = True
use_embedded_content = False
remove_javascript = True
remove_tags = [dict(name='td', attrs={'align':'right'})]
remove_tags = [dict(name='td', attrs={'align':'left'})]
html2lrf_options = ['--ignore-tables']
html2epub_options = 'linearize_tables = True'
feeds = [
(u'การเมือง', u'http://www.manager.co.th/RSS/Politics/Politics.xml'),
(u'กีฬา', u'http://www.manager.co.th/RSS/Sport/Sport.xml'),
(u'อาชญากรรมและกระบวนการยุติธรรม', u'http://www.manager.co.th/RSS/Crime/Crime.xml'),
(u'ภูมิภาค', u'http://www.manager.co.th/RSS/Local/Local.xml'),
(u'คุณภาพชีวิต', u'http://www.manager.co.th/RSS/QOL/QOL.xml'),
(u'เศรษฐกิจ', u'http://www.manager.co.th/RSS/Business/Business.xml'),
(u'เกม', u'http://www.manager.co.th/RSS/Game/Game.xml'),
(u'วิทยาศาสตร์', u'http://www.manager.co.th/RSS/Science/Science.xml'),
(u'ชีวิตในเมือง', u'http://www.manager.co.th/RSS/Metrolife/Metrolife.xml'),
(u'ครอบครัว', u'http://www.manager.co.th/RSS/Family/Family.xml'),
(u'ชีวิตในรั้วมหาลัย', u'http://www.manager.co.th/RSS/Campus/Campus.xml'),
(u'บังเทิง', u'http://www.manager.co.th/RSS/Entertainment/Entertainment.xml'),
(u'ผู้จัดกวน', u'http://manager.co.th/rss/getRSS.aspx?browsenewsid=1052'),
(u'ธรรมะ - ผู้จัดการ', u'http://manager.co.th/rss/getRSS.aspx?browsenewsid=8101&sourcenewsid=0'),
(u'ธรรมะ - ทั่วไป', u'http://manager.co.th/rss/getRSS.aspx?browsenewsid=8100&sourcenewsid=0')
]
def print_version(self, url):
return url.replace('http://www.manager.co.th/asp-bin/mgrview.aspx?', 'http://www.manager.co.th/asp-bin/PrintNews.aspx?')
If anyone actually try to use this, keep in mind that for sony reader you'll need to flash the firmware so that the default font has thai characters. I use Leelawadee.