05-08-2012, 03:41 AM | #1 |
Member
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
|
How to correct word breakage in ePUB (Tamil font embedded)
Could you help me correct word breakage in an ePUB with an embedded Tamil font?
My problem: I created an ePUB with the Tamil font "SHREE-TAM-0800.TTF" embedded and loaded it in iBooks on an iPad. It displays well, but some words are broken at the end of a line where they should not be, which makes for a bad reading experience (see the attached screenshots). How can I solve this issue? Thanks in advance for any suggestions and help. |
05-08-2012, 04:36 AM | #2 |
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Since Tamil is a relatively rare language, it would help if you:
- posted a short ePub excerpt
- clearly indicated in a screen capture where unwanted line breaks occur and where they should occur
Try the following:
- open the ePub in Sigil, ADE and other ePub readers
- double-check the language metadata in Sigil or your authoring tool
- check the validity and well-formedness of the ePub |
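The last two checks can be roughly scripted. The following is only a sketch, not a replacement for a real ePub validator: it opens the ePub as a ZIP archive, tries to parse every .opf/.xhtml/.html file with a strict XML parser, and collects the `dc:language` values from the package file. The function name `check_epub` is made up for this example, and note that this parser also reports named entities such as `&nbsp;` as errors because it does not load the XHTML DTD.

```python
import zipfile
import xml.etree.ElementTree as ET

# Dublin Core namespace used for <dc:language> in the OPF package file.
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def check_epub(source):
    """Return (languages, problems) for an ePub given as a path or file object.

    languages: list of dc:language values found in .opf files.
    problems:  list of (file name, parser message) for files that are
               not well-formed XML according to this strict parser.
    """
    languages = []
    problems = []
    with zipfile.ZipFile(source) as book:
        for name in book.namelist():
            lower = name.lower()
            if not lower.endswith((".opf", ".xhtml", ".html", ".htm")):
                continue
            try:
                root = ET.fromstring(book.read(name))
            except ET.ParseError as err:
                problems.append((name, str(err)))
                continue
            if lower.endswith(".opf"):
                for element in root.iter(DC_NS + "language"):
                    languages.append((element.text or "").strip())
    return languages, problems
```

For a Tamil book one would expect `check_epub("sample.epub")` (a hypothetical file name) to return `ta` among the languages and an empty problem list.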
05-08-2012, 04:48 AM | #3 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Turn off hyphenation in iBooks.
|
05-08-2012, 09:17 AM | #4 | |
Member
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
|
How to correct unwanted word/line break in ePUB (Tamil font embedded)
I have tried all the suggestions you mentioned, but the unwanted word/line breaks still occur.
I have also attached a sample ePUB for your reference.
|
|
05-08-2012, 12:40 PM | #5 | |
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
However, the ePub standard requires all source files to be encoded as UTF-8 or UTF-16. You'll have to convert your source files to Unicode and embed a Unicode-compatible Tamil font. |
|
05-09-2012, 07:47 AM | #6 | |
Member
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
|
Thanks for the reply.
I have updated the encoding declaration to Unicode and added the language declaration in the .xhtml file, but the unwanted word breaks remain (updated sample attached for your reference). I have also attached screenshots showing examples of correct and unwanted word breaks. It would help me a lot if you could describe how to convert the source files to Unicode, or point me to a website that explains it. Once again, thank you so much for your reply.
|
|
05-09-2012, 08:30 AM | #7 |
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Simply adding encoding="utf-8" to the .html file does not work. You'll need to actually convert the .html file to Unicode.
There seem to be several different Tamil code pages in use; you'll have to find out the encoding of your .html file and then use a converter to convert it to Unicode. Simply google for a Tamil Unicode converter and Tamil Unicode fonts and pick one that works. Alternatively, save your source file as Unicode with your word processor/editor. You could also try copying the text to the clipboard, pasting it into BabelPad and then saving it as a Unicode text file.
BTW, even if you manage to convert your source file to Unicode, there's no guarantee that this will solve your problem. I wouldn't bet on ADE, because the current version is pretty limited when it comes to non-Latin alphabets. However, there's a good chance that a properly encoded Tamil ePub with Unicode .html source files might work at least on the iPad.
Last edited by Doitsu; 05-09-2012 at 09:25 AM. |
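One way to see why the declaration alone is not enough: text written for a legacy Tamil font is stored as Latin letters and punctuation that only look like Tamil when that font is applied, so the file can be perfectly valid UTF-8 and still contain no Tamil characters at all. The sketch below, with the made-up helper name `tamil_share`, measures what fraction of the letters and combining marks in a string fall inside the Unicode Tamil block (U+0B80 to U+0BFF). It is a rough heuristic, and it assumes the markup tags have already been stripped from the text being checked.

```python
import unicodedata

# Boundaries of the Unicode "Tamil" block.
TAMIL_START, TAMIL_END = 0x0B80, 0x0BFF


def tamil_share(text):
    """Fraction of letters and combining marks in `text` inside the Tamil block.

    Returns 0.0 for text without any letters or marks.
    """
    # Keep only letters (category L*) and marks (category M*); digits,
    # punctuation and whitespace say nothing about the script in use.
    relevant = [ch for ch in text if unicodedata.category(ch)[0] in ("L", "M")]
    if not relevant:
        return 0.0
    tamil = sum(1 for ch in relevant if TAMIL_START <= ord(ch) <= TAMIL_END)
    return tamil / len(relevant)
```

A value close to 1.0 suggests real Unicode Tamil; a value close to 0.0 for a book that displays as Tamil suggests font-encoded text that still needs to go through a converter.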
05-09-2012, 08:30 AM | #8 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Search the code for the HTML entity &shy; (the soft hyphen) and remove every occurrence.
Last edited by Toxaris; 05-09-2012 at 08:33 AM. |
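If there are many files, the search-and-remove step can be scripted. This is a minimal sketch with a made-up function name, `strip_soft_hyphens`; it removes the named entity, its decimal and hexadecimal numeric forms, and the literal soft hyphen character U+00AD from a string of markup.

```python
import re

# Matches &shy;, &#173; (decimal), &#xAD; (hexadecimal, any case)
# and the literal soft hyphen character U+00AD.
SOFT_HYPHEN = re.compile("&shy;|&#0*173;|&#[xX]0*[aA][dD];|\u00ad")


def strip_soft_hyphens(markup):
    """Return `markup` with every soft hyphen removed."""
    return SOFT_HYPHEN.sub("", markup)
```

Run it on a copy of each .xhtml file's text and keep the originals until the result has been checked on the device.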
06-18-2012, 08:18 AM | #9 |
Member
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
|
Hi Doitsu, you are right.
As you suggested, I have found a solution: converting the actual content (i.e. the .html/.xhtml files) to Unicode text with a Unicode converter solves the unwanted word breaks. Note: a Unicode-encoded ePub does not need embedded fonts and looks good on all devices.
Last edited by Raja1205; 06-18-2012 at 08:21 AM. |
06-18-2012, 08:53 AM | #10 |
Color me gone
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
|
Thanks, Raja1205, for posting your solution. I didn't even know such converters existed, but when I Google I see many of them.
|
06-19-2012, 10:45 AM | #11 |
Junior Member
Posts: 8
Karma: 10
Join Date: Jun 2012
Device: nook
|
Hi Raja, can you explain a bit more about your experience here? I have also been working on this and have had some success.
I converted HTML pages (from Project Madurai) to ePub using Sigil and then embedded the SUNDARAM Tamil font. This works great on my Nook. I also tried converting a Tamil PDF to ePub using calibre, but the fonts are messed up and I can no longer see the font on my system. I embedded the S08000F0.TTF font that you used in your test ePub, but without success. It would be great if you could explain this issue in a bit more detail. |
|