Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 05-08-2012, 03:41 AM   #1
Raja1205
Member
Raja1205 began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
Thumbs up How to correct word breakage in ePUB (Tamil font embedded)

Could you help me, how to correct word breakage in ePUB (Tamil font embedded).

My problem is:
I have created one ePUB (TAMIL font "SHREE-TAM-0800.TTF" embedded) and loaded in Ipad ibooks. It displays well but some word unwantedly broken and gives bad reading experience at the end of each line. (Refer attached screenshots)

So kindly help me how can i solve this issue.


Thanks in advance for suggestions and help.
Attached Thumbnails
Click image for larger version

Name:	IMG_0104.png
Views:	708
Size:	221.7 KB
ID:	86187   Click image for larger version

Name:	IMG_0101.png
Views:	591
Size:	232.5 KB
ID:	86188  
Raja1205 is offline   Reply With Quote
Advert
Old 05-08-2012, 04:36 AM   #2
Doitsu
Wizard
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 3,700
Karma: 9800372
Join Date: Dec 2010
Device: Kindle PW2
Since Tamil is a relatively rare language. It'd help, if you:

- posted a short ePub excerpt
- clearly indicated in a screen capture where unwanted line-breaks occur and where they should occur

Try the following:

- open the ePub in Sigil and ADE and other ePub readers
- double-check the language metadata in Sigil or your authoring tool
- check the validity and well-formedness of the ePub
Doitsu is offline   Reply With Quote
Old 05-08-2012, 04:48 AM   #3
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,338
Karma: 115718815
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-300, PRS-T1
Turn of hyphenation in iBooks.
Toxaris is offline   Reply With Quote
Old 05-08-2012, 09:17 AM   #4
Raja1205
Member
Raja1205 began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
How to correct unwanted word/line break in ePUB (Tamil font embedded)

I have tried all the suggestions which you mentioned but still unwanted word/line break occurs.

Also attached sample ePUB for your reference.


Quote:
Originally Posted by Doitsu View Post
Since Tamil is a relatively rare language. It'd help, if you:

- posted a short ePub excerpt
- clearly indicated in a screen capture where unwanted line-breaks occur and where they should occur

Try the following:

- open the ePub in Sigil and ADE and other ePub readers
- double-check the language metadata in Sigil or your authoring tool
- check the validity and well-formedness of the ePub
Attached Files
File Type: epub test.epub (343.3 KB, 226 views)
Raja1205 is offline   Reply With Quote
Old 05-08-2012, 12:40 PM   #5
Doitsu
Wizard
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 3,700
Karma: 9800372
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by Raja1205 View Post
I have tried all the suggestions which you mentioned but still unwanted word/line break occurs.

Also attached sample ePUB for your reference.
I had a quick look at your ePub and noticed that the source .html file is not Unicode encoded and moreover lacks a language declaration.
However, the ePub standard requires all source files to be encoded as UTF-8 or UTF-16 files.
You'll have to convert your source files to Unicode and embed a Unicode compatible Tamil font.
Doitsu is offline   Reply With Quote
Advert
Old 05-09-2012, 07:47 AM   #6
Raja1205
Member
Raja1205 began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
Thanks for the reply.

I have updated Unicode encoding & language declaration in .xhtml file but no favorable result for unwanted word break. (Attached updated sample for your reference)

I have also attached screenshots for correct & unwanted words breaks example.

It would be more helpful to me if you could provide procedures for source files to Unicode conversion or any website link.
Once again thank you so much for your reply.

Quote:
Originally Posted by Doitsu View Post
I had a quick look at your ePub and noticed that the source .html file is not Unicode encoded and moreover lacks a language declaration.
However, the ePub standard requires all source files to be encoded as UTF-8 or UTF-16 files.
You'll have to convert your source files to Unicode and embed a Unicode compatible Tamil font.
Attached Thumbnails
Click image for larger version

Name:	Correct word break in ADE for Tamil font embedded ePUB.jpg
Views:	438
Size:	123.0 KB
ID:	86233   Click image for larger version

Name:	Unwanted word break in ADE for Tamil font embedded ePUB.jpg
Views:	575
Size:	134.7 KB
ID:	86234  
Attached Files
File Type: epub test.epub (343.3 KB, 158 views)
Raja1205 is offline   Reply With Quote
Old 05-09-2012, 08:30 AM   #7
Doitsu
Wizard
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 3,700
Karma: 9800372
Join Date: Dec 2010
Device: Kindle PW2
Simply adding "encoding="utf-8"" to the .html file does not work. You'll need to actually convert the .html file to Unicode.
There seem to be several different Tamil code pages in use; you'll have to find out the encoding of your .html file and then use a converter to convert it to Unicode.
Simply google for Tamil Uncode converter and Tamil Unicode fonts and pick one that works.
Alternatively, save your source file as Unicode with your word processor/editor.
You could also try to copy the text to the clipboard, paste it into BabelPad and then save it as a Unicode text file.

BTW, even if you manage to convert your source file to Unicode there's no guarantee that this will solve your problem.
I wouldn't bet on ADE, because the current version is pretty limited when it comes to non-Latin alphabets. However, there's a good chance that a properly encoded Tamil ePub with Unicode .html source files might work at least on the iPad.

Last edited by Doitsu; 05-09-2012 at 09:25 AM.
Doitsu is offline   Reply With Quote
Old 05-09-2012, 08:30 AM   #8
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,338
Karma: 115718815
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-300, PRS-T1
Search for the HTML entity & shy; (remove space) in the code and remove those.

Last edited by Toxaris; 05-09-2012 at 08:33 AM.
Toxaris is offline   Reply With Quote
Old 06-18-2012, 08:18 AM   #9
Raja1205
Member
Raja1205 began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Apr 2012
Device: none
Hi Doitsu, you are right.

As you said, I have found some solution.

By changing the actual content (i.e .html/.xhtml) into UNICODE text using UNICODE converter we can solve this unwanted word break issues.

Note: Fonts not required for the UNICODE formatted ePub and looks good in all devices.

Last edited by Raja1205; 06-18-2012 at 08:21 AM.
Raja1205 is offline   Reply With Quote
Old 06-18-2012, 08:53 AM   #10
mrmikel
Color me gone
mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.
 
Posts: 2,086
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
Thanks, Raja1205 for posting your solution. I didn't even know such things existed, but as I Google I see many many of them.
mrmikel is offline   Reply With Quote
Old 06-19-2012, 10:45 AM   #11
chittu
Junior Member
chittu began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Jun 2012
Device: nook
Hi Raja, can you explain a bit more of your experience here. I was also working around on this, but I had some success

I converted html pages (from project madurai) to epub using sigil then I did embed with SUNDARAM tamil font. This works great on my nook.

I was trying to convert tamil pdf into epub using calibre, but the fonts are messed up and I can't see font any more on my system, still I did embed with S08000F0.TTF font that you have used on your test epub, but no success.

It'll be a great share if you can explain a bit brief on this issue.
chittu is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
epub in Tamil Suganthi General Discussions 3 12-24-2011 08:37 PM
Can't delete embedded font eosrose Sigil 1 08-16-2010 06:28 AM
Tamil font rendering in ePub(device: jetBook Lite) adreamer Sigil 8 08-05-2010 12:34 PM
problem with embedded font ericshliao EPUBReader 2 12-11-2009 10:04 AM
LRF with embedded font igorsk Sony Reader 8 10-26-2006 08:08 AM


All times are GMT -4. The time now is 12:43 AM.


MobileRead.com is a privately owned, operated and funded community.