Old 07-22-2011, 05:42 PM   #1
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
foreign characters not showing up?

I have some epub formatted books with foreign characters. The texts look fine in Calibre, and also if read on my iPod Touch, but on my Sony 950 all of the foreign characters show up as question marks, which becomes quite distracting when I am trying to read. I'm trying to figure out if there is a way to fix this? Thanks in advance to anyone who can share the solution.
Old 07-22-2011, 06:07 PM   #2
Toxaris
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
The Sony should not have any issues with foreign characters. Are you sure that the encoding is UTF-8?
Old 07-22-2011, 06:15 PM   #3
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
I don't know, as I received the file already formatted. What tool would I use to verify this? Is there a way for me to convert it to UTF-8 if necessary?
Old 07-23-2011, 01:20 AM   #4
roger64
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
Quote:
Originally Posted by sovre View Post
I don't know, as I received the file already formatted. What tool would I use to verify this?
On Linux, you can use the command-line tool file, which is intended to determine a file's type (see man file).
Change to the directory containing the file (using the cd command), then run the following command:
Code:
file name_of_the_file
http://linux.die.net/man/1/file

There is one online converter using this command
http://www.cometdocs.com/file.htm
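
For readers who are not on Linux, a rough equivalent of the check above can be scripted. This is only a minimal Python sketch and not what the file command does internally: it opens the epub as a zip archive and reports which text members fail to decode as strict UTF-8. The function name and the list of extensions are assumptions made for illustration.

```python
import zipfile

# Extensions treated as text content; this list is an assumption for illustration.
TEXT_EXTENSIONS = (".html", ".xhtml", ".htm", ".css", ".opf", ".ncx", ".xml")


def non_utf8_members(epub_path):
    """Return the names of text members in the epub that are not valid UTF-8."""
    bad = []
    with zipfile.ZipFile(epub_path) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(TEXT_EXTENSIONS):
                continue  # skip images, fonts and other binary members
            data = archive.read(name)
            try:
                data.decode("utf-8")  # strict decoding raises on invalid bytes
            except UnicodeDecodeError:
                bad.append(name)
    return bad
```

An empty list only means every text file decodes as UTF-8. It does not show that the reader's font has glyphs for the characters.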

Last edited by roger64; 07-23-2011 at 01:28 AM.
Old 07-23-2011, 04:21 AM   #5
charleski
Wizard
Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
The most likely answer is that your ePub doesn't have an embedded font. ADE's default glyph set is very narrow and only really suitable for books in English.

If you can edit the ePub, then you can embed a font yourself. Or alternatively you could load kartu's PRS+ firmware into your 950 (see the Sony Dev sub-forum on this board), which allows you to upload a default font of your choice to the reader.
Old 07-23-2011, 10:40 AM   #6
Toxaris
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
Sorry, I don't agree. I have a character-test epub which contains all the characters used in Western European languages. All of those characters display on my old PRS-300 without a problem. I haven't tried Eastern European or Cyrillic characters, so the problem could lie there.

For me embedding a font would be the last option, due to the extra size and possible copyright issues.

Sovre, what characters are you trying to display? Perhaps then we can identify the problem better.
Old 07-23-2011, 03:06 PM   #7
charleski
Wizard
Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
The default font covers enough of the Latin-1 Supplement block to get by in many Western European languages, but there are gaps and it has no support for Latin Extended-A.
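
To see which Unicode blocks a given text actually needs, its characters can be bucketed by code point. This is a minimal Python sketch. The block table covers only the ranges mentioned above plus a few neighbours, including Latin Extended Additional, which holds dotted letters such as ṭ and ṃ that are common in romanized Pali and Sanskrit. The function name is illustrative.

```python
from collections import defaultdict

# (first, last, block name), following the published Unicode block ranges.
BLOCKS = [
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0100, 0x017F, "Latin Extended-A"),
    (0x0180, 0x024F, "Latin Extended-B"),
    (0x0300, 0x036F, "Combining Diacritical Marks"),
    (0x1E00, 0x1EFF, "Latin Extended Additional"),
]


def non_ascii_by_block(text):
    """Group the distinct non-ASCII characters of text by Unicode block."""
    found = defaultdict(set)
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            continue  # plain ASCII is never the problem
        for first, last, name in BLOCKS:
            if first <= cp <= last:
                found[name].add(ch)
                break
        else:
            found["Other"].add(ch)  # anything outside the small table above
    return {name: sorted(chars) for name, chars in found.items()}
```

Running this over a paragraph copied from the book shows at a glance whether the missing letters fall outside the blocks the default font is said to cover.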
Old 07-23-2011, 08:47 PM   #8
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
I am trying to read an English translation of a Pali text. The proper names in Sanskrit which have diacritical marks are the ones which cannot be displayed.

I have PRS+ loaded as the firmware for my reader, but I do not know how to upload a new font.

If you could explain how to do this or send me a link to instructions, I'd appreciate it!

I notice in the PRS+ settings there is something called "User EPUB style (CSS file)" under Book Viewer Settings, but I'm not sure what this means or what its intended use is.

Last edited by sovre; 07-24-2011 at 09:42 PM.
Old 07-24-2011, 05:45 AM   #9
charleski
Wizard
Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
Download the files I've attached. Hook your reader up to the computer via USB and open the READER drive. Make a new directory called fonts in the top level of that drive, then open the CharisSIL zip file and copy the four .ttf files it contains into that directory. Go back to the top level of the READER drive, and go down to database/system/PRSPlus/epub . Copy the CharisSIL.css file into that directory. Close the Sony Reader app if it auto-launched, and then eject the READER drive from the computer and disconnect it when it says it's safe to do so.

If you now go to User EPUB Style you'll have an option for CharisSIL. Select that and the font will be applied to every book you open that doesn't have a font embedded itself. If you already have the book open that you want to read, open a new one, then go back to the home screen and open the book you're reading.
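
The copy steps above can also be scripted. This is a minimal Python sketch under stated assumptions: the caller supplies the mount point of the READER drive, the function name is made up for illustration, and the css file has already been extracted from its zip. Only the fonts directory and the database/system/PRSPlus/epub path come from the instructions above.

```python
import shutil
import zipfile
from pathlib import Path


def install_user_font(reader_root, font_zip, css_file):
    """Copy the .ttf files from font_zip and a css file onto the READER drive."""
    root = Path(reader_root)
    fonts_dir = root / "fonts"  # new directory at the top level of the drive
    css_dir = root / "database" / "system" / "PRSPlus" / "epub"
    fonts_dir.mkdir(exist_ok=True)
    # With PRS+ installed this directory should already exist; create it if not.
    css_dir.mkdir(parents=True, exist_ok=True)

    copied = []
    with zipfile.ZipFile(font_zip) as archive:
        for info in archive.infolist():
            if info.is_dir() or not info.filename.lower().endswith(".ttf"):
                continue
            # Flatten any folder structure inside the zip.
            target = fonts_dir / Path(info.filename).name
            with archive.open(info) as src, open(target, "wb") as dst:
                shutil.copyfileobj(src, dst)
            copied.append(target.name)

    shutil.copy(css_file, css_dir / Path(css_file).name)
    return sorted(copied)
```

The reader still has to be ejected safely afterwards, as described above.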
Attached Files
File Type: zip CharisSIL4.106.zip (2.04 MB, 233 views)
File Type: zip CharisSIL.css.zip (262 Bytes, 296 views)
Old 07-24-2011, 09:23 PM   #10
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
Hi Charleski,

Thanks for taking the time to give me these clear instructions. I followed all you said, and now have that font selected as my epub style.

Only one problem: I am still getting a "?" for the letters which have diacritical marks in that text.

Oddly enough, I did notice this: the font has changed, but it changed in texts which were fine and not giving me a problem. And the texts which use the new font do not appear to be as readable as those using the old font. The paragraphs look somehow "clumped together," instead of there being a bit of breathing space between the lines as before.

Last edited by sovre; 07-25-2011 at 12:43 AM.
Old 07-24-2011, 11:45 PM   #11
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
While reading another thread on another subject I got an idea about this problem.

And then I tried this:

I copied the epub file and pasted it into MS Word, and then saved it as an rtf file. I then converted the rtf file to epub using calibre and sent it once again to my Sony.

Now all words show up fine, with proper accenting. No more question marks.

But the solution is not ideal because some of the formatting was lost, and I have footnote numbers showing up in odd places and causing paragraph breaks where there should be none.

Last edited by sovre; 07-25-2011 at 12:42 AM.
Old 07-25-2011, 06:59 AM   #12
charleski
Wizard
Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
Quote:
Originally Posted by sovre View Post
Only one problem: I am still getting a "?" for the letters which have diacritical marks in that text.
Quote:
Originally Posted by sovre View Post
I copied the epub file and pasted it into MS Word, and then saved it as an rtf file. I then converted the rtf file to epub using calibre and sent it once again to my Sony.

Now all words show up fine, with proper accenting. No more question marks.
This suggests that the problem actually lies in the encoding, as others mentioned earlier in the thread. As you've discovered, calibre's automatic conversion can easily break if there's any problem in the source. Unzip the epub and extract one of the html files, then open that in Notepad++. Can you still see the proper diacritics in the text? Clicking on the Encoding menu, does it say 'Encode in UTF-8 without BOM'? What is the first line of the file? It should be something like
<?xml version="1.0" encoding="utf-8" standalone="no"?>
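
The same three checks (BOM, declared encoding, and whether the bytes really are UTF-8) can be done without a text editor. This is a minimal Python sketch that works on the raw bytes of one extracted html file. The function name and the returned dictionary layout are assumptions for illustration.

```python
import codecs
import re

# Matches the encoding pseudo-attribute of an XML declaration, either quote style.
DECLARATION = re.compile(rb"""^<\?xml[^>]*encoding=["']([A-Za-z0-9._-]+)["']""")


def describe_encoding(data):
    """Report BOM presence, declared encoding, and strict UTF-8 validity."""
    has_bom = data.startswith(codecs.BOM_UTF8)
    body = data[len(codecs.BOM_UTF8):] if has_bom else data
    match = DECLARATION.match(body)
    declared = match.group(1).decode("ascii").lower() if match else None
    try:
        body.decode("utf-8")  # strict decoding raises on invalid bytes
        valid = True
    except UnicodeDecodeError:
        valid = False
    return {"bom": has_bom, "declared": declared, "valid_utf8": valid}
```

A file that declares utf-8 but reports valid_utf8 as False was saved in some other encoding, which would point to the kind of encoding problem discussed here.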

Quote:
Originally Posted by sovre View Post
Oddly enough, I did notice this: the font has changed, but it changed in texts which were fine and not giving me a problem. And the texts which use the new font do not appear to be as readable as those using the old font. The paragraphs look somehow "clumped together," instead of there being a bit of breathing space between the lines as before.
Yeah, this is a problem with the font. Charis supports an extremely wide range of glyphs, has bold and italic variants, and is free, so I could post it here. But it was designed to be compact, and therefore doesn't have its height metrics set properly (they've released an even more compact version, which is worse). If you look around at the fonts you have installed on your system you may well find one that will work better, just copy those over (you'll need normal, italic, bold, and bold-italic variants, which will be different files) and edit the file names in the css file.
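
For anyone swapping in a different font, the css file is small enough to generate. The following is a minimal Python sketch and not the contents of the attached CharisSIL.css: the four file names, the res:///Data/fonts/ URL prefix, and the body rule are all assumptions for illustration, and they should be checked against the attached file before use.

```python
# Each variant is (font-weight, font-style, file name); the names are hypothetical.
VARIANTS = [
    ("normal", "normal", "MyFont-Regular.ttf"),
    ("normal", "italic", "MyFont-Italic.ttf"),
    ("bold", "normal", "MyFont-Bold.ttf"),
    ("bold", "italic", "MyFont-BoldItalic.ttf"),
]


def build_user_css(family, variants=VARIANTS, prefix="res:///Data/fonts/"):
    """Return CSS text with one @font-face rule per variant plus a body rule."""
    rules = []
    for weight, style, filename in variants:
        rules.append(
            "@font-face {\n"
            f'  font-family: "{family}";\n'
            f"  font-weight: {weight};\n"
            f"  font-style: {style};\n"
            f"  src: url({prefix}{filename});\n"
            "}"
        )
    # Assumed rule that applies the family to the whole book.
    rules.append(f'body {{\n  font-family: "{family}", serif;\n}}')
    return "\n\n".join(rules) + "\n"
```

Writing the result of build_user_css("MyFont") to a .css file gives a starting template. The file names in VARIANTS need to match the font files actually copied to the reader.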
Old 07-25-2011, 02:52 PM   #13
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
OK, I downloaded Notepad++.

Yes, it says "Encode in UTF-8 without BOM" under the Encoding menu.

The first line is: <?xml version='1.0' encoding='utf-8'?>

But I don't know how to access the text itself using this text editor. I only seem to be able to see information about the encoding and fonts. Opening the text in another text viewer I have which displays the entire text, I can see the diacritical marks fine.

Wouldn't this indicate the encoding of the text is ok? Or is there a problem with it?


Also, are there any Unicode fonts you recommend as being clear and readable on the Sony Reader? And is there some kind of basic template I can use for creating a css file to go with the font? (I've never done this before and don't know how!)

Thanks.

Last edited by sovre; 07-25-2011 at 03:12 PM.
Old 07-25-2011, 04:10 PM   #14
Toxaris
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
Can you give an example of the characters you want to display that show up as a question mark? Give a few examples, if you like: some simple, some exotic (your interpretation, of course). Perhaps we can give better advice then. I could add them to my character-test epub and see what the result would be on my Sony.
Old 07-25-2011, 08:48 PM   #15
sovre
Zealot
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
I've uploaded the file--I think that will make it easier for you to diagnose the problem, because you can see how it displays on your own Reader.
Attached Files
File Type: rar The Jataka Vol. 1.rar (554.9 KB, 160 views)
All times are GMT -4.