View Single Post
Old 03-05-2011, 06:13 PM   #493
DDHarriman
Guru
DDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura aboutDDHarriman has a spectacular aura about
 
Posts: 860
Karma: 4380
Join Date: Feb 2008
Location: Almada, Portugal
Device: Cybook Gen3, Sony PRS 505, Kindle DXG and Samsung Galaxy Note
Hi FethryDuck

First of all, thanks for the effort on testing the Portuguese (even if the example you have chosen is 19th century Brazilian).

You are completely correct, in a sense…

I have tested what you say, and yes, one gets all the Portuguese diacritics (accented words and the ç) if… it’s a utf-8 txt file.

The same text in epub format, even if made from a converted html utf-8 file via Calibre, gives the crazy 2 font presentation my Spanish friends presented in the last posts - one font for the main text one for the diacritics with an extra space before and after each diacritic -, and no ã and ç diacritic’s - they just disappear from the word!
The same thing happens if the text is a word file saved in utf-8.

One more note: if the text format is, per example, windows western, the result is always the same as I describe for the epub, even if it’s a txt.

So conclusion: if it’s a plain txt saved in UTF-8 format I can get all the diacritics, no right justification and crazy cuts in the words when they reach the right margin.
The rest, well not even all the diacritics, and the formatting is the same crap.

Best regards, and once more a great thanks on all the work you have put into this so far,

Last edited by DDHarriman; 03-05-2011 at 07:35 PM.
DDHarriman is offline   Reply With Quote