06-23-2010, 09:46 AM | #1 |
Junior Member
Posts: 5
Karma: 10
Join Date: Sep 2009
Device: noe
|
ASCII transliteration problems (spanish)
I have been looking around and still cannot fix the issue. I saw there was a ticket (#4782) but I had no help from it. The question is: I have a HTML doc, spanish text, perfectly well seen in Firefox using ISO-8859-1, 8859-3 or 8859-15 or cp1252 (what was supposed to fix the ticket previously mentioned). I set the input in any of those codifications and I cannot get it working properly. When formatting it into another format (btw, I can`t get these docs to generate a proper epub file, conversion always hangs at that point, so I use lrf file, instead), all I get is no "ñ", no "ó", but ns and os and so on. If I force to use iso or cp as mentioned before in the output section, I have strange codes instead. I upload a file just for anybody here to try, if wanted.
I have tried the debug thing and it does not transliterale properly in the input section, so the problem seems to me that it it further up, in the process of converting html to zip. Any help or hint? Thanks |
06-23-2010, 09:53 AM | #2 |
creator of calibre
Posts: 43,858
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Read point number 2 in http://calibre-ebook.com/user_manual...r-smart-quotes
|
Advert | |
|
06-25-2010, 03:57 AM | #3 |
Junior Member
Posts: 5
Karma: 10
Join Date: Sep 2009
Device: noe
|
Thanks for your answer. I have had already done what it is said in that point, out of the embedding characters, as I have had no problem with spanish characters at all before. So, to be clear, I changed the input codification to match the one in which the document showed perfect in Firefox (let's say, ISO 8859-1) and tried with the other codifications that worked in firefox too. The results where always more or less the same. If I left blank the output look and feel in the conversion window, I had no special characters. If I put any other codification there, the resulting file had strange codes, varying depending of codifications. BTW, I have latest version of Calibre for MacOSX. Would it be helpful for anybody if I upload the original file and the resulting converted files?
FJ |
06-25-2010, 09:25 AM | #4 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
Assuming that you got that far, viewing the zip file in Calibre's viewer would have shown the correct characters. You would not have ever entered anything into the look and feel of the conversion box, as you would not yet have done a conversion (although Calibre would have done one during the importing process - Calibre always stores html/zips as utf-8). It sounds like you tried the above, but never saw correct characters. To me that sounds like you specified the wrong input encoding or there is a bug. To check, you could try to see what binary code is in your orignal file for a "problem" character (use a hex editor). Then see if that binary code specifies the character you think it should specify in the encoding you have advised Calibre is used for that file (find a table for that encoding). If it matches, there's a bug in Calibre's utf-8 conversion. If not, then you've got the wrong encoding specified and you need to find the right one. |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
transliteration in file&directory names | vadim_k | Calibre | 4 | 11-14-2011 07:04 AM |
Non-ascii symbols on Story | James_Wilde | iRiver Story | 13 | 09-12-2010 05:15 AM |
non-ascii in reader different to library | enriquep | Calibre | 0 | 11-30-2009 05:41 PM |
weird ascii character | p3aul | Calibre | 7 | 10-14-2009 05:10 PM |
Ascii file | ProDigit | Lounge | 1 | 12-25-2008 10:08 PM |