07-29-2009, 12:26 PM | #1 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
French Language Books
I have been trying to convert html and pdf books in French into prc or ePub.
The conversions for html are better but there is still a problem with the "oe" and "à" and "è". The conversion from pdf is quite terrible - there seem to be line breaks after each accent and the accents are not on the letters but beside the letters. I have tried using the alias for CH in the look and feel to no avail. Could somebody please tell me what I need to put in the look and feel and how? Thanks for the help. |
07-29-2009, 12:28 PM | #2 |
creator of calibre
Posts: 43,778
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
For HTML conversions you may need to specify the correct input encoding, probably one of the following
cp1252 cp1251 latin1 utf-8 |
Advert | |
|
07-29-2009, 12:30 PM | #3 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Thanks for your help - how do I specify this - just putting it into the "Look and feel " ? What about the pdfs - is there anything I can do about these?
|
07-29-2009, 12:38 PM | #4 |
creator of calibre
Posts: 43,778
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Yes. As for PDF, not really.
|
07-29-2009, 01:13 PM | #5 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
I have tried all these options with no success - worse than when leaving it blank. I put them in the "Input character encoding" field in the "Look and Feel". Is this OK? Has anyone tried converting French language docs? Thanks for the help.
|
Advert | |
|
07-29-2009, 01:14 PM | #6 |
creator of calibre
Posts: 43,778
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
zip up the HTML before adding it to calibre (i.e. put it ina ZIP file and add the ZIP file to calibre). Then try those options.
|
07-29-2009, 01:23 PM | #7 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Thanks - this is exactly what I have done. Put the index file into calibre and it zipped up everything.
|
07-29-2009, 01:25 PM | #8 |
creator of calibre
Posts: 43,778
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
No I mean zip up all the files yourself first and then add the ersulting ZIP file to calibre
|
07-29-2009, 01:48 PM | #9 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Great - this worked for latin-1 except for "oe" but the "è and à" are OK.
Now just need a work-round for pdfs |
07-29-2009, 03:15 PM | #10 |
Member
Posts: 12
Karma: 10
Join Date: Apr 2009
Device: Asus-A696, Cool-er, Hanlin V5
|
I have been trying to convert rtf and html texts in Hungarian to epub. "ű" and "ő" are ok in the calibre ebook viewer but not in ADE and Cool-er.
|
07-29-2009, 03:16 PM | #11 |
frumious Bandersnatch
Posts: 7,514
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
|
07-30-2009, 07:13 PM | #12 |
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
I can fix this. There was a similar problem with German characters that have umlauts (e.g. ü). Can you attach or send me a PDF with the French accented characters so I can test my changes?
|
07-31-2009, 07:30 AM | #13 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Hi user_none
Glad to enclose the document. But perhaps it is better to send you the link: http://alysse.org/~thomas/autonomy_project/ The link to the pdf is at the bottom of the page. Bye |
07-31-2009, 07:32 AM | #14 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
The html converted well with latin-9 but calibre only recognised the full ISO in the look and feel.
The "oe" is still missing but perhaps this is due to a problem in the viewer or in the hanlin device. |
07-31-2009, 08:41 AM | #15 |
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
I've made a commit that cleans up French characters (e.g. à,û,é...) for PDF input. It should be in the next release. Let me know if I missed anything. Also, at least with conversion from PDF the oe character is displaying correctly in calibre's ebook-viewer.
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
3 french language sites with lots of e-books | Liviu_5 | Reading Recommendations | 13 | 08-28-2012 04:29 AM |
Do you read books in more than one language? | ficbot | General Discussions | 129 | 06-06-2010 11:39 AM |
Changing from french language to english | escalla | Sony Reader | 3 | 05-14-2009 12:57 PM |
Multi-language books in mobipocket | Jellby | Kindle Formats | 2 | 12-17-2008 08:34 PM |
E books in French | Leto | Reading Recommendations | 4 | 05-31-2008 11:02 AM |