#1 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
French Language Books
I have been trying to convert HTML and PDF books in French into PRC or ePub.
The HTML conversions are better, but there is still a problem with "oe", "à" and "è". The PDF conversion is quite terrible: there seem to be line breaks after each accent, and the accents appear beside the letters rather than on them. I have tried using the alias for CH in the Look and Feel settings, to no avail. Could somebody please tell me what I need to put in Look and Feel, and how? Thanks for the help. |
#2 |
creator of calibre
Posts: 45,164
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
For HTML conversions you may need to specify the correct input encoding, probably one of the following:
cp1252 cp1251 latin1 utf-8 |
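To see why the choice of input encoding matters, here is a minimal Python sketch that decodes the same bytes under each of the four suggested encodings. The sample string and the `decode_with` helper are made up for illustration, and the sketch assumes the source file was saved as cp1252; it says nothing about how calibre reads the file internally.

```python
# Hypothetical sample: French characters as a Windows editor might save them (cp1252).
raw = "è à œ".encode("cp1252")

def decode_with(data, encodings):
    """Decode the same bytes under each candidate encoding."""
    # errors="replace" turns undecodable bytes into U+FFFD instead of raising.
    return {enc: data.decode(enc, errors="replace") for enc in encodings}

results = decode_with(raw, ["cp1252", "cp1251", "latin1", "utf-8"])
for enc, text in results.items():
    print(f"{enc:8} -> {text!r}")
```

Only the encoding the file was actually written in gives back the original text. A wrong guess gives Cyrillic letters, control characters or replacement marks in place of the accented letters.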
#3 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Thanks for your help. How do I specify this: just by putting it into "Look and Feel"? And what about the PDFs: is there anything I can do about these?
|
#4 |
creator of calibre
Posts: 45,164
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Yes. As for PDF, not really.
|
#5 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
I have tried all these options with no success; the results are worse than when leaving the field blank. I put them in the "Input character encoding" field under "Look and Feel". Is this OK? Has anyone tried converting French-language documents? Thanks for the help.
|
#6 |
creator of calibre
Posts: 45,164
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Zip up the HTML before adding it to calibre (i.e. put it in a ZIP file and add the ZIP file to calibre). Then try those options.
|
#7 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Thanks, this is exactly what I have done. I put the index file into calibre and it zipped up everything.
|
#8 |
creator of calibre
Posts: 45,164
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
No, I mean zip up all the files yourself first and then add the resulting ZIP file to calibre.
|
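For anyone who prefers to script the "zip it yourself" step, a minimal sketch using Python's standard `zipfile` module might look like the following. The `zip_html_book` name, the folder layout and the sample files are all hypothetical; the resulting ZIP would then be added to calibre by hand as described above.

```python
import tempfile
import zipfile
from pathlib import Path

def zip_html_book(src_dir, zip_path):
    """Put every file under src_dir into zip_path, keeping relative paths."""
    src = Path(src_dir)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(src.rglob("*")):
            if path.is_file():
                zf.write(path, path.relative_to(src).as_posix())
    return zip_path

# Demo with throwaway files; real use would point at the book's own folder.
with tempfile.TemporaryDirectory() as tmp:
    book = Path(tmp) / "book"
    book.mkdir()
    (book / "index.html").write_bytes("<p>déjà</p>".encode("latin-1"))
    (book / "chapter1.html").write_bytes("<p>très</p>".encode("latin-1"))
    archive = zip_html_book(book, Path(tmp) / "book.zip")
    with zipfile.ZipFile(archive) as zf:
        names = sorted(zf.namelist())
print(names)
```

The files are stored byte for byte, so zipping does not change their encoding.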
#9 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Great, this worked with latin-1: "è" and "à" are OK, but "oe" is still wrong.
Now I just need a workaround for PDFs. |
#10 |
Member
Posts: 12
Karma: 10
Join Date: Apr 2009
Device: Asus-A696, Cool-er, Hanlin V5
|
I have been trying to convert RTF and HTML texts in Hungarian to ePub. "ű" and "ő" are OK in the calibre ebook viewer but not in ADE or on the Cool-er.
|
#11 |
frumious Bandersnatch
Posts: 7,543
Karma: 19001583
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
|
#12 |
Sigil & calibre developer
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
I can fix this. There was a similar problem with German characters that have umlauts (e.g. ü). Can you attach or send me a PDF with the French accented characters so I can test my changes?
|
#13 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
Hi user_none
Glad to enclose the document. But perhaps it is better to send you the link: http://alysse.org/~thomas/autonomy_project/ The link to the pdf is at the bottom of the page. Bye |
#14 |
Connoisseur
Posts: 61
Karma: 7104
Join Date: Jul 2009
Device: Hanlin V3, PB360
|
The HTML converted well with latin-9, but calibre only recognised the full ISO name in Look and Feel.
The "oe" is still missing, but perhaps this is due to a problem in the viewer or in the Hanlin device. |
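A short Python check shows which of the encodings discussed in this thread can represent the "œ" ligature at all. The `can_encode` helper is made up for illustration. Whether this explains the missing "oe" here is only a guess, since the thread does not say which encoding the source file really uses.

```python
def can_encode(char, encoding):
    """Return True if the encoding has a code point for char."""
    try:
        char.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False

support = {enc: can_encode("œ", enc)
           for enc in ("latin-1", "iso-8859-15", "cp1252")}
print(support)

# The ligature sits at a different byte in each encoding that has it.
oe_cp1252 = "œ".encode("cp1252")
oe_latin9 = "œ".encode("iso-8859-15")
print(oe_cp1252, oe_latin9)

# A cp1252 "œ" read as latin-9 becomes an invisible control character.
misread = oe_cp1252.decode("iso-8859-15")
print(repr(misread))
```

latin-1 has no "œ", so that setting cannot produce it. latin-9 (ISO-8859-15) and cp1252 both have it, but at different byte values. If the file was in fact saved as cp1252, reading it as latin-9 would keep "è" and "à" intact and still lose the ligature.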
#15 |
Sigil & calibre developer
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
I've made a commit that cleans up French characters (e.g. à,û,é...) for PDF input. It should be in the next release. Let me know if I missed anything. Also, at least with conversion from PDF the oe character is displaying correctly in calibre's ebook-viewer.
|
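As a rough illustration of this kind of cleanup (not the actual calibre commit, whose code is not shown in this thread), the Python sketch below reattaches a stray accent to the vowel that follows it. It assumes the PDF text puts the accent before the letter, possibly with a line break in between; the mapping, the `reattach_accents` name and the sample text are all hypothetical.

```python
import re
import unicodedata

# Assumed mapping from stand-alone accent marks to Unicode combining marks.
COMBINING = {
    "`": "\u0300",       # grave
    "\u00b4": "\u0301",  # acute
    "^": "\u0302",       # circumflex
    "\u00a8": "\u0308",  # diaeresis
}

# A stray accent, optional whitespace or line break, then a vowel.
STRAY_ACCENT = re.compile(r"([`\u00b4^\u00a8])\s*([aeiouAEIOU])")

def reattach_accents(text):
    """Move a stray accent onto the vowel that follows it."""
    def fix(match):
        accent, letter = match.groups()
        return letter + COMBINING[accent]
    # NFC folds letter + combining mark into one precomposed character.
    return unicodedata.normalize("NFC", STRAY_ACCENT.sub(fix, text))

sample = "d\u00b4\nej`a vu, s^ur"
fixed = reattach_accents(sample)
print(fixed)
```

A real fix would need to look at how the PDF extractor orders accents and letters, and would have to avoid touching backticks or carets that are meant literally.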