|  07-29-2009, 12:26 PM | #1 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
				
				French Language Books
			 
			
			I have been trying to convert html and pdf books in French into prc or ePub. The conversions for html are better but there is still a problem with the "oe" and "à" and "è". The conversion from pdf is quite terrible - there seem to be line breaks after each accent and the accents are not on the letters but beside the letters. I have tried using the alias for CH in the look and feel to no avail. Could somebody please tell me what I need to put in the look and feel and how? Thanks for the help. | 
|   |   | 
|  07-29-2009, 12:28 PM | #2 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			For HTML conversions you may need to specify the correct input encoding, probably one of the following cp1252 cp1251 latin1 utf-8 | 
|   |   | 
| Advert | |
|  | 
|  07-29-2009, 12:30 PM | #3 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			Thanks for your help - how do I specify this - just putting it into the "Look and feel " ? What about the pdfs - is there anything I can do about these?
		 | 
|   |   | 
|  07-29-2009, 12:38 PM | #4 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Yes. As for PDF, not really.
		 | 
|   |   | 
|  07-29-2009, 01:13 PM | #5 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			I have tried all these options with no success - worse than when leaving it blank. I put them in the "Input character encoding" field in the "Look and Feel". Is this OK? Has anyone tried converting French language docs? Thanks for the help.
		 | 
|   |   | 
| Advert | |
|  | 
|  07-29-2009, 01:14 PM | #6 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			zip up the HTML before adding it to calibre (i.e. put it ina  ZIP file and add the ZIP file to calibre). Then try those options.
		 | 
|   |   | 
|  07-29-2009, 01:23 PM | #7 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			Thanks - this is exactly what I have done. Put the index file into calibre and it zipped up everything.
		 | 
|   |   | 
|  07-29-2009, 01:25 PM | #8 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			No I mean zip up all the files yourself first and then add the ersulting ZIP file to calibre
		 | 
|   |   | 
|  07-29-2009, 01:48 PM | #9 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			Great - this worked for latin-1 except for "oe" but the "è and à" are OK.  Now just need a work-round for pdfs | 
|   |   | 
|  07-29-2009, 03:15 PM | #10 | 
| Member  Posts: 12 Karma: 10 Join Date: Apr 2009 Device: Asus-A696, Cool-er, Hanlin V5 | 
			
			I have been trying to convert rtf and html texts in Hungarian to epub. "ű" and "ő" are ok in the calibre ebook viewer but not in ADE and Cool-er.
		 | 
|   |   | 
|  07-29-2009, 03:16 PM | #11 | 
| frumious Bandersnatch            Posts: 7,570 Karma: 20150435 Join Date: Jan 2008 Location: Spaniard in Sweden Device: Cybook Orizon, Kobo Aura | |
|   |   | 
|  07-30-2009, 07:13 PM | #12 | 
| Sigil & calibre developer            Posts: 2,487 Karma: 1063785 Join Date: Jan 2009 Location: Florida, USA Device: Nook STR | 
			
			I can fix this. There was a similar problem with German characters that have umlauts (e.g. ü). Can you attach or send me a PDF with the French accented characters so I can test my changes?
		 | 
|   |   | 
|  07-31-2009, 07:30 AM | #13 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			Hi user_none Glad to enclose the document. But perhaps it is better to send you the link: http://alysse.org/~thomas/autonomy_project/ The link to the pdf is at the bottom of the page. Bye | 
|   |   | 
|  07-31-2009, 07:32 AM | #14 | 
| Connoisseur            Posts: 61 Karma: 7104 Join Date: Jul 2009 Device: Hanlin V3, PB360 | 
			
			The html converted well with latin-9 but calibre only recognised the full ISO in the look and feel. The "oe" is still missing but perhaps this is due to a problem in the viewer or in the hanlin device. | 
|   |   | 
|  07-31-2009, 08:41 AM | #15 | 
| Sigil & calibre developer            Posts: 2,487 Karma: 1063785 Join Date: Jan 2009 Location: Florida, USA Device: Nook STR | 
			
			I've made a commit that cleans up French characters (e.g. à,û,é...) for PDF input. It should be in the next release. Let me know if I missed anything. Also, at least with conversion from PDF the oe character is displaying correctly in calibre's ebook-viewer.
		 | 
|   |   | 
|  | 
| Thread Tools | Search this Thread | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| 3 french language sites with lots of e-books | Liviu_5 | Reading Recommendations | 13 | 08-28-2012 04:29 AM | 
| Do you read books in more than one language? | ficbot | General Discussions | 129 | 06-06-2010 11:39 AM | 
| Changing from french language to english | escalla | Sony Reader | 3 | 05-14-2009 12:57 PM | 
| Multi-language books in mobipocket | Jellby | Kindle Formats | 2 | 12-17-2008 08:34 PM | 
| E books in French | Leto | Reading Recommendations | 4 | 05-31-2008 11:02 AM |