Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 12-12-2010, 04:58 PM   #1
Fking
Junior Member
Fking began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Dec 2010
Device: nexus one
Exclamation Converting cyrillic files to epub, best practices?

I've got bunch of doc, pdf, txt files in cyrillic which i want to convert to epub and enjoy on my nexus one with Aldiko reader.

I tried converting them by using cp1251 and UTF-8 for the character encoding field in Calibre, but they always open unreadable in Aldiko.

So, from here i don't have idea what to do. What would be to best practice to convert all kinds of cyrillic files into epub and read them normally?
Fking is offline   Reply With Quote
Old 12-13-2010, 07:16 AM   #2
Fking
Junior Member
Fking began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Dec 2010
Device: nexus one
I'm trying all kinds of options and still can't make it work
Fking is offline   Reply With Quote
Advert
Old 12-13-2010, 07:52 AM   #3
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
One thing I am not sure of is what fonts are required on the reading device to support Cyrillic eBooks? It is possible to embed fonts in an ePub file, but at the moment this has to be done manually as there is no automated calibre support for embedding fonts (although it is on the roadmap I believe).
itimpi is offline   Reply With Quote
Old 12-13-2010, 08:13 AM   #4
Fking
Junior Member
Fking began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Dec 2010
Device: nexus one
Hm, i'm not really familiar with that, can you tell me how to do it? Processing the books 1 by 1 would still be good for a start
Fking is offline   Reply With Quote
Old 12-29-2010, 11:34 PM   #5
Agnessa
Junior Member
Agnessa began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Dec 2010
Device: iPad| iPhone4| iPhone3| Kindle 2
Same problem Can't process russian files at all, even 1 by 1.
Agnessa is offline   Reply With Quote
Advert
Old 12-30-2010, 08:05 AM   #6
kiklop74
Guru
kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.kiklop74 can program the VCR without an owner's manual.
 
kiklop74's Avatar
 
Posts: 800
Karma: 194644
Join Date: Dec 2007
Location: Argentina
Device: Kindle Voyage
Quote:
Originally Posted by Fking View Post
I've got bunch of doc, pdf, txt files in cyrillic which i want to convert to epub and enjoy on my nexus one with Aldiko reader.

I tried converting them by using cp1251 and UTF-8 for the character encoding field in Calibre, but they always open unreadable in Aldiko.

So, from here i don't have idea what to do. What would be to best practice to convert all kinds of cyrillic files into epub and read them normally?
Most of the e-book readers do not support Cyrillic letters. Therefore in order to properly render those texts in EPUB format you need to embed fonts that do support that.

To do this correctly you must previously prepare your text. The best results are accomplished with utf-8 HTML. So here are the steps:

1. Convert your source text into utf-8 encoded HTML
2. Make sure HTML file has appropriate meta tag for charset
Quote:
<META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=utf-8">
3. Download Liberation font pack from here https://fedorahosted.org/liberation-fonts/ and unpack them in the same folder with HTML file
4. Add this to the head section of your HTML file:
Quote:
<style type="text/css">
@font-face { font-family: "Liberation Serif";
font-style: normal;
font-weight: normal;
src: url(LiberationSerif-Regular.ttf)
}
@font-face { font-family: "Liberation Serif";
font-style: normal;
font-weight: bold;
src: url(LiberationSerif-Bold.ttf)
}
@font-face { font-family: "Liberation Serif";
font-style: italic;
font-weight: normal;
src: url(LiberationSerif-Italic.ttf)
}
@font-face { font-family: "Liberation Serif";
font-style: italic;
font-weight: bold;
src: url(LiberationSerif-BoldItalic.ttf)
}
body {font-family: "Liberation Serif",serif}
</style>
5. Now you can import that file into calibre and convert it to epub. If you did everything right - you will be able to read russian texts.

Simmilar thing was explained here

https://www.mobileread.com/forums/showthread.php?t=84256
kiklop74 is offline   Reply With Quote
Old 01-09-2011, 06:06 AM   #7
discens
Junior Member
discens began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Feb 2010
Device: Kindle 2, Kindle DX, Sony PRS-650
Reading the previous post made me realize how much of a newbie I am.
If anyone is patient enough, I would like to have some clarifications, please. I would like to convert a .docx file (with both cyrillic and latin characters) to .epub. I own a Sony PRS-650.


Quote:
Originally Posted by kiklop74 View Post
1. Convert your source text into utf-8 encoded HTML
I did this by setting the utf-8 option in MS Word 2007 and then saving the file (let's call it example1) as a "webpage" (not as a single file webpage).

Quote:
Originally Posted by kiklop74 View Post
2. Make sure HTML file has appropriate meta tag for charset
To modify the html file, I opened it with the Note Pad, and replaced the string <meta http-equiv=Content-Type content="text/html; charset=windows-1252"> with the one given above.

Quote:
Originally Posted by kiklop74 View Post
3. Download Liberation font pack from here https://fedorahosted.org/liberation-fonts/ and unpack them in the same folder with HTML file
a)Assuming the name of the html file is example1, is this the folder called example1_file or the folder which contains both the example1_file folder and the example1 file (sorry for the imprecise terminology)?


b) What should the final result of the unpacking be? The .tar file, of should this be unpacked again?

Quote:
Originally Posted by kiklop74 View Post
4. Add this to the head section of your HTML file:
Where exactly in the head section? I succeded in converting the html file to epub through Calibre only by adding this just after the last occurrence of </style> (otherwise I got fail messages), but in any case with the resulting epub file question marks showed instead of cyrillic characters on the Sony Reader.

Finally, after doing these modifications with the Note Pad, is it correct to save the file with the extension .htm choosing utf-8 as encoding?

Thank you very much for any reply.

Last edited by discens; 01-09-2011 at 09:38 AM.
discens is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
original files for converting epub yuxi_kelly ePub 2 12-04-2010 05:56 AM
Problem Converting pdf files to epub file clover_kid Calibre 2 10-28-2010 05:32 PM
EPUB Math: Best Practices for Mathematics in Ebooks Adjust News 0 08-23-2010 10:17 PM
Problems converting ePub Feed files into Mobipocket Format torben Calibre 21 02-21-2009 02:42 PM
EPUB best practices guide Bob Russell ePub 25 04-01-2008 08:36 AM


All times are GMT -4. The time now is 01:16 AM.


MobileRead.com is a privately owned, operated and funded community.