07-13-2011, 06:09 PM   #1
ozofmoz
Device: Archos HT
Junk characters in split file names when converting LIT to EPUB

Well, I am trying to convert a LIT file to EPUB, and the language of the LIT file uses some special characters, such as a capital I with a dot on it (İ). After the conversion the book got stuck on my device, so I decided to dig some more. When I unpacked the EPUB, I saw that the names of the split files are "corrupt", like

Necip Hablemito¦şlu-K+ûSTEBEK FETHULLAH+çI ¦-ST¦-HBARAT+çILAR DOSYASI_split_000.htm

while the content.opf includes

<item href="Necip Hablemitoğlu-KÖSTEBEK FETHULLAHÇI İSTİHBARATÇILAR DOSYASI_split_000.htm"............
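
To check every file at once instead of comparing names by eye, something like the following sketch could be used. It lists the manifest entries in content.opf that have no matching file inside the EPUB. The function name is made up for illustration, and it assumes content.opf sits in the root of the archive, which may not hold for every EPUB.

```python
import posixpath
import zipfile
import xml.etree.ElementTree as ET
from urllib.parse import unquote

# Namespace used by <item> elements in an OPF package document.
OPF_NS = "{http://www.idpf.org/2007/opf}"


def unmatched_manifest_items(epub, opf_name="content.opf"):
    """Return manifest hrefs that have no matching member in the archive.

    `epub` is a path or a binary file-like object. `opf_name` is an
    assumption: the OPF file is expected under that name in the archive.
    """
    with zipfile.ZipFile(epub) as zf:
        members = set(zf.namelist())
        root = ET.fromstring(zf.read(opf_name))

    base = posixpath.dirname(opf_name)
    missing = []
    for item in root.iter(OPF_NS + "item"):
        # hrefs may be percent-encoded and are relative to the OPF file.
        href = unquote(item.get("href", ""))
        target = posixpath.normpath(posixpath.join(base, href))
        if target not in members:
            missing.append(href)
    return missing
```

Calling unmatched_manifest_items("book.epub") (the path is a placeholder) returns a list of hrefs. An empty list would suggest the names inside the archive agree with the manifest, so any junk would be coming from the tool used to unpack it.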

As far as I understand, the names of the split files are somehow taken from the metadata of the LIT file, and unfortunately I couldn't find a way to edit the metadata of LIT files yet.

I have tried changing the code page to both cp1254 and utf-8, with no luck.
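
One possible explanation, and this is only a guess, is that the names are stored as UTF-8 but read back through a DOS (OEM) code page somewhere along the way. The sketch below tries that idea: it encodes a character as UTF-8 and decodes the bytes with cp857, the Turkish DOS code page. Both the choice of cp857 and the helper name are assumptions made for illustration.

```python
def decode_utf8_as(name, codepage="cp857"):
    """Encode `name` as UTF-8, then decode those bytes with `codepage`.

    This imitates a tool that reads UTF-8 file names using a legacy
    code page. cp857 is a guess, not a confirmed cause.
    """
    return name.encode("utf-8").decode(codepage, errors="replace")


# Print how a few of the affected characters come out.
for ch in "ğÖÇ":
    print(ch, "->", decode_utf8_as(ch))
```

Under that assumption, the second character of each result is ş, û and ç respectively, which matches the second junk character seen after Hablemito, K and FETHULLAH in the corrupt name above. The first junk character does not come out the same way, so this is at best a partial match.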

Is there any way to change how the name of the main HTML file in the EPUB is constructed? (As a workaround, if you go LIT --> TXT and then TXT --> EPUB, it uses index_split_001.html: since it does not take the split file name from the metadata of the LIT file, it uses the default name. But that is not always a way to go.)