|07-13-2011, 06:09 PM||#1|
Join Date: Jul 2011
Device: Archos HT
Junk chars in splitted file names converting lit to epub
Well, I am trying to convert a lit to epub , and the language of the lit file has some special chars like I with a dot on it etc. After conversion I saw it stuck on my device and decided to dig some more. When explode the epub file, I have seen that the splitted file names are "corrupt" like
Necip Hablemito¦şlu-K+ûSTEBEK FETHULLAH+çI ¦-ST¦-HBARAT+çILAR DOSYASI_split_000.htm
while the content.opf includes
<item href="Necip Hablemitoğlu-KÖSTEBEK FETHULLAHÇI İSTİHBARATÇILAR DOSYASI_split_000.htm"............
As far as I understood the names for the splitted files are somehow retrieved from metadata of the lit file , and unfortunately I couldnt find a way to edit metadata of lit files yet.
I have tried to change code page for both cp1254 and to utf-8 with no luck
Is there any way to change the way of constructing name of main html file in the epub ? (as a workaround ,if you go from lit --> txt and then txt--> epub it is using index_split_001.html , since it does not take the "split file name" from metadata of lit , it uses default name, but not a way to go always)
|07-13-2011, 06:52 PM||#2|
creator of calibre
Join Date: Oct 2006
Location: Mumbai, India
Convert from LIT to MOBI and then convert the MOBI. That will preserve the formatting and workaround your issue.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|0.7.44 Problem with national chars while converting to epub||AdamV||Conversion||5||02-08-2011 08:01 PM|
|Classic Converting .epub to .pdb file format||ashalluri||Barnes & Noble NOOK||3||05-27-2010 05:07 PM|
|Beginner: Converting lit to epub-linefeed h*ll||PhyrePhox||ePub||10||04-07-2010 03:14 PM|
|Question about converting DRM lit to epub||weeziepepper||ePub||3||12-17-2009 10:52 AM|
|converting lit html output into one big file for BD||Dave Berk||Sony Reader||15||03-29-2007 10:02 PM|