I have now done this conversion. The only way I found that gives a correct result (proper index, accents and ligatures preserved, images in place) is as follows (Linux and Hanlin specific):
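a) Split the HTML so that there is a separate HTML file for each chapter (or use the existing files if the book is already split that way)
b) Display the HTML in Firefox, open OpenOffice Writer, and copy/paste from Firefox into the word processor
c) Save as an odt file and clean up the text, removing HTML-specific stuff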
d) Save this file in html format
e) Open this HTML in Bluefish (Screem tends to fall down, losing all the work)
f) Where an image has been used for the ligature "oe", replace it if you wish with "oe". This can be done using replace with a regexp of the type: <IMG\b[^>]*img1\b[^>]*>
g) For all other images, use replace to change "<IMG" to "(IMG" and the end marker ">" to ")"
h) Now save this modified file as encoded text (UTF-8)
i) Start a project in eCub and import the encoded txt files with the option of immediate conversion to HTML
j) Edit these HTML files in the project folder, reversing "(IMG" to "<IMG" and ")" to ">"
k) Compile the ePub file and open it in Azardi to make sure all is OK
l) In the build folder, take the content.opf file and add a line in the reference (<guide>) section as follows: <reference type="toc" title="Table of Contents" href="TableOfContents.html"/>
m) Put a link to the table of contents under the main title in the first HTML file
n) Now remake the ePub by hand, so as not to overwrite the edited content.opf, with
zip -Xr9D $1.epub mimetype * -x .DS_Store
o) Make the mobi file from this ePub file using either mobigen or calibre.
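The image-tag steps above (replacing the ligature image, masking the remaining IMG tags as "(IMG ...)" before the text export, and restoring them afterwards) could also be scripted instead of done by hand in the editor. The following is only a rough Python sketch, not what I actually ran: the function names are made up for illustration, "img1" is the image name from the regexp in the post, and it assumes that no tag attribute contains a ")" and that the body text never contains a literal "(IMG".

```python
import re

# Regexp from the post: any <IMG ...> tag whose attributes mention "img1".
# "img1" is the ligature image name there; adjust it to your own file.
LIGATURE_IMG = re.compile(r"<IMG\b[^>]*img1\b[^>]*>", re.IGNORECASE)

# Any remaining <IMG ...> tag; the attributes are captured in group 1.
ANY_IMG = re.compile(r"<IMG\b([^>]*)>", re.IGNORECASE)

# A masked tag as written by mask_images(), e.g. (IMG SRC="pic.png").
# Assumption: the attributes contain no ")" character.
MASKED_IMG = re.compile(r"\(IMG\b([^)]*)\)")


def replace_ligature_images(html, replacement="oe"):
    """Replace every ligature image tag with plain text."""
    return LIGATURE_IMG.sub(replacement, html)


def mask_images(html):
    """Turn <IMG ...> into (IMG ...) so the tag survives the export to text."""
    return ANY_IMG.sub(r"(IMG\1)", html)


def unmask_images(text):
    """Reverse mask_images(): turn (IMG ...) back into <IMG ...>."""
    return MASKED_IMG.sub(r"<IMG\1>", text)
```

Note that mask_images() writes the tag name in upper case whatever the case in the input, so unmask_images() only needs to look for "(IMG".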
Mobigen often falls down when run in Wine, but after 2 or 3 tries it usually works and produces a good result. The file msvcp71.dll needs to be put into Wine's system32 folder, and I found it better to run the DOS version:
wine "h:\\Mobigen\\mobigen\\mobigen.exe" "h:\\AuNB\\EcubProjectFiles\\build\\OPS\\content.opf" -c1 -s0
Wine needs to be configured first so that the equivalent DOS drives (h: in the command above) are available.
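Since mobigen under Wine usually succeeds by the second or third try, the retries can be automated. This is a minimal Python sketch under one big assumption: that a crash shows up as a non-zero exit status. I have not checked whether mobigen also exits non-zero for mere warnings, in which case the loop would retry needlessly. The helper name run_with_retries is made up for illustration; the command line is the one from the post.

```python
import subprocess


def run_with_retries(cmd, attempts=3):
    """Run cmd until it exits with status 0.

    Returns the number of the attempt that succeeded and raises
    RuntimeError if every attempt fails.
    """
    for attempt in range(1, attempts + 1):
        result = subprocess.run(cmd)
        if result.returncode == 0:
            return attempt
        print(f"attempt {attempt} failed with exit status {result.returncode}")
    raise RuntimeError(f"command still failing after {attempts} attempts")


# Command line from the post; the h: paths depend on the Wine drive mapping.
MOBIGEN_CMD = [
    "wine",
    r"h:\Mobigen\mobigen\mobigen.exe",
    r"h:\AuNB\EcubProjectFiles\build\OPS\content.opf",
    "-c1",
    "-s0",
]

# Usage (needs Wine and mobigen installed):
# run_with_retries(MOBIGEN_CMD)
```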
The ePub file is problematic on the Hanlin because the font is shown at the minimum size and there are spaces after the accented characters. The mobi file, however, reads very well.