01-13-2012, 02:33 AM | #1 |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
ePub to Word
The background to this question is that the usual work plan when I design an ebook is
- The publisher and author work together until they have a Word version of the book which is ready for the printer, - I design an ePub ebook from that Word document, and then make a Kindle version by converting with calibre. For an ebook which is nearly finished we have followed an unusual work plan because the author wanted to make extensive changes, so she and I have worked together and made the changes. This means of course that the text in the original Word document is quite different from the text and format in the ebooks, and it would be a mammoth task for the publisher to read the ebook and type out a new ready for printer Word document. Is there any software which can read the finished ePub version and create a ready for printer Word document from the ePub version? |
01-13-2012, 04:43 AM | #2 |
Evangelist
Posts: 450
Karma: 343115
Join Date: Nov 2009
Location: Romania
Device: PW2 2014
|
You could try converting the ePub to HTML and then importing the HTML file into Word. Styles will probably turn into direct formatting, though. Hmmm... Wouldn't it be easier to just edit the ePub file with Sigil or something?
And if they have the edited version of the Word file, can't they just send it to you? It would mean that you'd have to start from scratch... but if editing the ePub is out of the question, at least it's a start. What kind of "extensive" changes are we talking about? |
01-14-2012, 02:40 AM | #3 |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Thanks, DSpider
To answer your last question first: we're talking about multiple punctuation changes, changes of tense, adding or deleting or changing many phrases or sentences ... and probably more that I've forgotten. When I say multiple or many I mean scores if not hundreds of changes. I'm not sure if the author has kept an up to date version of the Word file. The way I make ebooks is to get the original Word file, then mark it up to be valid XHTML files, then make the appropriate content.opf and toc.ncx files, and then package the lot up into ePub. I use an HTML editor - not Sigil, but one I used when I was dabbling in website design. I didn't know that Word could read HTML files, but I don't use Word myself, I use LibreOffice. I'll check to see if that or OpenOffice.org can read HTML and convert the files into Word. |
01-14-2012, 03:57 AM | #4 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
The method I use, is create a single file in Sigil (merge...). Extract the ePUB and load the HTML in Notepad++. Change the header by making is pure HTML and set UTF-8. Load the HTML in Word.
One issue is that the style names are gone. I would have loved that those style names would be retained in Word. At least the styles are applied most of the times. Sometimes italics are gone. |
01-15-2012, 01:27 AM | #5 | |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Quote:
|
|
01-15-2012, 04:44 AM | #6 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You can use almost any text editor instead of Notepad++. In this case I only use it to add
Code:
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/> |
04-16-2012, 12:28 AM | #7 | |
Member
Posts: 17
Karma: 10
Join Date: Jan 2011
Device: Mac, VM solutions
|
@Toxaris
I find Calibre formatting very ugly to read. Nice team though. I like to retain author's layout where I can. So I attempted your method, and now all muddled. Is there still no automatic program (april 2012)? I found this free online converter from epub (or mobi) to PDF (or word): http://ebook.online-convert.com/convert-to-pdf I used to ipad or msreader. as output setting. although I read on my iphone. these were best result for me after testing all options. bad news on result: 1 images cut in half (so no intelligent page cut offs) 2 images are not "images" in output 3 hyperlinks are NOT active 4 sometimes, for ipad, the bibliography would get split e.g. 1 entry per page.. good news 1 formatting is nice (much better than Calibre) 2 its 1 step process, very quick and free ----- I have sigil, but find it very confusing to use. 1 In sigil, can I add this "merge" function on toolbar so 1 click? 2 In sigil, can I add this "extract" function on toolbar so 1 click? I have another texteditor (also I find confusing). 3 what are steps to "Change the header" by making is pure HTML 4 How do I "set UTF-8". I can do rest of steps 5 Load the HTML in Word. lastly, any improvement on "retaining style names"? Quote:
|
|
04-16-2012, 01:23 AM | #8 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
The situation is about the same. Actually, with my method the text is not muddled, so I wonder why it is for you... Images and everything are retained (if you extract them from the ePUB of course...
Merging in Sigil is easy. Select the text files you need in the Bookbrowser, right-click and select merge. I usually then copy the new HTML file in Notepad++ for additional work. I don't save the ePUB, because I want to keep that. Extracting the images and stylesheet I do with a zip-program. Yes, it is still manual work. BTW, either retain the file structure or adjust your links. Changing the header is also easy. In you big HTML file, delete the first couple of lines. The first line must be <html>. Copy the line I mentioned above between the <head></head> tags. The stylenames will be gone. Blame Microsoft if you want. |
04-16-2012, 02:04 AM | #9 |
Member
Posts: 17
Karma: 10
Join Date: Jan 2011
Device: Mac, VM solutions
|
found this FREE service online.
www.zamzar.com better than online converter as supports intelligent page ends e.g. does not cut pictures in half e.g. ends table full row on one page bad - is the cover was cut bad - hyperlinks still lost as with online converter bad - images as images still lost as with online converter bad - chapter starts, bib and index do NOT start on new page. so all messy still FREE very fast, and sends you link. 24hr to dl. my current reccomendation is this one. |
04-16-2012, 06:59 AM | #10 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Nah, my own method works a lot better. You can actually speed it up a bit. Convert the ePUB to HTMLZ with Calibre. Basically that is a zip file with all the HTML files merged. Extract the resulting HTMLZ and read the HTML into Word.
|
02-03-2014, 04:39 PM | #11 | |
Banned
Posts: 74
Karma: 20416
Join Date: Nov 2010
Location: Hungary, Budapest
Device: monitor
|
Quote:
I tried: changed all occurences of <a href="../Text/mymergedbook_1.xhtml# to <a href="# before opening the html in Word, but with no success. And, the second: after merging files with Sigil, the new file could not open in Firefox Epubreader! It says: "A Firefox nem találja a fájlt a(z) /C:/Documents and Settings/Lala/Application Data/Mozilla/Firefox/Profiles/febeprof.Lala/epub/127/OEBPS/jogilexikon_1.xhtml helyen." (Firefox does not find the file in that place.) So, total failure... Last edited by laji; 02-03-2014 at 05:38 PM. |
|
02-04-2014, 04:09 AM | #12 | |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Quote:
If you merge the files with Sigil (take care if you have internal stylesheets like sgc-x, things can really be messy then), you need to recreate the ncx (TOC) before using it as an ePUB again. Probably that is the reason why it doesn't work in ePUBReader. |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
From MS Word 2010 to ePub | OlleF | Workshop | 11 | 04-20-2014 07:12 AM |
ePub to Word | democrite | ePub | 9 | 11-09-2011 02:12 PM |
The best format to convert from Word to epub | yaip | ePub | 1 | 03-26-2011 12:41 AM |
MS Word directly to ePub | Lady Fitzgerald | Workshop | 6 | 09-11-2010 05:21 AM |
Word/PDF to ePUB Format | ygrinch | Calibre | 0 | 02-02-2010 01:13 PM |