Old 12-14-2010, 03:32 PM   #1
gertblij
Enthusiast
gertblij has learned how to buy an e-book online
 
Posts: 39
Karma: 85
Join Date: Oct 2010
Location: Cape Town, South Africa
Device: Kindle 3
MS Word 2010 to EPUB conversion

I have edited a file in Word 2010. Looks great.

A couple of pictures, including the book cover at the front. Word generated the TOC based on Headings 1 and 2. The first line of each paragraph is indented, except for the first paragraph of a chapter. All text is justified left and right.

From this forum I picked up that the best way to get the file into Calibre is to save it as html. And that is where I get stuck. (All I do is "Save As" in Word, "Add Book" in Calibre, then convert to EPUB in Calibre.)

When I use the straight save as html version I retain the paragraph formatting in the EPUB, but all the pictures are gone.
When I use the save as html filtered option, all the pictures are there, but I lose the paragraph formatting.

Any suggestions for a way to keep all the pictures and the indented and not indented paragraph formats?
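For readers who prefer the command line, the same Word-to-HTML-to-EPUB route can be sketched with calibre's `ebook-convert` tool. This is a minimal sketch, not a confirmed fix for this thread: the file names are placeholders, and the CSS rule is an assumption about how the headings and paragraphs come out of Word. Whether the rule wins over Word's own styles depends on how the HTML was written.

```python
import shlex

# Assumed CSS: remove the first-line indent from any paragraph that directly
# follows a level-1 or level-2 heading.
FIRST_PARA_CSS = "h1 + p, h2 + p { text-indent: 0; }"


def build_convert_command(html_path, epub_path, extra_css=FIRST_PARA_CSS):
    """Return the argument list for calibre's ebook-convert tool."""
    return ["ebook-convert", html_path, epub_path, "--extra-css", extra_css]


if __name__ == "__main__":
    # "book.htm" and "book.epub" are placeholder file names.
    cmd = build_convert_command("book.htm", "book.epub")
    # Print a copy-pasteable command line; pass cmd to subprocess.run to execute it.
    print(shlex.join(cmd))
```

The command is only built and printed here, so the sketch can be tried without calibre installed.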
Old 12-14-2010, 03:48 PM   #2
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by gertblij View Post
When I use the straight save as html version I retain the paragraph formatting in the EPUB, but all the pictures are gone.
That sounds like an error in Word. There's no reason it should lose images. Still, I would not use the straight html save option.

Quote:
When I use the save as html filtered option, all the pictures are there, but I lose the paragraph formatting.
It sounds to me like you've defined the paragraph formatting oddly in Word. I retain paragraph formatting when saving as filtered html. Indenting, centering, etc. all save fine.
Old 12-14-2010, 07:24 PM   #3
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.
 
Posts: 8,908
Karma: 12755553
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
Quote:
Originally Posted by gertblij View Post
When I use the straight save as html version I retain the paragraph formatting in the EPUB, but all the pictures are gone.
Just a guess, but this may be Word saving the images in a Windows Metafile format (WMF), which calibre can't convert.

Quote:
Originally Posted by gertblij View Post
Any suggestions for a way to keep all the pictures and the indented and not indented paragraph formats?
I would suggest doing what Starson17 suggested and reviewing how you format the document.

You can also try using another application.

Atlantis Word Processor can save directly as an epub (ebook). The free, open-source OpenOffice, or its spin-off under the new name LibreOffice, can save in ODT, which calibre will convert well. There is also a plugin to save directly as an epub; this plugin is still being developed, but it works great within its current limits and there are plans to extend and improve it.
Old 12-15-2010, 09:55 AM   #4
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by dwanthny View Post
Just a guess, but this may be Word saving the images in a Windows Metafile format (WMF), which calibre can't convert.
I read his post as saying that the images were missing from the saved html, but on rereading it, I suspect you are right. However, I'm not sure why Word would save images in one format for the normal html and in another format for the filtered html, but who knows with Microsoft.
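One way to check this guess is to list which image file types the saved HTML actually refers to. The sketch below scans the raw HTML text for every `src="..."` attribute (including ones inside comments) and counts the file extensions. The set of "web-friendly" extensions is an assumption, not a list taken from calibre.

```python
import re
from collections import Counter
from pathlib import PurePosixPath

# Assumption: formats that most EPUB readers display natively.
WEB_FRIENDLY = {".jpg", ".jpeg", ".png", ".gif", ".svg"}

SRC_PATTERN = re.compile(r"""src\s*=\s*["']([^"']+)["']""", re.IGNORECASE)


def image_extensions(html_text):
    """Count the file extensions of every src="..." reference in the HTML."""
    counts = Counter()
    for src in SRC_PATTERN.findall(html_text):
        counts[PurePosixPath(src).suffix.lower()] += 1
    return counts


def suspicious(counts):
    """Return the extensions that are not in the web-friendly set."""
    return sorted(ext for ext in counts if ext not in WEB_FRIENDLY)
```

Running `suspicious(image_extensions(text))` on both the plain and the filtered HTML would show whether the two save options really reference different image formats.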
Old 12-15-2010, 03:07 PM   #5
gertblij
Enthusiast
gertblij has learned how to buy an e-book online
 
Posts: 39
Karma: 85
Join Date: Oct 2010
Location: Cape Town, South Africa
Device: Kindle 3
Thanks for all the advice, but I'm not quite there yet. Before I try the other word processor option (big effort), how does one explain the following:

I saved the document from Word 2010 in PDF format. The PDF renders perfectly: all the pics are there, the TOC works perfectly, and the first line of every paragraph is indented, except for the paragraphs immediately following an (original) Heading 1 or 2, which are left aligned. In other words, exactly like the Word layout.

But when I then add this PDF file to Calibre and convert it to EPUB, everything comes across OK, except that the paragraphs immediately following the headings are now indented as well, instead of left aligned.

Could this be a Calibre conversion problem?
Old 12-15-2010, 03:35 PM   #6
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by gertblij View Post
I saved the document from Word 2010 in PDF format. The PDF renders perfectly,
PDF is based on PostScript, which is a printer language, and Word is designed to create printable documents. It's expected that the PDF will look correct, just as you expect Word to print correctly.

Quote:
when I then add this PDF file to Calibre and convert it to EPUB, everything comes across OK, except that the paragraphs immediately following the headings are now indented as well, instead of left aligned.
Could this be a Calibre conversion problem?
Yes. PDF is a horrible ebook format and it's hard to get anything useful out of that format once a book is in it.
Old 12-15-2010, 04:47 PM   #7
gertblij
Enthusiast
gertblij has learned how to buy an e-book online
 
Posts: 39
Karma: 85
Join Date: Oct 2010
Location: Cape Town, South Africa
Device: Kindle 3
Quote:
Originally Posted by Starson17 View Post
PDF is based on PostScript, which is a printer language, and Word is designed to create printable documents. It's expected that the PDF will look correct, just as you expect Word to print correctly.


Yes. PDF is a horrible ebook format and it's hard to get anything useful out of that format once a book is in it.
Thanks Starson17. I appreciate your prompt responses. However, they are not very helpful. I am not a techie like you and depend on software to work. 100,000 reasons why it won't work don't solve my problem.

Calibre offers "PDF to other formats" conversion. I love it. I appreciate that there are many technical reasons why E does not equal MC2, but then say so. Just say, "please expect the output to be different from the input" and I'll stop trying and wasting time. That simple.

BTW, I have stopped trying and consider this thread closed.

Thanks for all the effort.

PS
Apologies up front if my frustration shows. But I am tired (physically) and frustrated from spending many hours trying to get somewhere without moving one inch/cm.
Old 12-15-2010, 04:54 PM   #8
EowynCarter
Wizard
EowynCarter ought to be getting tired of karma fortunes by now.
 
Posts: 4,131
Karma: 1604192
Join Date: Oct 2008
Location: Paris
Device: Cybooks; Sony PRS-T1
Why not use OpenOffice to open the .doc, then use OpenOffice's ePub plugin?
Old 12-15-2010, 07:05 PM   #9
frabjous
Wizard
frabjous can solve quadratic equations while standing on his or her head reciting poetry in iambic pentameter
 
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
Quote:
Originally Posted by Starson17 View Post
Yes. PDF is a horrible ebook format and it's hard to get anything useful out of that format once a book is in it.
I do wish people would stop saying this.

PDF is far and away the best ebook format. Because it's an output format, it's just a lousy format to convert from, and a lousy format for distributing. If you create your own PDF with your own desired settings, however, nothing else comes close in quality.

Word would not be my software of choice, but if you set the page size, font size and margins correctly and export to PDF you can get a very nice looking PDF from Word for reading on a portable device: at least as good as any other format.
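As a rough illustration of "setting the page size correctly", the page dimensions to enter in the word processor can be derived from the screen's pixel size and pixel density. The numbers below (600 x 800 pixels at 167 pixels per inch) are assumed example values, not figures taken from this thread, so substitute the specifications of your own device.

```python
def page_size_inches(width_px, height_px, ppi):
    """Return (width, height) in inches for a screen of the given pixel size."""
    return width_px / ppi, height_px / ppi


if __name__ == "__main__":
    # Assumed example values: a 600 x 800 pixel screen at 167 pixels per inch.
    width_in, height_in = page_size_inches(600, 800, 167)
    print(f"Page size: {width_in:.2f} in x {height_in:.2f} in")
```

A reader application may reserve part of the screen for its own margins or status bars, so the usable area can be somewhat smaller than this calculation suggests.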

But no, don't convert the PDF to ePub after making it. If you really need an ePub, you'll need to use other means.
Old 12-15-2010, 09:11 PM   #10
jackie_w
Wizard
jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 2,905
Karma: 4329385
Join Date: Sep 2009
Location: UK
Device: Sony PRS-350, PB360, Kobo Glo/AuraHD/Aura6"/AuraH2O
Quote:
Originally Posted by gertblij View Post
When I use the save as html filtered option, all the pictures are there, but I lose the paragraph formatting.

Any suggestions for a way to keep all the pictures and the indented and not indented paragraph formats?
I have a much older version of Word but html-filtered has always worked for me.

If you are able to post a small piece of your badly-behaving html perhaps one of us will be able to spot something and offer advice.
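Before posting a sample, it may help to summarize how the paragraphs in the saved HTML are actually marked up. The sketch below tallies `<p>` tags by class name and by whether they carry an inline `text-indent` style. It is only a diagnostic under the assumption that Word writes the indent as an inline style; indents defined in a stylesheet block would not show up here.

```python
from collections import Counter
from html.parser import HTMLParser


class ParagraphStyleCounter(HTMLParser):
    """Tally <p> tags by class name and by presence of an inline text-indent."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        if tag != "p":
            return
        attributes = dict(attrs)
        css_class = attributes.get("class") or "(no class)"
        style = (attributes.get("style") or "").lower()
        has_indent = "text-indent" in style
        self.counts[(css_class, has_indent)] += 1


def summarize_paragraphs(html_text):
    """Return a Counter keyed by (class name, has inline text-indent)."""
    parser = ParagraphStyleCounter()
    parser.feed(html_text)
    return parser.counts
```

If every paragraph ends up in the same bucket, the distinction between indented and non-indented paragraphs was probably lost when the file was saved, not during the EPUB conversion.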
Old 12-16-2010, 08:52 AM   #11
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by frabjous View Post
PDF is far and away the best ebook format.
Each to their own. PDF produces great output, but I don't consider it to be an ebook format. It's too page oriented, not tolerant of different screen sizes, won't flow well, etc. However, if you output to your desired screen size, and can read that output on your screen, I agree that it's nice to look at.
Old 12-16-2010, 09:58 AM   #12
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.
 
Posts: 8,908
Karma: 12755553
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
Quote:
Originally Posted by Starson17 View Post
Each to their own. PDF produces great output, but I don't consider it to be an ebook format. It's too page oriented, not tolerant of different screen sizes, won't flow well, etc. However, if you output to your desired screen size, and can read that output on your screen, I agree that it's nice to look at.
I agree. PDF might be a great ebook format for a computer, but unless you create the PDF for the exact screen size of your ereader, in the exact font size you wish to read, it is a very bad ereader format. Even then it lacks the real-time flexibility I would expect of any good ebook.
Old 12-16-2010, 11:30 AM   #13
frabjous
Wizard
frabjous can solve quadratic equations while standing on his or her head reciting poetry in iambic pentameter
 
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
The only point I was making was about PDFs you create yourself for your own device. (Though I'd love it if retailers would sell a variety of PDFs formatted for different screens, or let you customize when you download, rather like Feedbooks does.)

But under those conditions, it's still better than anything else. You can use whatever fonts you want, whatever font sizes you want, and true typographical quality like ligatures, pair kerning, margin kerning, paragraph-centric whitespace control, typographical feathering, etc., impossible with any other format.

I don't think we're disagreeing about anything except whether one ought to say "PDF is not an eBook format". I'd qualify that as "PDF is not a one-size-fits-all source eBook format".
Old 12-17-2010, 11:10 AM   #14
gertblij
Enthusiast
gertblij has learned how to buy an e-book online
 
Posts: 39
Karma: 85
Join Date: Oct 2010
Location: Cape Town, South Africa
Device: Kindle 3
Quote:
Originally Posted by EowynCarter View Post
Why not use OpenOffice to open the .doc, then use OpenOffice's ePub plugin?
Thanks Eowyn (Hope that is your real name. I think it is absolutely beautiful).

I tried that, but the epub plugin crashed badly :-(. Having said that, I went to the OOo epub plugin thread in this forum (link somewhere in this thread) and reported the problem. Luke, the developer of the plugin, is great. He asked me for the file concerned and said he would fix the problem in the next release. Great passion. Absolutely love it.

Obviously I sent him the file and am now waiting. Quite sure it will be solved :-)

All times are GMT -4.