04-28-2023, 05:06 PM | #1 |
Junior Member
Posts: 4
Karma: 10
Join Date: Apr 2023
Device: none
|
Dunno what I'm doing with converting
I'm attempting to convert a few books a friend wrote to epub format from .docx. I don't know how to create an example/sample that shows the problem, or use the calibre bug report system.
Spoiler:
Trying to do epub but the only formats it's giving me are the original .docx and .opf. The problem is that it won't convert this and three others of the five-book series into epub. Wasn't changing anything other than trying to make it epub format. |
04-28-2023, 05:55 PM | #2 |
Well trained by Cats
Posts: 29,818
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Did you (in error) unzip the plugin ? Plugins stay zipped.
|
04-28-2023, 06:39 PM | #3 | ||
null operator (he/him)
Posts: 20,590
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
Quote:
Code:
InputFormatPlugin: DOCX Input running
on C:\Users\sailo\AppData\Local\Temp\calibre_8k_akbrm \ud5msa5s.docx
DOCX appears to be invalid ZIP file, trying a more forgiving ZIP parser
Traceback (most recent call last):
File "calibre\ebooks\docx\container.py", line 110, in extract
File "calibre\utils\zipfile.py", line 774, in __init__
File "calibre\utils\zipfile.py", line 809, in _GetContents
File "calibre\utils\zipfile.py", line 824, in _RealGetContents
calibre.utils.zipfile.BadZipfile: File is not a zip file
How were these DOCX's created, i.e. which OS and which Word Processor? BR |
||
04-29-2023, 03:01 AM | #4 |
Junior Member
Posts: 4
Karma: 10
Join Date: Apr 2023
Device: none
|
Honestly, I don't know how they were created outside of my friend the author using Word (or something similar) on a Windows-based computer (I'm assuming). The file was emailed to me as-is since I was her beta reader. I never asked since all that mattered at the time was being able to open the file to read the story and give feedback.
I'm wondering if converting it to an OpenOffice format (on a Win10 laptop) will help? |
04-29-2023, 03:41 AM | #5 |
Evangelist
Posts: 482
Karma: 2267928
Join Date: Nov 2015
Device: none
|
So, can you open those files in a word processor or an archive manager?
|
04-29-2023, 06:02 AM | #6 |
the rook, bossing Never.
Posts: 11,171
Karma: 85874891
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Don't convert to Open Office or Libre Office!*
Open Office is dead, replaced by Libre Office. There is an odt conversion on Calibre, but I found an extra Save As in docx from Libre Office Writer works better and Kovid confirmed the docx input is better. If you are using MS Word, open the docx and save as in Word 2007 or later docx with a different name. Open that with Calibre. Also for CSS & HTML to work well the docx should use styles. Headings only when you want a page break. No headers, footers or page numbers. All images to be embedded, not external. Lists should use a paragraph style that looks the same but without auto numbers, letters, Roman numerals or bullets. Type in the item ID manually. This is because while Word Processor lists will convert well to HTML for web pages, actual HTML list formatting (especially auto numbering) is poorly supported on actual physical ereaders. I've been doing docx to epub with Calibre for years and style to CSS is perfect. I only need to edit some images' CSS if they are to be a percent of width or height (other property set to auto). * Obviously convert docx to odt if editing in LO Writer, but an extra Save As in docx at the end for Calibre and don't edit that docx in LO Writer, only the odt. |
04-29-2023, 03:57 PM | #7 |
Junior Member
Posts: 4
Karma: 10
Join Date: Apr 2023
Device: none
|
I don't use MS Word, only OpenOffice. Guess I'm just stuck where I am as mentally it's hard for me to consider yet another word processor since OO has been working for me for years.
|
04-29-2023, 04:06 PM | #8 |
Well trained by Cats
Posts: 29,818
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
04-29-2023, 04:47 PM | #9 | |
null operator (he/him)
Posts: 20,590
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
A few years ago we had similar problems when one of our 'colleagues' started using an obscure OS/X word processor (not Pages) to edit a DOCX file that originated in LO Writer… the DOCX it produced couldn't be read by Writer or Word. BR Last edited by BetterRed; 04-29-2023 at 06:04 PM. |
|
04-29-2023, 11:32 PM | #10 |
Junior Member
Posts: 4
Karma: 10
Join Date: Apr 2023
Device: none
|
I wasn't going to, but I uninstalled OO and installed LO to make it work. Guess I better just get used to things... I'm just a stick in the mud about certain things, I suppose. Figured out the hard way that .docx and .doc aren't the same (only the first two books were in .docx format).
I didn't feel like asking her what her OS/word processor was. |
04-30-2023, 12:21 AM | #11 |
Evangelist
Posts: 482
Karma: 2267928
Join Date: Nov 2015
Device: none
|
So can you open the files with a word processor or not?
|
04-30-2023, 05:33 AM | #12 | |
the rook, bossing Never.
Posts: 11,171
Karma: 85874891
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Quote:
Do edit in odt and Save As an EXTRA copy in .docx for people or programs that need it and never edit that .docx. It's because OO and LO Writer always convert any doc or docx read and then when you save in doc or docx a second conversion is done. So when you read a doc or docx into OO or LO you have to check: Hyperlinks to internal anchors/bookmarks. Headings Paragraph, character and graphic styles Reset page format of blocks of pages (Only for PDF or paper print should more than one page style exist; for ebooks only one page style). Inline Contents may need rebuilt Page breaks and which headings set them No heading or paragraph styles should have defined line spacing, so that the ebook CSS has no line-heights set anywhere. Some or all of this needs done even for people using only MS Word because the different versions behave differently (even for docx), fonts on another computer may not have been included, Word also used metrics of default printer (turn that off in LO Wriiter). This is why PDF was invented. Looks same & prints same everywhere unlike MS Word, but it's a delivery document, hence not meant to be edited, not meant to reflow, not meant to resize/change font, line spacing, margins etc like real ebooks. |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
K5 Need Some Advice Unbricking A Kindle Touch - Spent Weeks Trying And Dunno What To Do! | shrtcphr | Kindle Developer's Corner | 10 | 01-11-2019 07:45 PM |
Help converting? | John F | Conversion | 5 | 06-16-2011 10:52 AM |
Best way of converting | PieOPah | Workshop | 8 | 09-18-2009 11:25 AM |
need help converting | darktower | Sony Reader | 1 | 09-25-2008 01:30 AM |
Converting | steverobbo | Sony Reader | 2 | 03-24-2008 12:03 AM |