![]() |
#1 |
Fanatic
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 580
Karma: 810184
Join Date: Sep 2010
Location: Norway
Device: prs-t1, tablet, Nook Simple, assorted kindles, iPad
|
OOXML/docx XML tools?
I've just converted my first Word document to Epub/Kindle. I ended up transforming the xml code in the docx files to xhtml, which was a less painful experience than working with the HTML output from LibreOffice, which was the alternative. It wasn't too cumbersome, the source document was fairly simple. A lot of cruft to be discarded, and the footnotes needed some handling, but otherwise it was fairly straightforward.
However, I expect things can be quite a bit messier. Does anybody know of XML tools that do a good job of handling OOXML, and ideally transforming it into XHTML? (Yes, I know Toxaris has done excellent work on his plugin. But 1) I don't have Word, and 2) I really would like to handle OOXML within a XML/Xpath framework.) |
![]() |
![]() |
![]() |
#2 |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Maybe take a look at the DOCX Input/Output plugins in calibre?
|
![]() |
![]() |
Advert | |
|
![]() |
#3 | |
Fanatic
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 580
Karma: 810184
Join Date: Sep 2010
Location: Norway
Device: prs-t1, tablet, Nook Simple, assorted kindles, iPad
|
Quote:
![]() On a slightly more serious note: Yeah, should have thought of that myself – thanks for the tip. The relevant code seems readable – unlike OOXML itself... |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Docx Conversion | Hammo | Conversion | 7 | 05-21-2015 01:01 AM |
Software Engineering Tools and Debugging Techniques: Guide to Build Software Tools | amazon author | Self-Promotions by Authors and Publishers | 2 | 04-07-2015 04:02 AM |
After merging all the .xml files, how do you divide it back into .xml files? | automa | Sigil | 10 | 08-13-2013 07:43 AM |
DOCX | orescb | Other formats | 0 | 06-16-2013 09:25 AM |
DOCX Input and DOCX Metadata Reader | SauliusP. | Development | 5 | 06-15-2012 02:17 AM |