MobileRead Forums

MobileRead Forums (https://www.mobileread.com/forums/index.php)
-   ePub (https://www.mobileread.com/forums/forumdisplay.php?f=179)
-   -   LibreOffice 6.0 epub2/epub3 output test (https://www.mobileread.com/forums/showthread.php?t=293575)

Doitsu 01-03-2018 09:25 AM

LibreOffice 6.0 epub2/epub3 output test
 
2 Attachment(s)
Since the upcoming LibreOffice 6.0 version will have epub2/epub3 output support, I tested a LibreOffice 6.0 pre-release version with a demo file originally created by Kovid Goyal to test the Calibre DOCX filter.

I wasn't exactly impressed by the output. The epub2 version had duplicate header title tags in each file and both versions had one broken link.

The converter also couldn't handle footnotes and endnotes and, most annoyingly, all generated HTML files contained inline styles.

Bertrand 01-04-2018 04:13 AM

Wow, what a soup ...
Good luck if you need to edit the code outside LibreOffice.

Sarmat89 01-04-2018 05:10 AM

Well, EPUB is not a semantic markup format, and it cannot be created with a text editor, so this is expected.

Plus, LibreOffice has a one-trick-pony XHTML exporter, which doesn't even try to simplify the output.

DaleDe 01-04-2018 01:48 PM

Better to use Atlantis word processor as it creates very nice compact ePub files.

JSWolf 01-04-2018 01:54 PM

Quote:

Originally Posted by Sarmat89 (Post 3635580)
Well, EPUB is not a semantic markup format, and it cannot be created with a text editor

Actually, ePub can be created with a text editor. It's not all that easy, but it can be done.
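
To make that concrete, here is a rough sketch of what "by hand" amounts to: an EPUB is a ZIP archive whose first entry is an uncompressed `mimetype` file, plus a `META-INF/container.xml` that points at the package (OPF) file. The file names under `OEBPS/`, the book title, and the all-zero UUID are placeholders made up for this example, and the result has not been run through Epubcheck, so treat it as a starting point rather than a guaranteed-valid book.

```python
import zipfile

# Every string below is something you could type in a plain text editor.
# Names under OEBPS/, the title and the UUID are placeholders (assumptions).
BOOK_ID = "urn:uuid:00000000-0000-0000-0000-000000000000"

FILES = {
    "META-INF/container.xml": """<?xml version="1.0" encoding="UTF-8"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="OEBPS/content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
""",
    "OEBPS/content.opf": f"""<?xml version="1.0" encoding="UTF-8"?>
<package xmlns="http://www.idpf.org/2007/opf" version="2.0" unique-identifier="bookid">
  <metadata xmlns:dc="http://purl.org/dc/elements/1.1/">
    <dc:title>Hand-made test book</dc:title>
    <dc:language>en</dc:language>
    <dc:identifier id="bookid">{BOOK_ID}</dc:identifier>
  </metadata>
  <manifest>
    <item id="ncx" href="toc.ncx" media-type="application/x-dtbncx+xml"/>
    <item id="ch1" href="chapter1.xhtml" media-type="application/xhtml+xml"/>
  </manifest>
  <spine toc="ncx">
    <itemref idref="ch1"/>
  </spine>
</package>
""",
    "OEBPS/toc.ncx": f"""<?xml version="1.0" encoding="UTF-8"?>
<ncx xmlns="http://www.daisy.org/z3986/2005/ncx/" version="2005-1">
  <head><meta name="dtb:uid" content="{BOOK_ID}"/></head>
  <docTitle><text>Hand-made test book</text></docTitle>
  <navMap>
    <navPoint id="np1" playOrder="1">
      <navLabel><text>Chapter 1</text></navLabel>
      <content src="chapter1.xhtml"/>
    </navPoint>
  </navMap>
</ncx>
""",
    "OEBPS/chapter1.xhtml": """<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Chapter 1</title></head>
  <body><h1>Chapter 1</h1><p>Hello, reader.</p></body>
</html>
""",
}


def build_epub(path):
    """Write a minimal EPUB 2 style archive to `path`."""
    with zipfile.ZipFile(path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        # The mimetype entry has to come first and must be stored uncompressed.
        zf.writestr("mimetype", "application/epub+zip",
                    compress_type=zipfile.ZIP_STORED)
        for name, text in FILES.items():
            zf.writestr(name, text)
```

Calling `build_epub("hand_made.epub")` writes the archive; the only step a text editor alone can't do is the zipping.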

arjaybe 01-04-2018 02:28 PM

Quote:

Originally Posted by DaleDe (Post 3635800)
Better to use Atlantis word processor as it creates very nice compact ePub files.

It has poor ODT support and is only for Windows.

JSWolf 01-04-2018 02:36 PM

Quote:

Originally Posted by arjaybe (Post 3635821)
It has poor ODT support and is only for Windows.

And those points are an issue why?

mobama 01-05-2018 12:35 PM

@Doitsu
How does it compare with the epub plugin in the earlier version of LibreOffice?

Toxaris 01-05-2018 12:37 PM

Quote:

Originally Posted by Sarmat89 (Post 3635580)
Well, EPUB is not a semantic markup format, and it cannot be created with a text editor, so this is expected.

Yes, it can. If I can write an add-in that does it (for Word, granted), then they can do it as well. The other example given in this thread, Atlantis, is a fine word processor, and it can do it as well with reasonable results.

Doitsu 01-05-2018 02:30 PM

2 Attachment(s)
Quote:

Originally Posted by mobama (Post 3636331)
@Doitsu
How does it compare with the epub plugin in the earlier version of LibreOffice?

I no longer have the old version installed, but since its converter was based on writer2html, I converted the same test file to epub2 and epub3 files using the standalone version of writer2html.

IMHO, the results are much better. It also created lots of inline styles*, but at least footnotes and lists survived the conversion.
If I had to pick one, I'd definitely pick the old converter.

* Inline styles can be easily converted to classes with KevinH's RemoveInLineStyles Sigil plugin.
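
For anyone curious what such a clean-up involves, below is a minimal sketch of the general idea. It is not how the RemoveInLineStyles plugin is implemented (I haven't looked at its code); it is just a regex pass that maps each distinct `style="..."` value to a generated class. It assumes well-formed XHTML with double-quoted attributes, elements that don't already carry a `class` attribute, and style values without `;` inside `url(...)`. The function name and the `s1`, `s2`, ... class names are invented for the example.

```python
import re

# Matches a double-quoted style attribute together with its leading whitespace.
STYLE_ATTR = re.compile(r'\sstyle="([^"]*)"')


def normalise(style):
    """Tidy spacing so 'color:red' and 'color: red;' compare equal."""
    parts = []
    for decl in style.split(";"):
        decl = decl.strip()
        if not decl:
            continue
        prop, sep, value = decl.partition(":")
        parts.append(f"{prop.strip()}: {value.strip()}" if sep else decl)
    return "; ".join(parts)


def inline_styles_to_classes(xhtml):
    """Return (new_xhtml, css) with every inline style replaced by a class.

    Assumption: no element already has a class attribute; otherwise this
    naive version would emit a second one.
    """
    class_for_style = {}

    def replace(match):
        key = normalise(match.group(1))
        if not key:
            return ""  # drop empty style attributes entirely
        name = class_for_style.setdefault(key, f"s{len(class_for_style) + 1}")
        return f' class="{name}"'

    new_xhtml = STYLE_ATTR.sub(replace, xhtml)
    css = "\n".join(f".{name} {{ {style}; }}"
                    for style, name in class_for_style.items())
    return new_xhtml, css
```

The returned CSS text would still have to be saved to a stylesheet and linked from each file, which is the part a dedicated plugin takes care of for you.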

Tex2002ans 01-05-2018 07:06 PM

Quote:

Originally Posted by Doitsu (Post 3635106)
I wasn't exactly impressed by the output. The epub2 version had duplicate header title tags in each file and both versions had one broken link.

This EPUB support is still at the bare-bones stage. I found out about it on the LibreOffice Wiki for v6.0:

https://wiki.documentfoundation.org/....0#New_filters

This also linked to the author's blog post about adding EPUB3 support.

This is a good first step. :)

Quote:

Originally Posted by JSWolf (Post 3635829)
Quote:

Originally Posted by arjaybe (Post 3635821)
It has poor ODT support and is only for Windows.

And those points are an issue why?

Because many people use ODT documents and are on non-Windows OSes?

As the LibreOffice EPUB import/export matures, there will be less need for many of the current third-party workarounds (save as DOCX and run it through Calibre, etc.).

And maybe some of the libraries used in LibreOffice will tangentially lead to better ODT input on Calibre's side of things.

Notjohn 01-07-2018 10:01 AM

Quote:

Originally Posted by DaleDe (Post 3635800)
Better to use Atlantis word processor as it creates very nice compact ePub files.

Thanks for that information! I collect votes on behalf of purpose-built software like Atlantis, Jutoh, Scrivener, Vellum, etc., since I don't use any of them myself.

roger64 02-13-2018 10:47 AM

1 Attachment(s)
Quote:

Originally Posted by Doitsu (Post 3636397)
I no longer have the old version installed, but since its converter was based on writer2html, I converted the same test file to epub2 and epub3 files using the standalone version of writer2html.

IMHO, the results are much better. It also created lots of inline styles*, but at least footnotes and lists survived the conversion.
If I had to pick one, I'd definitely pick the old converter.

* Inline styles can be easily converted to classes with KevinH's RemoveInLineStyles Sigil plugin.

writer2xhtml can directly create quite clean EPUB2 or EPUB3 files without any inline styles if you carefully select its (somewhat convoluted) configuration options. The source ODT is expected to be written using styles, which writer2xhtml obviously cannot invent on the user's behalf.

I opened the source DOCX file with LO6 and converted it to ODT format. Then I used odtImport (a Sigil plugin) to export an EPUB3 directly. In the attached zip file you'll find the ODT and the EPUB3.

I only batch-renamed the HTML files to XHTML. Everything else is straight from the converter. Epubcheck reports a missing image. writer2xhtml displays all four images because the two green dots are the same image, which it displays with different relative settings.

This version of writer2xhtml is an alpha of version 1.6, compiled by Doitsu from the source repository. :2thumbsup

Edit: for reasons I can't explain, the ODT file is 4.3 MB, but the resulting EPUB is only 32 KB...

Sarmat89 02-21-2018 11:25 AM

Your ODF file contains 18 embedded fonts.
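
If you want to check this kind of thing yourself: both ODT and EPUB files are ZIP archives, so listing the biggest entries usually shows where the size goes. A small sketch using only the Python standard library follows; the function name `largest_members` and the `top` parameter are made up for the example.

```python
import zipfile


def largest_members(path, top=10):
    """List the biggest entries of a ZIP-based document (ODT, EPUB, DOCX).

    Returns (name, uncompressed_size, compressed_size) tuples, largest
    compressed size first.
    """
    with zipfile.ZipFile(path) as zf:
        infos = sorted(zf.infolist(),
                       key=lambda info: info.compress_size,
                       reverse=True)
        return [(info.filename, info.file_size, info.compress_size)
                for info in infos[:top]]
```

Running it on the ODT and printing the tuples should put embedded font files near the top of the list, if they are what accounts for the size difference.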

Apparition B5 04-22-2018 06:59 PM

According to the update in this blog post, EPUB support has been substantially improved in the LibreOffice master trunk.


All times are GMT -4.