View Single Post
Old 02-27-2014, 03:31 PM   #5
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by AIDM2 View Post
That explains it! Thanks! Now, going by DaleDe's post, it seems like ABBYY FineReader uses a stripped down implementation of epub. Meaning they don't allows users to use the full extent of the format. Any idea why this is?
Well the EPUB export was just introduced in Finereader 11.... so they probably have tweaks they can do in making the export better in future versions (I see Finereader 12 was just recently released earlier this month, although no mention of better EPUB support).

Since the document is stored in a proprietary format, the Finereader engine itself probably transforms their data into some intermediary, and then transform THAT into the desired output format:
  • When choosing "Formatted Text"
    • Step 1: Full Representation
    • Step 2: Formatted Text intermediary (stripping out most formatting, making it left aligned, only carrying over basic formatting)
    • Step 3: Export to chosen format
  • When choosing "Plain Text"
    • Step 1: Full Representation
    • Step 2: Plain Text intermediary (stripping everything, only carrying over basic text)
    • Step 3: Export to chosen format.

So in order to support a new format, they probably only have to create a new "template" at Step 3, to take everything in that intermediary, and convert it into its equivalent in the output format.

So it probably isn't as simple as just "let's add center/right/justified justification when exporting to EPUB". An update to transfer over alignment in Step 2 would mean having to update all of the Step 3 transformations (most likely very complex) for all the other supported formats.

If you want to carry over more formatting, you would have to choose one of the many formats supported in "Exact Copy", "Editable Copy" or "Flexible" (HTML).

I personally think that the EPUB output is the best/most minimal output, with the cleanest code. Then you can easily just add your little tweaks (alignment, headings, split chapters, etc. etc.) using Sigil or Calibre's Editor. If you need any help with that, I would be glad to help... I use Finereader every day converting books.

Last edited by Tex2002ans; 02-27-2014 at 03:38 PM.
Tex2002ans is offline   Reply With Quote