View Single Post
Old 03-11-2017, 04:54 PM   #2
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by Difermo View Post
The books I need to put is for civil engineering. So there is a lot of math trigonometry, equations, integrals, lots of diagrams, tables.
I suspect these are just PDF scans of books?

Your best bet would probably be reading this as a PDF on a larger screen (tablet/monitor).

Depending on the PDF, you may be able to do some cropping to make it a bit easier to read on your device. For example, using a tool like k2pdfopt:

https://www.mobileread.com/forums/sh...d.php?t=144711

but most of the time it just might not be possible to shrink a very large and complex 8.5"x11" page into a smaller screen.

Converting this type of complex material to a proper ebook is EXTREMELY labor intensive... and if the publisher doesn't release an ebook directly from the source material... it probably wouldn't be worth the time invested for a single individual to OCR (easily tens/hundreds of hours).

The more complex the layout (multi-column, lots of footnotes, tables, figures, captions, equations, [...]) the harder the books are to convert using OCR + the more manual intervention would be needed to fix all the broken formatting.

Quote:
Originally Posted by Difermo View Post
I need to put some books in my eReader. It is kindle paperwhite. But if mobi or azw3 can not suport that i'm ready to buy some epub reader.
Dedicated ereaders can read PDFs... but the experience is typically very poor: slow/sluggish page turns, having to pan/scan, not being able to easily highlight text or take notes, can't resize text, etc. etc.

For example, here is The Digital Reader showing off PDFs on a Kobo Aura One (Kindles/Nooks/others are similar):

http://the-digital-reader.com/2016/0...just-no-video/

Quote:
Originally Posted by Difermo View Post
I'm sorry for openin new thread if one is already there. I searched on google and all what i found is to old. Date is 2012 or 2009. And I gess a lot have change since then.
The largest change in this kind of material is probably MathML in EPUB3.

Equations/Formulas? Each equation is going to have to be included as bitmap images (or SVG or MathML).

Each and every equation would require laborious double-checking to make sure it is correct and require some serious markup.

The only program/engine I know of that handles OCRing Formulas is InftyReader:

http://www.sciaccess.net/en/InftyReader/

and that costs $800+.

Side Note: Back in 2013 I wrote a topic, "Tutorial: Formulas to PNG", where I sort of show off one method of digitizing equations (using LibreOffice Math):

https://www.mobileread.com/forums/sh...d.php?t=223254

I have used this to recreate formulas in books that had <50 equations... now I tend to prefer using LaTeX as a middleman... but it STILL requires a massive amount of manual work per equation. I shudder to think how long it would take working on a book that is as full as your example pages.

Last edited by Tex2002ans; 03-11-2017 at 05:10 PM.
Tex2002ans is offline   Reply With Quote