Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 10-07-2014, 04:18 AM   #46
pdurrant
The Grand Mouse 高貴的老鼠
pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.
 
pdurrant's Avatar
 
Posts: 74,406
Karma: 318076944
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
You are unlikely to get much success with an automatic conversion of a scanned PDF. You will have to do lots of manual clean up/conversion yourself.
pdurrant is offline   Reply With Quote
Old 10-07-2014, 06:00 PM   #47
ittiandro
Connoisseur
ittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notes
 
Posts: 64
Karma: 24500
Join Date: Nov 2013
Device: JuliusvonJD
What is manual cleanup vs automatic cleanup

Quote:
Originally Posted by pdurrant View Post
You are unlikely to get much success with an automatic conversion of a scanned PDF. You will have to do lots of manual clean up/conversion yourself.
Yes, but could you tell me, if you know, in what consists the " lot of manual clean up/conversion" I am to do manually vs the automatic conversion? Would it be better to convert to Word .doc ( or docx) format and then reconvert to EPUB? This is what I often hear, but I am not too clear.
Or perhaps you can refer me to more precise instructions somewhere else on the Web?I couldn't find any.
Bottom line, I want to have a fully controllable page layout after converting to EPUB.

Thanks

Ittiandro
ittiandro is offline   Reply With Quote
Old 10-07-2014, 08:30 PM   #48
mrmikel
Color me gone
mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.
 
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
You are looking for what is pretty much impossible, fixed layout in an reflowable format. Some machines allow attempts at it.

When the reader presses the increase size button, the formatting will go out the window.

PDF is a format that is fixed and will work if laid out for the appropriate size of your device's screen. But it is not a format which is accepted by the major publishing houses for ebooks.

Any conversion from PDF as a Optical Character Recognition will have a least an error per page, often many more. Converting PDF to Word doesn't change that. Working from an original Word document which has not been OCR'ed will be much better and is supported by Toxaris's add in as well as Atlantis Word Processor. Even so, if you want fixed layout in epubs you are padding with a spoon upstream.
mrmikel is offline   Reply With Quote
Old 10-08-2014, 02:12 AM   #49
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 80,665
Karma: 150249619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Print the PDF, with the pages by your computer keyboard, start typing. You will have to copy the graphics though.

Good luck!
JSWolf is offline   Reply With Quote
Old 10-08-2014, 03:53 AM   #50
pdurrant
The Grand Mouse 高貴的老鼠
pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.
 
pdurrant's Avatar
 
Posts: 74,406
Karma: 318076944
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
Quote:
Originally Posted by ittiandro View Post
Yes, but could you tell me, if you know, in what consists the " lot of manual clean up/conversion" I am to do manually vs the automatic conversion? Would it be better to convert to Word .doc ( or docx) format and then reconvert to EPUB? This is what I often hear, but I am not too clear.
Or perhaps you can refer me to more precise instructions somewhere else on the Web?I couldn't find any.
Bottom line, I want to have a fully controllable page layout after converting to EPUB.
You describe a poor quality scan of a book wrapped as a PDF. To get a reflowable ePub you'll need to do OCR on the scanned pages, and also extract the images from the scans, and then format the extracted text, fixing all the OCR errors and inserting the (cleaned up) images in the text at the right points.

I don't know the best software for doing the OCR. Extract the pages with graphics from the PDF with (say) Adobe Reader as actual image files (e.g. .PNG or .TIFF) and clean them up in your favorite image editing programme (e.g. Photoshop). I'd recommend Sigil for creating/editing the ePub
pdurrant is offline   Reply With Quote
Old 10-08-2014, 07:46 AM   #51
mrmikel
Color me gone
mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.
 
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
Right on, pdurrant. It is all tricks and no treat, except days or weeks later when done.
mrmikel is offline   Reply With Quote
Old 10-10-2014, 12:58 PM   #52
ittiandro
Connoisseur
ittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notes
 
Posts: 64
Karma: 24500
Join Date: Nov 2013
Device: JuliusvonJD
PDF to EPUB conversion does not render non-text areas

Quote:
Originally Posted by JSWolf View Post
First convert from the PDF into HTML and make sure you've fixed all of the errors in the HTML due to converting from PDF. I've never yet seen a PDF converter that does it 100% error free. So it could be the ePub is reflecting the errors in the conversion process from PDF.
I have a PDF physics book which I want to convert to EPUB for reading with my Android tablet. I am trying to convert with the ABBYY Fine Reader but I am having great problems in rendering non text areas in the EPUB conversion. such as diagrams, tables and sketches . Actually they are not rendered at all!
I almost ready to give up! Before that, I might take a last shot with HTML conversion but I don't know what to after it , because my aim is to get an EPUB file. How does an HTML conversion facilitate the EPUB conversion?

Thanks

Ittiandro
ittiandro is offline   Reply With Quote
Old 10-10-2014, 01:06 PM   #53
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 80,665
Karma: 150249619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by ittiandro View Post
I have a PDF physics book which I want to convert to EPUB for reading with my Android tablet. I am trying to convert with the ABBYY Fine Reader but I am having great problems in rendering non text areas in the EPUB conversion. such as diagrams, tables and sketches . Actually they are not rendered at all!
I almost ready to give up! Before that, I might take a last shot with HTML conversion but I don't know what to after it , because my aim is to get an EPUB file. How does an HTML conversion facilitate the EPUB conversion?

Thanks

Ittiandro
Since you do have a tablet, it would be much easier to just use the PDF as a PDF with your tablet. It would be much more hassle to convert then it is worth. Also, if you have any errors in anything important like formulas, you could be screwed. So just keep the PDF as PDF, find a good program to use to view the PDF and you are all set.
JSWolf is offline   Reply With Quote
Old 10-10-2014, 01:29 PM   #54
ittiandro
Connoisseur
ittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notesittiandro can name that song in three notes
 
Posts: 64
Karma: 24500
Join Date: Nov 2013
Device: JuliusvonJD
Converting PDF to EPUB

Quote:
Originally Posted by JSWolf View Post
Since you do have a tablet, it would be much easier to just use the PDF as a PDF with your tablet. It would be much more hassle to convert then it is worth. Also, if you have any errors in anything important like formulas, you could be screwed. So just keep the PDF as PDF, find a good program to use to view the PDF and you are all set.
Yes, I might have no choice but keeping my current PDF format for reading in the tablet with EzPdfReader, because even a software like ABBYY Fine Reader which is very sophisticated, does not display non-text areas in the EPUB cponversion, such as diagrams, sketches or other images. I am sure there must be a way, but it is very time-consuming to do it.
The reason why I wanted to use EPUB instead of my current PDF format is that PDF has a fixed white bright page background which puts a strain on my eyes, whereas EPUB Readers such Cool Reader ( my favorite) and FBReader allow to change the page background and have a wider array of settings. Reading in night mode is an option I do not particularly like.

Thanks

Ittiandro
ittiandro is offline   Reply With Quote
Old 10-11-2014, 06:55 PM   #55
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by ittiandro View Post
Yes, I might have no choice but keeping my current PDF format for reading in the tablet with EzPdfReader, because even a software like ABBYY Fine Reader which is very sophisticated, does not display non-text areas in the EPUB cponversion, such as diagrams, sketches or other images. I am sure there must be a way, but it is very time-consuming to do it.
The reason why I wanted to use EPUB instead of my current PDF format is that PDF has a fixed white bright page background which puts a strain on my eyes, whereas EPUB Readers such Cool Reader ( my favorite) and FBReader allow to change the page background and have a wider array of settings. Reading in night mode is an option I do not particularly like.

Thanks

Ittiandro
Ittiandro:

Did you try outputting Abbyy to WORD, instead of ePUB? That will retain images and graphics.

Hitch
Hitch is offline   Reply With Quote
Old 10-11-2014, 08:26 PM   #56
mrmikel
Color me gone
mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.
 
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
Save To HTML does preserve images and tables also.

BUT you need to go through each page to see how it has analyzed in order to make sure you get complete pictures. It goes a little overboard if it finds text in a graphic. It also doesn't do so well if you have an image that seems to fade out too fast for its liking. But you can tell it where the image boundaries are in these cases and it will pick up the whole thing.

You can also do the same for tables, when it has mistaken them.

Then Read (recognize) and the output is much better.
mrmikel is offline   Reply With Quote
Old 10-12-2014, 03:22 PM   #57
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by mrmikel View Post
Save To HTML does preserve images and tables also.

BUT you need to go through each page to see how it has analyzed in order to make sure you get complete pictures. It goes a little overboard if it finds text in a graphic. It also doesn't do so well if you have an image that seems to fade out too fast for its liking. But you can tell it where the image boundaries are in these cases and it will pick up the whole thing.

You can also do the same for tables, when it has mistaken them.

Then Read (recognize) and the output is much better.
Absolutely. I think the issue/problem here is an expectation of going direct to ePUB, and skipping that step. Direct to ePUB shan't keep the images/figures, etc., as we all know (painfully too well). I think that Tex (Texanns002) has a fairly great lengthy post somewhere around here (hell, is it in this very thread?) about how to competently scan. Wherever it is, it's worth reading, both for newbs and even those of us with a few under our belts.

Hitch
Hitch is offline   Reply With Quote
Old 06-27-2017, 06:18 PM   #58
Shohreh
Addict
Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.
 
Posts: 222
Karma: 304158
Join Date: Jan 2016
Location: France
Device: none
Hello,

This is what an ePUB looks like after uploading the original PDF to http://ebook.online-convert.com:



Before I dive into Caliber… is there a quick, no-brainer way to get a readable ePUB file?

Thank you.
Shohreh is offline   Reply With Quote
Old 06-27-2017, 06:29 PM   #59
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by Shohreh View Post
Hello,

This is what an ePUB looks like after uploading the original PDF to http://ebook.online-convert.com:



Before I dive into Caliber… is there a quick, no-brainer way to get a readable ePUB file?

Thank you.
Shohreh:

Please read the thread. The answer is: no. Now, some person will no doubt come along, and say how great using "save as Word" is, from Acrobat Pro, or using this or that, but my business does this very thing, ALL. DAY. LONG. We have NEVER found any shortcut that makes creating an eBook from a PDF easy or simple or short.

We do everything by HAND. We typically scan/ocr PDFs; then we do a double-proofing round. This is then saved to a Word file format, .doc or .docx. We then clean up the Word file, export that to HTML, clean up THAT, and then, and only then, do we use the HTML to build and ePUB.

That's how it's done.

Do you understand why the file looks that way?

Hitch
Hitch is offline   Reply With Quote
Old 06-28-2017, 02:05 AM   #60
Shohreh
Addict
Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.Shohreh ought to be getting tired of karma fortunes by now.
 
Posts: 222
Karma: 304158
Join Date: Jan 2016
Location: France
Device: none
Thank you.
Shohreh is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Converting Sanskrit PDF to epub sriniamble Calibre 17 11-25-2010 06:10 AM
Problem converting PDF to EPUB in calibre adgpro Calibre 2 07-09-2010 01:10 AM
Problem converting pdf to epub smartin Calibre 3 05-02-2010 06:55 AM
Help with converting PDF to epub neilmarr Sigil 6 11-14-2009 09:26 AM
Formatting issues when converting PDF to EPUB raptir Calibre 2 10-21-2009 10:32 PM


All times are GMT -4. The time now is 04:39 PM.


MobileRead.com is a privately owned, operated and funded community.