Quote:
Originally Posted by Tex2002ans
May I ask what the process was to go from PDF -> EPUB? (This is the area where I do most of my work, I have converted ~180 books so far).
I explained and posted a rough outline of most of my OCR methods (I use Finereader + Sigil) in this topic: https://www.mobileread.com/forums/sho...d.php?t=223817
|
Great thread! I've bookmarked it. PDF->epub is via Calibre (I know the PDF mantra) or Mobipocket. Still not optimal but I got what I got.
Quote:
Originally Posted by Tex2002ans
I would recommend turning off the validate on Save, and keeping the Preview window open instead (View - Preview (F10)):
If you have the Preview window open while you are coding, you will see all of the changes in real time. If you make a mistake in the code, the Preview window will go bonkers and tell you (plus you can see the changes as you make them).
|
I do use the Preview window and definitely get the bonkers a
lot. That's why I have it validate each time. Doesn't take that much longer. This is a rather complicated doc with lots of different lists, font formats, etc. (I'll upload the PDF doc for you to see.)
Quote:
Originally Posted by Tex2002ans
Could have been a corrupt image. Usually I try to take care of all of that at the PDF cleaning + OCR stage, so that crap never even makes it into the EPUB files.
|
Probably a LOT of corrupt images! Not sure how to deal with that other than deleting them in Sigil (or tweaking in Adobe and re-inserting them).
Quote:
Originally Posted by Tex2002ans
May I ask what the book is? That is a mighty large book... and let me warn you beforehand, converting a book this large does become QUITE the daunting task. It feels like you barely make any progress after you have been working on it for HOURS, and really sucks the morale right out of you.
|
Will get you a copy... I'm doing it strictly for myself but it's probably not Kosher. Hours? Can you say weeks and I'm just nearing the middle point of the first (of two) books. I feel my progress each time I start a new subject/chapter!
Great input Tex2002as!