Quote:
Originally Posted by BetterRed
@exaltedwombat - the screenshot in first post, presumably from Word, is riddled with anomalies - I've marked those I can see at a glance with my eyeballs, afaik conversion by almost any means will faithfully convert them to epub :
Attachment 168387
Given you have the document in Word, it should only take a short while to fix most of the anomalies with simple Word macros and Tox's ePub Tools.
I hazard it's a scanned PDF. As has so often been said - getting a perfect conversion of a scanned PDF is tedious. Hitch has explained how they do it in her business on numerous occasions - she probably has it tucked away in her paste buffer
BR
|
Actually, I think it might be a "save as...Word" type product, (when he says that Acrobat has "OCR"ed the file) but it's 6 of 1, half-dozen of another, almost literally. That's what I see, too.
I am struggling to figure out what's being asked. My last take on this is that when he puts the text into Sigil (?), he's NOT seeing indents. If that's the case, it's simply the Styles. (Or, if he's copy-pasting into Sigil, same thing--the styles have to be created/set up in the CSS.)
If that's not it, then I still don't understand the question. I thought it was "how do I NOT import the broken paragraph pilcrows/codes," but his last post indicates that isn't the question.
So....at this point, your guess is as good as mine.
Hitch