As a minor aside, the pdf conversions can be often fixed in by converting to word and using various replacements; i.e. modify the line breaks, either paragraph marks ( ^p in Word ) or end-of-line marks ( ^l in Word ).
I've added a commen conversion pattern below but youll almost definitely need to change it - the trick is to differentiate between a paragraph return and a 'false' return.
convert ^p^p^p to ^p^p
convert ^p^p to @@@@@@
convert ."^p to ."######
?"^p to ?"######
!"^p to !"######
.^p to .######
.)^p to .)######
remove all ^p - may have to replace with a space, depends on doc (can crash a lot here)
convert ###### to ^p
convert @@@@@@ to ^p^p
Conversion rules vary by file but you can end up turning any weirdly formatted Gutenburg text into a properly formatted epub in a few steps without having to resort to much manual typing. Apologies if this is old news to you.
|