Thanks everyone,
It was not until I looked at a .pdf using k2pdfopt in debug mode that I could visualise how a pdf locates the text on the page, words floating in white space.
I'm happy with any amount of mess in the contents or page numbering etc. but the line-breaks are all important. Poetry is a particular pain as there is no standard layout, each is very different and to manually edit 100 pages while constantly referring to the original would mean lots of work.
There would need to be some sort of check in the original .pdf -> .epub process that looked for blank space equal to line-height between lines of text, I think.
Does Mobipocket Creator do anything of that sort?
|