Quote:
Originally Posted by Quoth
You need to use paragraph styles properly in MS Word (or LO Writer), with the heading/outline level set correctly and List style off.
Calibre conversion from docx is practically perfect if the document is styled properly.
|
Quote:
Originally Posted by rosewood
. . .
Thank you, Quoth. For my applications, plain text input is easiest. Hopefully the above XPath expression will see me through from now on. But if the conversion plays up again, I'll give properly styled *.docx a whirl.
|
FWIW, I loaded a plain text file of ~5,300 lines (~44,000 words) into MS Word last week. It was a 1989 Act of Parliament (since repealed) that had obviously come from an OCR scan of the printed original: full of broken paragraphs, shambolic indentation, and so on.
It took me about 12 hours over several sessions to get a DOCX and a PDF that conform to the current standards for such documents, which are very specific. I wouldn't have bothered without the Word template I obtained from the parliamentary library.
BR