02-01-2023, 05:06 PM   #11
BetterRed
null operator (he/him)
Quote:
Originally Posted by Quoth
You need to properly use paragraph styles in MS Word (or LO Writer) with the heading/outline level set properly, and List style off.

Calibre conversion from docx is practically perfect if the document is styled properly.
Quote:
Originally Posted by rosewood
. . .

Thank you Quoth. For my applications, plain text input is easiest. Hopefully the above XPATH expression will see me through from now on. But if the conversion plays up again then I'll give properly styled *.docx a whirl.
FWIW, I loaded a plain text file of ~5,300 lines and ~44,000 words into MS Word last week. It was a 1989 Act of Parliament (since repealed) that obviously came from an OCR scan of the printed original: full of broken paragraphs, shambolic indentation, etc.

It took me about 12 hours over several sessions to get a DOCX and a PDF that conform to the current standards for such documents, which are very specific. I wouldn't have bothered without the Word template I obtained from the parliamentary library.

BR

Last edited by BetterRed; 02-01-2023 at 05:35 PM.