Thank you! Managed to progress further.
The failing articles in the "input" directory are already empty. For each failing article I see the following error message:
Code:
Parsing feed_4/article_0/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\base.py", line 857, in first_pass
File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etre
e.c:48634)
File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxm
l.etree.c:72245)
File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:7
1041)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/l
xml/lxml.etree.c:67581)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDo
c (src/lxml/lxml.etree.c:64257)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.e
tree.c:65178)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etr
ee.c:64521)
XMLSyntaxError: xmlParseEntityRef: no name, line 4, column 17
Have you seen anything like this? I am running the latest version of calibre.