Both of those logs in my first two posts end at the same point (even though I let one run go for over two minutes and the other only a few seconds):
Code:
Parsing text/part0035.html ...
********* Heuristic processing HTML *********
replaced 5 nbsp indents with inline styles
There are 1 blank lines. 0.00546448087431694 percent blank
Hard line breaks check returned False
Median line length is 101, calculated with html format
Fixing hyphenated content
Formatting scene breaks
I don't really know how to read that, but if it means it was in text/part0035.html when it got stuck, there are non-breaking spaces in that file. In the original AZW3, those are
Code:
& # x 0 0 A 0 ; (I had to modify that string since even code blocks replaced it
codes. In the file I got from KindleUnpack, there are no codes there at all (just yellow blocks of space. Ditto for what's there if I run the AZW3 through Calibre's Polish Books function (which works fine, though it takes more time than I'd expect). I see references to
and
as being the usual way of creating non-breaking spaces. Could Calibre be having issues with
?