View Single Post
Old 01-02-2023, 04:25 PM   #6
enuddleyarbl
Guru
enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.enuddleyarbl ought to be getting tired of karma fortunes by now.
 
enuddleyarbl's Avatar
 
Posts: 792
Karma: 1538394
Join Date: Sep 2013
Device: Kobo Sage
Both of those logs in my first two posts end at the same point (even though I let one run go for over two minutes and the other only a few seconds):
Code:
Parsing text/part0035.html ...
*********  Heuristic processing HTML  *********
replaced 5 nbsp indents with inline styles
There are 1 blank lines. 0.00546448087431694 percent blank
Hard line breaks check returned False
Median line length is 101, calculated with html format
Fixing hyphenated content
Formatting scene breaks
I don't really know how to read that, but if it means it was in text/part0035.html when it got stuck, there are non-breaking spaces in that file. In the original AZW3, those are
Code:
& # x 0 0 A 0 ; (I had to modify that string since even code blocks replaced it
codes. In the file I got from KindleUnpack, there are no codes there at all (just yellow blocks of space. Ditto for what's there if I run the AZW3 through Calibre's Polish Books function (which works fine, though it takes more time than I'd expect). I see references to
Code:
& n b s p ;
and
Code:
& # 1 6 0 ;
as being the usual way of creating non-breaking spaces. Could Calibre be having issues with
Code:
& # x 0 0 A 0 ;
?

Last edited by enuddleyarbl; 01-02-2023 at 04:30 PM.
enuddleyarbl is offline   Reply With Quote