Quote:
Originally Posted by JohnB
C:\Users\John\EBooks\gutlrf>gutlrf.pl http://www.gutenberg.org/dirs/etext00/notra10h.zip
Extracting files...
Archive missing directory, correcting...
Cleaning HTML...
Wrote cleaned HTML "C:\Users\John\AppData\Local\Temp\notra10h\new.htm "
Converting to BBeB...
Processing new.htm
Parsing HTML...
Converting to BBeB...
Traceback (most recent call last):
File "convert_from.py", line 1844, in <module>
File "convert_from.py", line 1838, in main
File "convert_from.py", line 1744, in process_file
File "convert_from.py", line 259, in __init__
File "convert_from.py", line 367, in add_file
File "convert_from.py", line 489, in parse_file
libprs500.ebooks.ConversionError: new.htm does not seem to have any content
Bad file descriptor at C:\Users\John\EBooks\gutlrf\gutlrf.pl line 260.
I tried tidy on all the files also, which elicited a few warnings, but that didn't change things.
Thanks, JohnB
I found the problem with the book in the link above: the contents.html page contains an HTML comment (<!-- -->) that wasn't formatted properly. With that fixed, it now converts. Hooray! I guess I should learn how to submit corrections to Gutenberg...
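For anyone else hitting the same "does not seem to have any content" error, a quick scan for suspect comments might save some time. This is only a rough sketch: it assumes "not formatted properly" means a comment that is never closed or that has a stray "--" in its body, which may not be what was wrong in this particular book. The function name is made up for illustration and is not part of gutlrf or libprs500.

```python
def find_suspect_comments(html):
    """Return (line_number, reason) pairs for HTML comments that look malformed.

    Hypothetical helper: checks only for unclosed comments and for '--'
    appearing inside a comment body.
    """
    problems = []
    pos = 0
    while True:
        start = html.find("<!--", pos)
        if start == -1:
            break
        # 1-based line number of the comment opener
        line = html.count("\n", 0, start) + 1
        end = html.find("-->", start + 4)
        if end == -1:
            # No closing '-->' anywhere after the opener
            problems.append((line, "comment is never closed"))
            break
        body = html[start + 4:end]
        if "--" in body:
            problems.append((line, "'--' inside comment body"))
        pos = end + 3
    return problems
```

To try it, read a file's text (for example contents.html from the extracted archive) and pass it in; an empty list means neither of the two patterns was found.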
I'm liking gutlrf.