Im new here, so I apologize if this is super rough.
Im converting a book LGBTQ+ Support and Care from Amazon. It is copyright protected. Im not sure how to create a extract/sample. I tried to attach the origional file but it doesnt seem to take.
There was no bug report. It seems to think everything went well
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 218, in parse_html
File "calibre\utils\xml_parse.py", line 26, in safe_xml_fromstring
File "src/lxml/etree.pyx", line 3237, in lxml.etree.fromstring
File "src/lxml/parser.pxi", line 1896, in lxml.etree._parseMemoryDocument
File "src/lxml/parser.pxi", line 1777, in lxml.etree._parseDoc
File "src/lxml/parser.pxi", line 1082, in lxml.etree._BaseParser._parseUnicodeDoc
File "src/lxml/parser.pxi", line 615, in lxml.etree._ParserContext._handleParseResultDoc
File "src/lxml/parser.pxi", line 725, in lxml.etree._handleParseResult
File "src/lxml/parser.pxi", line 654, in lxml.etree._raiseParseError
File "<string>", line 127
lxml.etree.XMLSyntaxError: Attribute _ redefined, line 127, column 1172
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 224, in parse_html
File "calibre\ebooks\oeb\parse_utils.py", line 105, in html5_parse
ValueError: HTML 5 parsing resulted in a tree with nesting depth > 100
Forcing index.html into XHTML namespace
Stripping comments from index.html
Generating default TOC from spine...
Merging user specified metadata...
Detecting structure...
Auto generated TOC with 0 entries.
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Cleaning up manifest...
Trimming unused files from manifest...
Trimming 'images/00001.jpg' from manifest
Trimming 'images/00002.jpg' from manifest
Creating PDF Output...
Converting input as a text based book...
Merged 2 instances of ArialMT reducing size from 106.8 KB to 81.3 KB
Merged 2 instances of Arial-BoldMT reducing size from 84.4 KB to 58.9 KB
Merged 2 instances of Arial-BoldItalicMT reducing size from 85.4 KB to 62.2 KB
PDF output written to C:\Users\jedik\AppData\Local\Temp\calibre_pvlca73k \c7l0docc.pdf
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 218, in parse_html
File "calibre\utils\xml_parse.py", line 26, in safe_xml_fromstring
File "src/lxml/etree.pyx", line 3237, in lxml.etree.fromstring
File "src/lxml/parser.pxi", line 1896, in lxml.etree._parseMemoryDocument
File "src/lxml/parser.pxi", line 1777, in lxml.etree._parseDoc
File "src/lxml/parser.pxi", line 1082, in lxml.etree._BaseParser._parseUnicodeDoc
File "src/lxml/parser.pxi", line 615, in lxml.etree._ParserContext._handleParseResultDoc
File "src/lxml/parser.pxi", line 725, in lxml.etree._handleParseResult
File "src/lxml/parser.pxi", line 654, in lxml.etree._raiseParseError
File "<string>", line 127
lxml.etree.XMLSyntaxError: Attribute _ redefined, line 127, column 1172
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 224, in parse_html
File "calibre\ebooks\oeb\parse_utils.py", line 105, in html5_parse
ValueError: HTML 5 parsing resulted in a tree with nesting depth > 100
Forcing index.html into XHTML namespace
Stripping comments from index.html
Parsing styles.css ...
Generating default TOC from spine...
Merging user specified metadata...
Detecting structure...
Auto generated TOC with 0 entries.
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Cleaning up manifest...
Trimming unused files from manifest...
Trimming 'images/00001.jpg' from manifest
Trimming 'images/00002.jpg' from manifest
Creating EPUB Output...
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in index.html...
No large trees found
This EPUB file has no Table of Contents. Creating a default TOC
EPUB output written to C:\Users\jedik\AppData\Local\Temp\calibre_pvlca73k \3_z1y6db.epub
Moderator Notice Please use spoiler tags for logs
The output file is attached. Ideally, we need this as a pdf, as well as epub. Both come out as garbeled
Im using all default options except on epub, I have to raise the split larger files up, I moved it to 10,000 kb
Included are the epub, the pdf, and a word doc I tried.