Okay,
Let me know if you have any plugin related questions.
Sigil's embedded python has access to a variety of parsers built right in including lxml, our own bs4 variation, html5lib, google's gumbo parser, and an easy to use python only serial parser called QuickParser, so there are lots of tools to use. Also we have cssselect that works with lxml, and cssutils if you need to parse css stylesheets. Plus access to opf_parser.py to extract info from the opf as needed.
BTW: as Sigil's epub3 handling improves, we will have need of an SMIL dtd for parsing SMIL files. If you have access to one with a usable license, I would love to know where to find it. Alternatively, we could develop specialized python for doing just that not using lxml at all.
Thanks,
KevinH
|