Determining best epub candidates for Edit ToC.
While using the new Edit ToC feature (with belated thanks to Kovid!) to work through hundreds of epubs that currently lack a useful ToC, I started wondering how to better isolate the epubs where Edit ToC is most likely to be effective.
The "Generate ToC from files" option has been the one most likely to work, since an epub without a useful ToC typically also lacks links or headers that would help. The key factor for the "from files" option to work well seems to be whether the number of html files within the epub is slightly higher than the number of chapters one would expect in the book.
Does anyone know of an existing feature in Calibre or any of the plugins (or a regular expression) that could provide a count of the number of html files within the epub, so that the count could be put into a temporary sort column?
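In case no built-in feature turns up, here is a minimal sketch of one way to get that count outside Calibre. It relies only on the fact that an epub is a zip archive and uses Python's standard library. The function name `count_html_files` and the list of extensions are my own assumptions, and getting the number into a custom column is left open.

```python
import sys
import zipfile
from pathlib import Path, PurePosixPath

# Assumption: content files use one of these extensions.
HTML_EXTENSIONS = {".html", ".htm", ".xhtml"}


def count_html_files(epub_path):
    """Return the number of HTML/XHTML files inside an epub (a zip archive)."""
    with zipfile.ZipFile(epub_path) as epub:
        return sum(
            1
            for name in epub.namelist()
            if PurePosixPath(name).suffix.lower() in HTML_EXTENSIONS
        )


if __name__ == "__main__":
    # Usage: python count_html.py /path/to/folder/with/epubs
    for path in sorted(Path(sys.argv[1]).rglob("*.epub")):
        print(f"{count_html_files(path)}\t{path}")
```

The count includes cover and title pages, so it will normally be a few higher than the chapter count, which is the pattern described above.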
I am already using a feature in the Quality Check plugin (Check ePub structure / Check NCX TOC with < 3 entries) to separate out most of the epubs without a useful TOC, and I can't think of a better way to do that in bulk.
Any other suggestions for sorting out ePubs that are the best candidates for Edit ToC would also be great, including identifying those where there are in fact links and headers that might provide an effective alternative when "from files" will not work.
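As a rough idea of how headers and links could be detected in bulk, the sketch below counts heading tags and `<a href>` links across the html files of an epub, again using only Python's standard library. The names `count_headings_and_links` and `TagCounter` are hypothetical, and treating h1 to h3 as "headers" is an assumption; a high heading count relative to the file count might suggest that a ToC built from headings is worth trying.

```python
import zipfile
from html.parser import HTMLParser
from pathlib import PurePosixPath

# Assumptions: which extensions hold content, and which tags count as headers.
HTML_EXTENSIONS = {".html", ".htm", ".xhtml"}
HEADING_TAGS = {"h1", "h2", "h3"}


class TagCounter(HTMLParser):
    """Counts heading tags and anchors that carry an href attribute."""

    def __init__(self):
        super().__init__()
        self.headings = 0
        self.links = 0

    def handle_starttag(self, tag, attrs):
        if tag in HEADING_TAGS:
            self.headings += 1
        elif tag == "a" and any(key == "href" for key, _ in attrs):
            self.links += 1


def count_headings_and_links(epub_path):
    """Return (heading_count, link_count) summed over all html files."""
    counter = TagCounter()
    with zipfile.ZipFile(epub_path) as epub:
        for name in epub.namelist():
            if PurePosixPath(name).suffix.lower() in HTML_EXTENSIONS:
                text = epub.read(name).decode("utf-8", errors="replace")
                counter.feed(text)
    return counter.headings, counter.links
```

This does not distinguish chapter headings from other headings, or internal links from external ones, so the numbers are only a first filter.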
The innovative simplicity and effectiveness of Edit ToC is greatly appreciated, Kovid!