05-08-2023, 09:06 AM   #1
Quoth
Posts: 14,496
Karma: 107078855
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Over enthusiastic "Chapter" detection?

Just yesterday I fixed an ebook conversion from azw3 to epub where every paragraph started a new page. The regular paragraphs had a class called "chapter":
Quote:
<p class="chapter">
and the first paragraph had a class called "chapter-first", which was OK:
Quote:
<p class="chapter-first">
I renamed those classes in the azw3 file with a right-click rename, which updated every HTML file and the CSS file in one go.
Reconverted the azw3 and it was fine.

I've never seen this before on azw3 conversions to epub. I've not edited the Structure detection setting for ages:
Quote:
//*[((name()='h1' or name()='h2') and re:test(., '\s*((chapter|book|section|part)\s+)|((prolog|prologue|epilogue)(\s+|$))', 'i')) or @class = 'chapter']
How do I make it ignore @class = 'chapter' when it's on the body paragraphs?
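
To show why the body paragraphs get caught, here is a minimal Python sketch that emulates the rule above using only the standard library. It is not calibre's own code: the function name detect_chapters, the skip_paragraphs flag, and the sample markup are all made up for illustration. The point is that the last clause, @class = 'chapter', matches any element with that class, so every such paragraph counts as a chapter start.

```python
import re
import xml.etree.ElementTree as ET

# Same regular expression as in the XPath expression, case-insensitive.
# re.search is used because EXSLT re:test matches anywhere in the string.
HEADING_RE = re.compile(
    r"\s*((chapter|book|section|part)\s+)|((prolog|prologue|epilogue)(\s+|$))",
    re.IGNORECASE,
)


def detect_chapters(xhtml, skip_paragraphs=False):
    """Return (tag, text) for every element the emulated rule would flag.

    skip_paragraphs is a hypothetical switch: when True, the class test
    is ignored on <p> elements.
    """
    root = ET.fromstring(xhtml)
    hits = []
    for el in root.iter():
        tag = el.tag.split("}")[-1]      # drop any namespace prefix
        text = "".join(el.itertext())    # string value of the element
        is_heading = tag in ("h1", "h2") and HEADING_RE.search(text) is not None
        has_class = el.get("class") == "chapter"
        if skip_paragraphs and tag == "p":
            has_class = False
        if is_heading or has_class:
            hits.append((tag, text.strip()))
    return hits


# Made-up sample mirroring the book described in the post.
SAMPLE = """<body>
<h1>Chapter 1</h1>
<p class="chapter-first">First paragraph.</p>
<p class="chapter">Second paragraph.</p>
<p class="chapter">Third paragraph.</p>
<div class="chapter">A wrapper that really is a chapter</div>
</body>"""

if __name__ == "__main__":
    print(detect_chapters(SAMPLE))                        # paragraphs are flagged
    print(detect_chapters(SAMPLE, skip_paragraphs=True))  # only h1 and div remain
```

In XPath terms, the equivalent of skip_paragraphs would be to change the last clause to (@class = 'chapter' and name() != 'p'). That is an assumption I have not tested in calibre, so treat it as a starting point rather than a confirmed fix.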