ok - all I can say is that I've been doing that regex remove manually for 2- 3 years, over 100 books I ma sure - & I have never seen that construction anywhere except at the end of a file. it maybe that calibre always breaks after one of those, so that is logically impossible for a calibre conversion to leave only the middle of a html file ( on default structure detect settings anyway )
but ok , tweak to:
find
<div class="mbppagebreak" id="calibre_pb_\d+"></div>
</body>
replace
</body>
|