It looks like there are page-breaks that can be removed by using Regex. I would merge all the html files (starting with the file where the code is messed up) and then find all the codes that have_pb# in them (or whatever the last line in each html file is for a page-break) and delete all those lines. Then convert the file again to create the individual html files back to the way they should be split.
|