Since he is trying to keep the HTML TOC, gutting it might not work so easily.
I wonder if HTTrack, a website copier, might allow him to follow only one track at a time. It has the option of excluding selected directories. It could be run several times to yield separate outputs.
Much depends on exactly how the thing is structured.
|