Quote:
Originally Posted by JSWolf
If you'd like, I can send you some HTML output from CLIT and see how you get on converting it if you'd like.
|
Version 2.1.1 (next post!) updates conversion slightly. I tested with the output you provided, and this generates a "correct" book (no extant HTML tags hanging around) when run with the "--strip-html" switch.
However, you will very likely still want to do some editing prior to running
pielrf with any CLIT output. For example, running
pielrf on raw CLIT HTML doesn't give you chapters, so you have one long book. And it's kind of "inelegant" to have the "THIS BOOK IS GREAT!" blurbs, followed by the Copyright Page, followed by the book itself, all in one shot.
So,
pielrf --strip-html DOES work, but I recommend using
striphtml to do the initial cleaning of all tags
pielrf doesn't understand. THEN you can easily add <chapter> tags, and run THAT through
pielrf. This will produce much nicer results, and only take a little extra time (depending on the complexity of the book).
-Pie