Rudy Rucker on Making Epubs


Snooty Baronet
09-24-2011, 03:30 AM
The science fiction author Rudy Rucker has just posted a useful, basic guide to creating epubs using common software:

http://www.rudyrucker.com/blog/2011/09/23/making-your-own-ebooks/

Rudy uses Adobe Dreamweaver to filter Word crap from his (already) filtered HTML files. Is there a free HTML editor that does something similar?

Toxaris
09-24-2011, 05:01 AM
You can easily remove it by hand. It is not that complicated, and a few good regular expressions will help.
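As a rough illustration of the regex approach, here is a minimal Python sketch. The patterns are assumptions about what Word's filtered HTML typically contains (conditional comments, <o:p> tags, Mso* classes, inline styles, and span wrappers), and the function name clean_word_html is hypothetical. It is deliberately aggressive: it strips every style attribute and every span, so check the result against your own files before relying on it.

```python
import re

# Each entry is (pattern, replacement). These patterns are illustrative
# assumptions about Word output, not an exhaustive or official list.
WORD_PATTERNS = [
    # Conditional comments such as <!--[if gte mso 9]> ... <![endif]-->
    (re.compile(r"<!--\[if.*?<!\[endif\]-->", re.DOTALL | re.IGNORECASE), ""),
    # Office namespace tags such as <o:p> and </o:p>
    (re.compile(r"</?o:p[^>]*>", re.IGNORECASE), ""),
    # class attributes whose value starts with Mso (e.g. MsoNormal)
    (re.compile(r'\s+class="?Mso[A-Za-z0-9]*"?', re.IGNORECASE), ""),
    # All double-quoted inline style attributes
    (re.compile(r'\s+style="[^"]*"', re.IGNORECASE), ""),
    # All opening and closing span tags (their text content is kept)
    (re.compile(r"</?span[^>]*>", re.IGNORECASE), ""),
]


def clean_word_html(html: str) -> str:
    """Apply each cleanup pattern in order and return the result."""
    for pattern, replacement in WORD_PATTERNS:
        html = pattern.sub(replacement, html)
    return html


sample = (
    '<p class="MsoNormal" style="margin:0in">'
    '<span lang="EN-US">Hello<o:p></o:p></span></p>'
)
print(clean_word_html(sample))
```

Regexes like these are fine for a one-off cleanup pass, but they do not parse HTML, so unusual markup can slip through or be mangled. Keep a copy of the original file.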

Keroberos
09-24-2011, 02:50 PM
There are also a few websites that can clean Word HTML files. I use this one occasionally: http://www.wordhtmlcleaner.co.uk/. It does have a 1 megabyte file limit, so you might have to split your Word file. You still have to do some cleanup, though (it converts all the <p> tags into <br /> tags).

DiapDealer
09-24-2011, 06:02 PM
You still have to do some cleanup, though (it converts all the <p> tags into <br /> tags).
Ouch!

Keroberos
09-24-2011, 08:14 PM
It's actually not too bad: just find/replace the <br /> with <p>, then let Sigil automatically close the tag, and you're ready to start adding your own formatting.
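For anyone who would rather not depend on automatic tag closing, the same <br /> to <p> conversion can be sketched in a few lines of Python. This is only an illustration, not what Sigil does internally: the function name br_to_paragraphs is hypothetical, and it assumes you feed it just the body text of the cleaned file (a fragment with no <head> or other block-level tags).

```python
import re


def br_to_paragraphs(html_fragment: str) -> str:
    """Wrap each <br>-separated chunk of text in its own <p>...</p> pair."""
    # Split on <br>, <br/>, or <br /> in any letter case.
    parts = re.split(r"<br\s*/?>", html_fragment, flags=re.IGNORECASE)
    # Skip empty chunks (e.g. after a trailing <br />) and wrap the rest.
    paragraphs = [f"<p>{part.strip()}</p>" for part in parts if part.strip()]
    return "\n".join(paragraphs)


print(br_to_paragraphs("First line<br />Second line<br />"))
```

Because every paragraph is written with both its opening and closing tag, nothing is left for the editor to close afterwards.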

DiapDealer
09-24-2011, 08:21 PM
It's actually not too bad: just find/replace the <br /> with <p>, then let Sigil automatically close the tag, and you're ready to start adding your own formatting.
I rarely let any program "automatically" make wholesale changes to my HTML. I'm too much of a control freak. ;)

I've experienced disasters relying on Sigil to close tags properly on its own (as much as I love Sigil), but if the process works for you... :)

Keroberos
09-24-2011, 08:37 PM
I agree. I never let Sigil automatically close tags after I have started applying formatting (I've experienced disasters there too). But as a first step, if anything goes wrong, not too much time is lost: usually less than the time it takes to manually remove Word's messy HTML formatting. Also, I suck at regular expressions, so this way is actually faster for me (and it removes almost all formatting except headings, italics, and bold, so it is useful for those times I want to strip the formatting completely). And this seems to be one case where Sigil doesn't have a problem closing the tags automatically.

Snooty Baronet
09-25-2011, 02:35 AM
There are also a few websites that can clean Word HTML files. I use this one occasionally: http://www.wordhtmlcleaner.co.uk/. It does have a 1 megabyte file limit, so you might have to split your Word file. You still have to do some cleanup, though (it converts all the <p> tags into <br /> tags).

Thanks! I'll check it out.