View Single Post
Old 10-12-2010, 10:47 PM   #18
Ken Irving
Writer
Ken Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileRead
 
Posts: 86
Karma: 65586
Join Date: Aug 2010
Location: New York
Device: Nook "1st Edition" Wireless, Nook4PC, NookStudy, Kindle4PC
Quote:
Originally Posted by PatNY View Post
Is there an easy way to clean up the styling "crap" that Word puts at the top of all html files? Like maybe through a regex search and replace?
...
Anyone have any suggestions?
A chapter in a book called EPUB Straight to the Point, by Elizabeth Castro, gives very specific instructions for turning a Word 2007 doc into an epub file that will pass epubcheck validation. There are quite a few steps, so I can't really summarize it here, but it starts with saving to filtered html, closing, and then reopening and editing the raw file with a text processor. It is a process of moving some things around, changing a few things by hand, and the rest requires search and replace using regular expressions. I recommend this book highly, by the way, which you can get in print or as an ebook with DRM from Amazon or B&N, or directly from her site with no DRM: http://www.elizabethcastro.com/epub/

She's primarily interested in formatting for the iPad, but most of what she says can be applied to any ereader that uses epub because she gets into the nuts and bolts of an epub file and is very good at explaining things.
Ken Irving is offline   Reply With Quote