Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 10-12-2010, 04:56 PM   #16
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by DTM View Post
Yes, that was created under Word 2007. I opened it, saved it "down" to 97-2003 format, and have attached the resulting file. I have not tried it.
Thanks--I'll let you know how it works with my itchy-twitchy file. ;-)

Hitch
Hitch is offline   Reply With Quote
Old 10-12-2010, 08:40 PM   #17
Worldwalker
Curmudgeon
Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.Worldwalker ought to be getting tired of karma fortunes by now.
 
Posts: 3,085
Karma: 722357
Join Date: Feb 2010
Device: PRS-505
Quote:
Originally Posted by PatNY View Post
Why does Word put a gazillion font items at the start of html files even when those fonts are not being used in the file? It makes it much harder to edit the files in Sigil or any other editor.
You answered your own question.

Personally, I'm fond of EditPlus for my HTML editing -- and pretty much everything else. It isn't free, but I promote it anyway because it's just so darn good. In any event, though, I'd use just about anything except Word for HTML. As it's been said, Word is as good an HTML editor as Dreamweaver is a word processor.
Worldwalker is offline   Reply With Quote
Advert
Old 10-12-2010, 10:47 PM   #18
Ken Irving
Writer
Ken Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileReadKen Irving has read every ebook posted at MobileRead
 
Posts: 86
Karma: 65586
Join Date: Aug 2010
Location: New York
Device: Nook "1st Edition" Wireless, Nook4PC, NookStudy, Kindle4PC
Quote:
Originally Posted by PatNY View Post
Is there an easy way to clean up the styling "crap" that Word puts at the top of all html files? Like maybe through a regex search and replace?
...
Anyone have any suggestions?
A chapter in a book called EPUB Straight to the Point, by Elizabeth Castro, gives very specific instructions for turning a Word 2007 doc into an epub file that will pass epubcheck validation. There are quite a few steps, so I can't really summarize it here, but it starts with saving to filtered html, closing, and then reopening and editing the raw file with a text processor. It is a process of moving some things around, changing a few things by hand, and the rest requires search and replace using regular expressions. I recommend this book highly, by the way, which you can get in print or as an ebook with DRM from Amazon or B&N, or directly from her site with no DRM: http://www.elizabethcastro.com/epub/

She's primarily interested in formatting for the iPad, but most of what she says can be applied to any ereader that uses epub because she gets into the nuts and bolts of an epub file and is very good at explaining things.
Ken Irving is offline   Reply With Quote
Old 10-13-2010, 10:19 AM   #19
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,697
Karma: 5703586
Join Date: Nov 2009
Device: many
Hi,

Although mentioned earlier, an even easier way to make the conversion is to use OpenOffice.org and its free plugin writer2xhtml.oxt. Both are free and cross-platform and work very well.

The work flow then becomes:

1. edit your file in Word until your heart is content, then save it as a Word file.

2. open your Word file in OpenOffice.org (after installing the free plugin) and yes it can read and write Word files. Yes it has dictionaries in many languages not supported by MS. Yes, it can do pretty much whatever Office can do - and it is free (as in free beer) and is GPL licensed as well.

3. Do any other editing you want, then export your document to xhtml via the plugin.

4. Fire up Sigil or Calibre to load/convert the xhtml to epub or whatever you want.

The xhtml is much better formatted than the crap that MS Word produces.

There is even a plugin to take you direct to epub although I am not sure how far along that is.

The nice thing is that if you are unfamiliar with OpenOffice.org, you can do most of your editing in Word and only use OpenOffice.org for the conversion to xhtml if you so desire.

KevinH
KevinH is online now   Reply With Quote
Old 10-14-2010, 06:57 AM   #20
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
I take the "nuke it" approach; I wipe Word files completely clean; apply a base layer of formatting; output to html; put the html in my much-beloved NoteTab Pro (thank you, thank you, @capidamonte for turning me on to NoteTab!), attach a standardized css that I keep handy, insert various Sigil formatting (like the chapter breaks), regex the chapters to create headers, and then put that fine-tuned html into Sigil. Works GREAT. Best method I've found thus far. I just delete all the crapola that MS puts in the beginning of the html as "styles," most of it is utter garbage. HTH.

Hitch
Hitch is offline   Reply With Quote
Advert
Old 10-20-2010, 09:43 AM   #21
edbro
Banned
edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.edbro is fluent in JavaScript as well as Klingon.
 
Posts: 640
Karma: 4911
Join Date: Jul 2007
Location: Grapevine, TX
Device: iPad4
Quote:
Originally Posted by KevinH View Post
Although mentioned earlier, an even easier way to make the conversion is to use OpenOffice.org and its free plugin writer2xhtml.oxt.
At your suggestion I've been using OO with this plugin. I have a question about some of the choices though. If I choose to output xhtml using "original style" vs something else, like "chocolate", I get a much larger file size. I notice that original style has a style description in front of every line. However, in practice I've found that the larger sized original seems to give a more efficient epub. When I used "chocoate" the epub was slow on my reader.

So, I guess I'm answering my own question but I want to be sure. Another factor that makes me unsure of all this is that the output file size using "original style" is almost the same as MS Word's Filtered Html output. So, am I achieving anything by using OO?

Also, should I be exporting to xhmtl or xhtml + MathML (which comes up by default)?
edbro is offline   Reply With Quote
Old 10-21-2010, 03:47 AM   #22
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
The main gripe I have with wiping, because that's easy, is that the formatting is gone. I can live with the headers, but I find it for myself unacceptable that things like italics will be removed. Usually the writer puts that in on purpose and it should remain there.

I used to have a good tool to remove all the Word-specific muck, but unfortunately that does not run on Vista and higher...
Toxaris is offline   Reply With Quote
Old 10-21-2010, 05:34 PM   #23
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by Toxaris View Post
The main gripe I have with wiping, because that's easy, is that the formatting is gone. I can live with the headers, but I find it for myself unacceptable that things like italics will be removed. Usually the writer puts that in on purpose and it should remain there.

I used to have a good tool to remove all the Word-specific muck, but unfortunately that does not run on Vista and higher...
Toxaris: I use the BookDesigner template...it's available somewhere around here on MobileRead, and that works perfectly to preserve italics, fortunately. So I load the template; run the macro to tag the italics; strip the formatting; I apply a base font by modifying the normal font, then re-italicize the text with the built-in macro, save it and export it to html. Any other way simply leads to madness, IMHO...and believe me, I've tried them ALL. OO, Atlantis...I've actually lost track of everything I've tried.

HTH.

Hitch
Hitch is offline   Reply With Quote
Old 10-21-2010, 06:22 PM   #24
Amalthia
Wizard
Amalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beautyAmalthia does all things with Zen-like beauty
 
Amalthia's Avatar
 
Posts: 1,185
Karma: 32196
Join Date: Jan 2007
Location: Anchorage, AK
Device: Sony Reader PRS-505, PRS-650, PRS-T3, Pocketbook HD2
I use Word Perfect and the Publish to Plain HTML option. Then I use macros to remove the span tags and the break tags it adds. But compared to Word it's a cake walk. The rest of the HTML is plain and easy to edit.
Amalthia is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Content Kindle's "Topaz" Format Looks Like Crap Gideon Amazon Kindle 21 10-07-2010 10:35 PM
Kindle DX optimal "page" size - PDF or Word template guiyoforward Amazon Kindle 12 09-28-2010 07:05 PM
Sigil 024 and regular expressions on "all HTML files" WS64 Sigil 4 08-13-2010 07:33 PM
Microsoft Reader plugin "Read in" for Word doesn't load anymore K-Thom Reading and Management 15 04-17-2009 05:52 AM
"Beginning Ruby: From Novice to Professional" $10 teamonkey Deals and Resources (No Self-Promotion or Affiliate Links) 0 06-20-2008 03:05 PM


All times are GMT -4. The time now is 03:49 PM.


MobileRead.com is a privately owned, operated and funded community.