Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 06-03-2013, 02:01 PM   #1
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
Word > DocToHtml > Sigil

I am surprised there are only 2 very old threads in this forum speaking about DocToHtml. I just had a look at their documentation. The tool does every thing I need including a batch mode and command line support.

I have noticed in the example they provide that the HTML code generated is pretty clean and could easily passed through a battery of regex SR. For instance ...

The tool creates :
Code:
<h3><a name="_Analysis_document">A</a>nalysis document</h3>
I will have to modify the HTML:
Code:
<h3 id="_Analysis_document">Analysis document</h3>
Not a big deal. It has a SR tool or I can use Regex outside of the tool and before Sigil.

If their marketing is correct (hum, hum), I already have a lot of ideas to improve my flow. Zen.

I plan to start the evaluation next Monday. However I feel suspicious the silence in this forum.

Any feedback before I invest time on the evaluation?

Thanks.
abeonis is offline   Reply With Quote
Old 06-03-2013, 02:45 PM   #2
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 8,328
Karma: 36126080
Join Date: Jan 2010
Device: Kindle Fire HD, Kindle 2
Quote:
However I feel suspicious the silence in this forum.
The virtual "silence" might have something to do with the fact that this isn't the "Getting your Document Adequately HTMLized So You Can Turn It into an ePub That You Might Then Want to Edit with Sigil" forum.
DiapDealer is offline   Reply With Quote
Old 06-03-2013, 02:57 PM   #3
theducks
Grand Sorcerer
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 13,631
Karma: 5126946
Join Date: Aug 2009
Location: The (original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Quote:
Originally Posted by DiapDealer View Post
The virtual "silence" might have something to do with the fact that this isn't the "Getting your Document Adequately HTMLized So You Can Turn It into an ePub That You Might Then Want to Edit with Sigil" forum.

another way of saying: Just because you use Sigil (or xyz) in your process, does not make Those forums the best place to pose a question.
Your (the OP) job (besides creating the work ) is to determine where things go Left and get help (from the dedicated forum) keeping thing straight and narrow.

Good luck
theducks is offline   Reply With Quote
Old 06-03-2013, 03:00 PM   #4
JSWolf
Suspended
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
Posts: 35,392
Karma: 16147088
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Sony Reader PRS-650, iPad, nook STR
This belongs in the conversion forum where there is a thread on the new Tool by Toxaris that is used to take Word's mess and clean it up for use in converting to ePub.
JSWolf is offline   Reply With Quote
Old 06-03-2013, 04:01 PM   #5
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
Oops. Sorry if I selected the inappropriate forum. I meant there are only 2 threads in the entire MobileRead forum, not in Sigil only.

I know Toxaris tool and started testing it. I gave up though. It doesn't work in its current state, it doesn't implement what I need and I am not sure to be understood.

Gentle moderator, please move this thread to whatever forum you want.

Thanks.
abeonis is offline   Reply With Quote
Old 06-03-2013, 04:10 PM   #6
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
Silence and agressivity. Interesting.
abeonis is offline   Reply With Quote
Old 06-03-2013, 05:20 PM   #7
JSWolf
Suspended
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
Posts: 35,392
Karma: 16147088
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Sony Reader PRS-650, iPad, nook STR
Quote:
Originally Posted by abeonis View Post
Silence and agressivity. Interesting.
Have a look in the Conversion forum and look for the thread on Toxaris' new tool for dealing with Word and cleaning the code. That's most likely what might work for you.
JSWolf is offline   Reply With Quote
Old 06-03-2013, 05:23 PM   #8
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
You don't listen to me my friend. I said ...

Quote:
I know Toxaris tool and started testing it. I gave up though. It doesn't work in its current state, it doesn't implement what I need and I am not sure to be understood.
abeonis is offline   Reply With Quote
Old 06-03-2013, 05:44 PM   #9
JSWolf
Suspended
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
Posts: 35,392
Karma: 16147088
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Sony Reader PRS-650, iPad, nook STR
Quote:
Originally Posted by abeonis View Post
You don't listen to me my friend. I said ...
Then what to do is this...

Save as filtered HTML, convert to ePub with Calibre, and then use Sigil to fix the mess.
JSWolf is offline   Reply With Quote
Old 06-03-2013, 05:55 PM   #10
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
Quote:
Originally Posted by JSWolf View Post
Then what to do is this...

Save as filtered HTML, convert to ePub with Calibre, and then use Sigil to fix the mess.
You are kidding?
abeonis is offline   Reply With Quote
Old 06-03-2013, 06:03 PM   #11
Notjohn
Groupie
Notjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfoldedNotjohn reads XML... blindfolded
 
Posts: 172
Karma: 52104
Join Date: Dec 2012
Device: Kindle
I use word2cleanhtml.com to accomplish this, and I welcome seeing discussions of this problem in the Sigil forum. I had no idea they were frowned on!
Notjohn is offline   Reply With Quote
Old 06-03-2013, 06:10 PM   #12
Turtle91
Guru
Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.
 
Turtle91's Avatar
 
Posts: 669
Karma: 3807234
Join Date: Dec 2012
Location: Shannon, Ireland today
Device: iPhone 5/iPad 1&2/Surface Pro/Kindle PW
Perhaps you can write your own set of macros that do what you want, how you want it?

There is no need to use some other regex s/r program before sigil. Sigil has it built in. If you dont know how to write macros you can actually save common regex's in sigil so you don't need to write a macro to do it.

Wolf actually had a very good suggestion for you if you are unwilling or unable to use the recommended toxaris plugin. If you are asking for our feedback or recommendations then the silence IS feedback of a sort...people don't have anything good to say about it either because it's not a good tool, or there are better methods/processes and we don't use doc2html.

But by all means, try out the doc2html program. Let us know how it works for you.
Turtle91 is offline   Reply With Quote
Old 06-03-2013, 06:20 PM   #13
Turtle91
Guru
Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.
 
Turtle91's Avatar
 
Posts: 669
Karma: 3807234
Join Date: Dec 2012
Location: Shannon, Ireland today
Device: iPhone 5/iPad 1&2/Surface Pro/Kindle PW
Quote:
Originally Posted by JSWolf View Post
Then what to do is this...

Save as filtered HTML, convert to ePub with Calibre, and then use Sigil to fix the mess.
The only thing I would add or change to wolf's suggestion is:
There is no need to use calibre to convert to ePub first, just add the HTML file directly to sigil. It will create the ePub without needing to clean up all the calibre mess.

So:
1) use Toxaris' plugin to clean up the document and save as HTML (or ePub)
2) open the resulting file in sigil and make final corrections (s/r, regex)

Or

1) SaveAs filtered HTML
2) open the resulting file in sigil and make final corrections (s/r, regex)


Cheers!
Turtle91 is offline   Reply With Quote
Old 06-03-2013, 06:29 PM   #14
abeonis
eBook DIYer
abeonis began at the beginning.
 
abeonis's Avatar
 
Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
I know word2cleanhtml.com, it's the best solution I found so far except doing everything with Sigil. It is a shame it can't be run with a command file.

Quote:
Originally Posted by JSWolf View Post
Then what to do is this...
Save as filtered HTML, convert to ePub with Calibre, and then use Sigil to fix the mess.
Actually I prefer agressivity than this kind of hazing. Someone like JSWolf must know that this solution doesn't work. And if it works by miracle, there is a chance that KDP will not accept the mobi file generated. This misinformation is a mistery ... maybe a self-esteem problem.

Anyway, I like this kind of acid environment. It makes me stronger.
abeonis is offline   Reply With Quote
Old 06-03-2013, 06:30 PM   #15
dickloraine
Enthusiast
dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!dickloraine is faster than a rolling 'o,' stronger than silent 'e,' and leaps capital 'T' in a single bound!
 
Posts: 49
Karma: 50202
Join Date: Apr 2013
Location: Berlin
Device: PRS 350+
Quote:
Originally Posted by JSWolf View Post
Then what to do is this...

Save as filtered HTML, convert to ePub with Calibre, and then use Sigil to fix the mess.
No need to convert the filtered HTML with calibre. In fact, i think this is a bad idea. Just import the HTML directly into sigil. Copy the stlye-informations from the word generated HTML into a stylesheet. Let Sigil get rid of unused styles. Now just adjust the remaining styles to your liking.

If you have a good word document - one with styles - this will produce a very clean epub. Maybe you want to run a few regexes above it, for example to get rid of MsNormal. If you always use consistent styles in word, you can reuse your epub stylesheet for each converted document. If you have footnotes or some other more complicated things, you want to "fix" them with regexes also. They do work, but could be a little bit prettier. Just remember: Use styles in word.
dickloraine is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Best Pre-Sigil word processor tool/workflow? Leverpullr Sigil 25 08-27-2012 02:18 PM
cleaning up a word document in Sigil BeccaPrice Sigil 9 10-08-2011 03:06 PM


All times are GMT -4. The time now is 08:32 PM.


MobileRead.com is a privately owned, operated and funded community.