![]() |
#1 |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Word > DocToHtml > Sigil
I am surprised there are only 2 very old threads in this forum speaking about DocToHtml. I just had a look at their documentation. The tool does every thing I need including a batch mode and command line support.
I have noticed in the example they provide that the HTML code generated is pretty clean and could easily passed through a battery of regex SR. For instance ... The tool creates : Code:
<h3><a name="_Analysis_document">A</a>nalysis document</h3> Code:
<h3 id="_Analysis_document">Analysis document</h3> If their marketing is correct (hum, hum), I already have a lot of ideas to improve my flow. Zen. I plan to start the evaluation next Monday. However I feel suspicious the silence in this forum. Any feedback before I invest time on the evaluation? Thanks. |
![]() |
![]() |
![]() |
#2 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 28,358
Karma: 203720150
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
![]() |
|
![]() |
![]() |
![]() |
#3 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,905
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
![]() another way of saying: Just because you use Sigil (or xyz) in your process, does not make Those forums the best place to pose a question. Your (the OP) job (besides creating the work ![]() Good luck |
|
![]() |
![]() |
![]() |
#4 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,131
Karma: 144284184
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
This belongs in the conversion forum where there is a thread on the new Tool by Toxaris that is used to take Word's mess and clean it up for use in converting to ePub.
|
![]() |
![]() |
![]() |
#5 |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Oops. Sorry if I selected the inappropriate forum. I meant there are only 2 threads in the entire MobileRead forum, not in Sigil only.
I know Toxaris tool and started testing it. I gave up though. It doesn't work in its current state, it doesn't implement what I need and I am not sure to be understood. Gentle moderator, please move this thread to whatever forum you want. Thanks. |
![]() |
![]() |
![]() |
#6 |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
Silence and agressivity. Interesting.
|
![]() |
![]() |
![]() |
#7 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,131
Karma: 144284184
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
![]() |
![]() |
![]() |
#8 | |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
You don't listen to me my friend. I said ...
Quote:
|
|
![]() |
![]() |
![]() |
#9 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,131
Karma: 144284184
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
![]() |
![]() |
![]() |
#10 |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
|
![]() |
![]() |
![]() |
#11 |
mostly an observer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,518
Karma: 987654
Join Date: Dec 2012
Device: Kindle
|
I use word2cleanhtml.com to accomplish this, and I welcome seeing discussions of this problem in the Sigil forum. I had no idea they were frowned on!
|
![]() |
![]() |
![]() |
#12 |
A Hairy Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,312
Karma: 20171571
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
Perhaps you can write your own set of macros that do what you want, how you want it?
There is no need to use some other regex s/r program before sigil. Sigil has it built in. If you dont know how to write macros you can actually save common regex's in sigil so you don't need to write a macro to do it. Wolf actually had a very good suggestion for you if you are unwilling or unable to use the recommended toxaris plugin. If you are asking for our feedback or recommendations then the silence IS feedback of a sort...people don't have anything good to say about it either because it's not a good tool, or there are better methods/processes and we don't use doc2html. But by all means, try out the doc2html program. Let us know how it works for you. |
![]() |
![]() |
![]() |
#13 | |
A Hairy Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,312
Karma: 20171571
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
Quote:
There is no need to use calibre to convert to ePub first, just add the HTML file directly to sigil. It will create the ePub without needing to clean up all the calibre mess. So: 1) use Toxaris' plugin to clean up the document and save as HTML (or ePub) 2) open the resulting file in sigil and make final corrections (s/r, regex) Or 1) SaveAs filtered HTML 2) open the resulting file in sigil and make final corrections (s/r, regex) Cheers! |
|
![]() |
![]() |
![]() |
#14 | |
eBook DIYer
![]() Posts: 111
Karma: 10
Join Date: Oct 2012
Location: Europe
Device: K4, KF HD 8.9, Readium
|
I know word2cleanhtml.com, it's the best solution I found so far except doing everything with Sigil. It is a shame it can't be run with a command file.
Quote:
Anyway, I like this kind of acid environment. It makes me stronger. |
|
![]() |
![]() |
![]() |
#15 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 631
Karma: 7544528
Join Date: Apr 2013
Location: Berlin
Device: PRS 350, Kobo Aura
|
Quote:
If you have a good word document - one with styles - this will produce a very clean epub. Maybe you want to run a few regexes above it, for example to get rid of MsNormal. If you always use consistent styles in word, you can reuse your epub stylesheet for each converted document. If you have footnotes or some other more complicated things, you want to "fix" them with regexes also. They do work, but could be a little bit prettier. Just remember: Use styles in word. |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Best Pre-Sigil word processor tool/workflow? | Leverpullr | Sigil | 25 | 08-27-2012 02:18 PM |
cleaning up a word document in Sigil | BeccaPrice | Sigil | 9 | 10-08-2011 03:06 PM |