Quote:
Originally Posted by BillSmithBooks
Another way to do this (without using Calibre) might be to convert Epub to HTML, then go into the HTML code and convert the <b>bold</b> to *bold* and <i>italic</> to _italic_ (Might also be emphasis and another code word for bold and italic).
Then go back into the regular browser, highlight all of the text, copy and paste into a txt browser editor.
Don't know if this helps but I think it would be a pretty foolproof way to do things.
|
It isn't that easy. The HTML conversion doesn't just use tags like that; it generates and refers to a style.css file, so that every change in the text is made up of one to several span tags. Stripping them out one by one feels like stripping the layers off an onion, and is not always doable either - how do you differentiate /span end-tags between the various span types? If I remove all </span> instances I have no way of knowing where the italic text ends.
I've manually converted one book like that because it was slightly clearer and I eventually got it right, but the one I wanted to convert next was essentially unfeasible.
I dunno, it might be possible to do something with regular expressions, but honestly life's too short.