|10-13-2008, 06:23 AM||#1|
Join Date: Aug 2008
Device: Cybook Gen3 [512mb, FW: 1.5]
Tool to easily clean and refurbish html-text before conversion
What I was looking for was an easy way to go through the html-files I created before I formated them to mobi.
I used to use a simple text editor to go through the source and then check the result in a browser, but this is rather annoying with long documents.
While searching for a simple WYSIWYG (what you see is what you get) I was very close to desperation as there are thousands of them out there, some even free - but the main problem was that they created bulky code and there was no way to force them not to.
I know that bookdesigner is a tool to do it, but I never found it to be intuitive and I had a lot of trouble editing non english books.
Even though my requirements were simple (at leasts I thought so ) it took me a lot of tries to find the perfect program:
I finaly ended up with kompoZer, an unofficial successor of Nvu.
It allows to
kompoZer is fast, stable and easy to use - I hope it will help some of you the way it helps me
|10-13-2008, 07:11 AM||#2|
Join Date: May 2006
Device: PocketBook 360, before it was Sony Reader, cassiopeia A-20
my favourite tool for converting html books is a commandline program demoroniser.
it processes html sources and removes all the fancy characters that Microsoft tools insert into html code. Problem is all those fancy curly quotes, non-breaking spaces, optional hyppens, em-dashes, en-dashes, ... in some html files are non standard and display on my reader as very distracting two-exotic-character-combinations. Such files are practically unreadable.
I know that quite a lot of people here are very fond of their properly formated curly braces, em-dashes, em-dashes, and other typographical sugar, but when such things display on my reader as unreadable characters I resort to demoroniser or my own scripts written in vim.
By the way, I always use nvu generated html code as a textbook example how a really well written html code is supposed to look. I also use MSWord generated html code as an oposite example.
|10-13-2008, 10:40 AM||#3|
Join Date: Jul 2008
Location: The Third World
Device: iLiad + PRS-505 + Kindle 3
Amaya is also good.
And, being developed by W3C, I suppose it's strictly standard-compliant...
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|clean HTML or PDF before mobi conversion in Calibre||mark235||Calibre||9||12-25-2010 10:37 PM|
|BookDesigner HTML0 to clean HTML conversion utility||Pablo||Workshop||15||08-24-2010 01:05 PM|
|calibre: HTML to LRF conversion, problem with justified text||Juggle4Evr||Sony Reader||6||07-12-2009 08:19 PM|
|Best tool to strip text out of PDF for LRF conversion?||the7gerbers||LRF||3||03-22-2009 08:27 PM|
|Today only - Free IntraPDF conversion tool (PDF -> HTML)||Bob Russell||7||04-10-2007 01:16 PM|