12-30-2009, 07:27 AM | #1 |
ePub Maker
Posts: 120
Karma: 16
Join Date: Dec 2009
Location: Mordor
Device: iPad,Kindle 3, Nook 2
|
Clean and compress HTML before making ebook
Many ebook producers collect resources from web pages, but a lot of work is still needed to turn these raw materials into an elegant ebook.
Here, I would like to recommend a small tool, HTML Page Cleaner 2.2. It is designed for collected web pages: you can clean up the pages by removing all useless elements, from JavaScript and forms to web links. I once made a news collection for our company from one particular site. The result really looked like a book, or at least much better than sending the web pages directly to my manager. I have also made some CHM ebooks for myself from Wikipedia and other websites I am interested in, and they turned out fairly well. So if your hobby is collecting information from the web, you can try this tool before compiling it. The tool can be downloaded from www.htmlcleaner.com or found by searching download.com. I'm still using MS HTML Help Workshop as my CHM compiler, which can also be downloaded from download.com. Clean the web pages, then compile them: it's an easy way to make your ebook look more professional. Finally, please do not publish an ebook made from copyright-protected web pages. |
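For readers who prefer to script this kind of cleanup, here is a minimal sketch in Python of the same idea: drop scripts and forms together with their contents, and turn links into plain text. It is not how HTML Page Cleaner works internally (the thread does not say); the names `PageCleaner` and `clean_html`, and the list of elements to drop, are assumptions made for illustration. It uses only the standard library's `html.parser`.

```python
from html import escape
from html.parser import HTMLParser

# Assumed list: elements removed together with everything inside them.
DROP_WITH_CONTENT = {"script", "style", "form", "noscript", "iframe"}
# Assumed list: elements whose tags are removed but whose text is kept.
UNWRAP = {"a"}


def _render(tag, attrs, self_closing=False):
    """Rebuild a start tag, leaving out on* event-handler attributes."""
    parts = [tag]
    for name, value in attrs:
        if name.startswith("on"):
            continue
        if value is None:
            parts.append(name)
        else:
            parts.append(f'{name}="{escape(value, quote=True)}"')
    return "<" + " ".join(parts) + (" />" if self_closing else ">")


class PageCleaner(HTMLParser):
    """Hypothetical cleaner: re-emits the page without the unwanted parts."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a dropped element

    def handle_starttag(self, tag, attrs):
        if tag in DROP_WITH_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or tag in UNWRAP:
            return
        self.out.append(_render(tag, attrs))

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> open and close in one step.
        if self.skip_depth or tag in DROP_WITH_CONTENT or tag in UNWRAP:
            return
        self.out.append(_render(tag, attrs, self_closing=True))

    def handle_endtag(self, tag):
        if tag in DROP_WITH_CONTENT:
            if self.skip_depth:
                self.skip_depth -= 1
            return
        if self.skip_depth or tag in UNWRAP:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            # Text arrives unescaped, so escape it again on the way out.
            self.out.append(escape(data, quote=False))


def clean_html(page):
    """Return the page with scripts, forms and link tags removed."""
    cleaner = PageCleaner()
    cleaner.feed(page)
    cleaner.close()
    return "".join(cleaner.out)


if __name__ == "__main__":
    sample = ('<body><script>alert(1)</script><h1>News</h1>'
              '<p>See <a href="http://example.com">this link</a>.</p></body>')
    print(clean_html(sample))
```

Comments and the doctype are also dropped, because the sketch does not re-emit them. An unclosed `<form>` would swallow the rest of the page, so real-world input may need a more forgiving parser.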
12-30-2009, 04:47 PM | #2 |
Punctuation Fetishist
Posts: 557
Karma: 1070000
Join Date: Nov 2008
Location: The Bluest Commonwealth In East America
Device: Kindle PW, Nexus 7 (2013), Galaxy S5 phone, Galaxy Tab 4 8.0
|
There is also a plugin for MS Word from Microsoft that can be set to various levels of strippage. I generally set it to strip out almost everything, resulting in a file with no styles or cruft, beyond headings, bold and italic.
Regards, Jack Tingle |
12-30-2009, 05:44 PM | #3 |
Resident Curmudgeon
Posts: 73,660
Karma: 127838196
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Where can we get this plugin and does it work for Word 2003?
|
12-30-2009, 10:02 PM | #4 |
ePub Maker
Posts: 120
Karma: 16
Join Date: Dec 2009
Location: Mordor
Device: iPad,Kindle 3, Nook 2
|
Where can it be downloaded? Do you have the URL? Thanks.
|
01-13-2010, 07:51 PM | #5 |
Austrian Bookworm
Posts: 141
Karma: 2138662
Join Date: Oct 2007
Location: Austria
Device: Pocketbook Inkpad 4
|
Can't find this plugin either.
I am doing the stripping with Notepad++ and regular expressions. It would be nice to have a faster way ... |
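The regular-expression approach mentioned above can be batched in a short script rather than run by hand in an editor. The following is only a sketch in Python: the patterns and the name `strip_cruft` are assumptions, not anything posted in this thread, and regexes are fragile on real HTML (they can also match look-alike text in the page body), so check the output before converting.

```python
import re

# Assumed patterns; adjust them to the markup you actually see.
SCRIPT_OR_STYLE = re.compile(
    r"<(script|style)\b[^>]*>.*?</\1\s*>", re.IGNORECASE | re.DOTALL
)
COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)
# Quoted style=, class= and on*= attributes.
INLINE_ATTRS = re.compile(
    r"""\s+(?:style|class|on\w+)\s*=\s*(?:"[^"]*"|'[^']*')""", re.IGNORECASE
)
# <span> and <font> tags are dropped; the text inside them is kept.
SPAN_OR_FONT = re.compile(r"</?(?:span|font)\b[^>]*>", re.IGNORECASE)


def strip_cruft(html):
    """Apply each pattern in turn and return the stripped markup."""
    for pattern in (SCRIPT_OR_STYLE, COMMENT, INLINE_ATTRS, SPAN_OR_FONT):
        html = pattern.sub("", html)
    return html


if __name__ == "__main__":
    sample = '<p class="MsoNormal"><span lang="EN">Hello</span> <b>world</b></p>'
    print(strip_cruft(sample))
```

Headings, bold and italic tags pass through untouched, which is close to the "almost everything stripped" result described earlier in the thread.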