View Single Post
Old 06-24-2010, 12:10 PM   #21
steven522
binomial: homo legentem
steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.steven522 ought to be getting tired of karma fortunes by now.
 
steven522's Avatar
 
Posts: 1,061
Karma: 25222222
Join Date: Feb 2010
Location: Alabama, USA
Device: iriver Story HD; Archos 80 G9
Quote:
Originally Posted by dougf4 View Post
does "Open Office" do a better job or is just the preferred word processor of some?
Well this is what I am doing right now as I tend to keep a USB flash drive in my pocket at all times:

I have an 8GB flash drive with all "raw" books on it, most of them are html encoded off of different web sites, etc., held in an "ebook" folder.

On the root of the drive I have placed the free "NotePad2" editor which does not need an install and runs off of the portable flash drive on any Windows based OS.

I launch the NotePad2 program and navigate to the ebook folder and open the raw html file.

Within NotePad2, you can search and replace the newline characters "\r\n" with the "transform backslashes" option checked. I usually do this first and replace the newline characters with a single space. This will make sure that you do not end up with two words jammedtogether.

Then search and replace all </p> ( and/or <p> depending on the html encoder used ) and replace with "\r\n" (with transform backslashes checked) to get all paragraphs locked to a single line.

After that, I do search and replace for various bold, underline, emphasis, etc., commands to clear the html formatting. If the chapter headers have obvious <H1> html header blocks around them, you can search and replace with a string of 5 "\r\n" to make sure you have 4 empty lines before each.

I continue on after the mass changes with a quick scroll up/down to look for any odd looking lines that might need to be manually cleaned up.

Once done, I save the file to a name like "Author_Last, First # [Series_Number -] Book Name.txt". This will allow proper naming and sorting in the jetbook's book list display.

I keep all books on an SD card with the folders named "!Sci-Fi" and "!Horror" so that the SD card folders appear first on the book list. Within those categories I put sub folders to further divide what I have ("Classics", "aliens", "werewolf" or by author names).
steven522 is offline   Reply With Quote