08-03-2007, 11:19 PM | #16 |
eNigma
Posts: 503
Karma: 1335
Join Date: Dec 2006
Location: The Philippines
Device: HTC G1 Android FBReader
|
monkpalmer, does this tool operate on the entire text in a "setup and then process" mode, or do you use it to go through the entire file manually correcting it? Thanks for the link and info.
|
08-04-2007, 12:08 AM | #17 | |
Technogeezer
Posts: 7,233
Karma: 1601464
Join Date: Nov 2006
Location: Virginia, USA
Device: Sony PRS-500
|
Quote:
I spent many years programming (mainframes, minis, and micros) and this is the best editor I have ever used. It is an excellent replacement for Notepad. It differs from Word mainly in its philosophy of operation: everything is displayed, and there are no hidden codes. (This is wonderful when you are setting up hyperlinks.) It also has a fully functional hex editor, which I wrongly thought I would never need with the Harvard Classics series. There is a 30- or 45-day free trial of the product. |
|
08-04-2007, 03:03 AM | #18 |
creator of calibre
Posts: 43,857
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
08-04-2007, 04:05 AM | #19 |
Groupie
Posts: 189
Karma: 793
Join Date: Oct 2006
|
Could I ask if anyone has any tips on the least labour-intensive way of replacing ASCII quotation marks (' ") with proper curly quotes, on either Mac (TextMate?) or Windows? Thanks.
|
08-04-2007, 06:30 AM | #20 |
Member
Posts: 10
Karma: 3650
Join Date: Dec 2004
Device: Tungsten TC
|
mogui, all you have to do is:
1. Open the text doc you want to reformat from within "E-Book Tidy".
2. Press a button.
The program does the whole thing for you in a couple of seconds. There's a preview tab as well, so you can check on how your Gutenberg text is shaping up. |
08-04-2007, 01:35 PM | #21 |
fruminous edugeek
Posts: 6,745
Karma: 551260
Join Date: Oct 2006
Location: Northeast US
Device: iPad, eBw 1150
|
If you find you need more flexible search/replace and other text-processing functionality, and you want to use the same rules on a whole batch of files at once, you might want to look at http://www.datamystic.com/textpipe.html. It's not free, but the "lite" version is under US$50. I'm using the "pro" version for another (non-ebook-related) task and it's quite powerful, and fairly easy to use. (I have no other connection with this company or product.)
|
08-04-2007, 10:48 PM | #22 |
eNigma
Posts: 503
Karma: 1335
Join Date: Dec 2006
Location: The Philippines
Device: HTC G1 Android FBReader
|
We often forget the old tools. AWK is available for Windows, as are mawk and gawk. This page links to tutorials and contains some scripts. The TextPipe folks say TextPipe is better than awk -- quicker to program. Awk is free, has useful variants, and comes with lots of free scripts and tutorials. Your choice.

Awk and sed have been around since the beginning of time. There are forums for getting help and getting scripts. Imagine writing an awk script that formats your Gutenberg text files just the way you like them and then running that script in batch mode on entire directories. Explore the world of awk scripts. Or you can use E-Book Tidy. Thanks, monkpalmer. It is always good to have a choice of tools.

GutenMark takes Gutenberg text files and converts them to nicely formatted HTML. Oh, how I wish I could use HTML on my Reader!

Replacing ASCII quote marks with the left and right versions ought not to be difficult for an awk script writer. If you want a one-click solution, write the script and then upload it here.
Last edited by mogui; 08-04-2007 at 10:53 PM. |
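For anyone who wants to try the batch idea without learning awk first, here is a rough sketch in Python rather than awk. It assumes the "formatting" you want is simply joining the hard-wrapped lines of a Gutenberg text into paragraphs, with blank lines marking paragraph breaks. The function names `unwrap_paragraphs` and `process_directory`, the `*.txt` pattern, and the UTF-8 encoding are all illustrative assumptions, not part of any tool mentioned in this thread.

```python
from pathlib import Path


def unwrap_paragraphs(text: str) -> str:
    """Join hard-wrapped lines; blank lines are treated as paragraph breaks."""
    paragraphs = []
    current = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped:
            current.append(stripped)
        elif current:
            # A blank line closes the paragraph collected so far.
            paragraphs.append(" ".join(current))
            current = []
    if current:
        paragraphs.append(" ".join(current))
    return "\n\n".join(paragraphs) + "\n"


def process_directory(src_dir: str, dst_dir: str) -> int:
    """Reformat every .txt file in src_dir into dst_dir; return the file count."""
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    count = 0
    for path in sorted(src.glob("*.txt")):
        # Encoding is an assumption; older Gutenberg files may need latin-1.
        text = path.read_text(encoding="utf-8")
        (dst / path.name).write_text(unwrap_paragraphs(text), encoding="utf-8")
        count += 1
    return count
```

Calling `process_directory("gutenberg_raw", "gutenberg_clean")` (both folder names made up) would write a reformatted copy of each file and leave the originals untouched. Verse, tables, and indented passages would be flattened by this simple rule, so check the output before trusting it.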
08-05-2007, 07:18 PM | #23 |
fruminous edugeek
Posts: 6,745
Karma: 551260
Join Date: Oct 2006
Location: Northeast US
Device: iPad, eBw 1150
|
Oh sure, awk is very powerful. So is perl. I'm just saying that some categories of repetitive tasks have been needed by so many people that other tools have been created that are easier to use -- for those tasks.
|
08-10-2007, 06:02 PM | #24 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Text Editor
|
08-23-2007, 03:36 PM | #25 |
Groupie
Posts: 189
Karma: 793
Join Date: Oct 2006
|
There seems to be a problem with the GutenMark download pages. I don't suppose anyone has a copy of either the OSX compiled tarball or the Windows compiled Zip that they could upload?
My simple, primitive approach to the curly quotes problem was to do a Replace All on the closing sequences ." ," and ?" first, and then, once all the right-hand double quotes had been eliminated, to do a Replace All on the remaining left-hand double quotes. I took a similar approach to single quotes and apostrophes.
Last edited by andym; 08-24-2007 at 02:20 PM. |
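The two-pass replace described above can be sketched in a few lines of Python. This is only an illustration of that approach, not a complete smart-quotes converter; the function name `curl_double_quotes` is made up, and the list of punctuation marks is limited to the three mentioned in the post.

```python
LEFT_DOUBLE = "\u201c"   # left (opening) curly double quote
RIGHT_DOUBLE = "\u201d"  # right (closing) curly double quote


def curl_double_quotes(text: str) -> str:
    """Two-pass replacement of straight double quotes with curly ones."""
    # Pass 1: a straight quote directly after . , or ? is treated as closing.
    for mark in (".", ",", "?"):
        text = text.replace(mark + '"', mark + RIGHT_DOUBLE)
    # Pass 2: every remaining straight quote is assumed to be an opening quote.
    return text.replace('"', LEFT_DOUBLE)


print(curl_double_quotes('"Hello," she said. "Really?"'))
```

Closing quotes that follow anything else (an exclamation mark, or a bare word as in a quoted title) would be turned into opening quotes by the second pass, so those cases still need a manual check or extra entries in the list.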
08-23-2007, 04:06 PM | #26 | |
New York Editor
Posts: 6,384
Karma: 16540415
Join Date: Aug 2007
Device: PalmTX, Pocket eDGe, Alcatel Fierce 4, RCA Viking Pro 10, Nexus 7
|
Quote:
Notepad replacements have their own category. Look under TextEditorFamilies. I'm currently using Notepad++, one of a batch of free, open-source text editors based on the Scintilla edit control, but I've used a number of others. If all you want to do is replace Notepad, UltraEdit is overkill.
______
Dennis
Collector of Text Editors |
|
08-27-2007, 01:27 PM | #27 | |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Quote:
It's best to stick with straight ASCII quotes if you want everyone to be able to display your book correctly. |
|
08-31-2007, 02:21 AM | #28 | |
Groupie
Posts: 189
Karma: 793
Join Date: Oct 2006
|
Quote:
Last edited by andym; 08-31-2007 at 02:23 AM. |
|
08-31-2007, 12:17 PM | #29 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
As long as you do use the HTML entities then you're right - there's no problem. Unfortunately it's very common to find them as straight "characters" in files, which then don't display correctly for everyone.
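If a file already contains curly quotes as raw characters, swapping them for the named entities is a mechanical job. The snippet below is a minimal Python sketch of that idea; the function name `quotes_to_entities` is hypothetical, and it assumes the target reader supports the standard HTML names `&lsquo;`, `&rsquo;`, `&ldquo;`, and `&rdquo;`, which is worth checking against the reader's own entity list.

```python
# Map each curly quote character to its named HTML entity.
QUOTE_ENTITIES = {
    "\u2018": "&lsquo;",  # left single quote
    "\u2019": "&rsquo;",  # right single quote / apostrophe
    "\u201c": "&ldquo;",  # left double quote
    "\u201d": "&rdquo;",  # right double quote
}


def quotes_to_entities(text: str) -> str:
    """Replace raw curly quote characters with named HTML entities."""
    return text.translate(str.maketrans(QUOTE_ENTITIES))


print(quotes_to_entities("\u201cIt\u2019s fine,\u201d she said."))
```

All other characters pass through unchanged, so the same function can be run safely over text that has no curly quotes at all.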
|
08-31-2007, 12:39 PM | #30 | |
Groupie
Posts: 189
Karma: 793
Join Date: Oct 2006
|
Quote:
If anyone is interested, there's a list of the HTML entities supported by Mobipocket here. Unfortunately, the formatting is pretty scrambled. It seems to be the same as the Open eBook list, which you can download from the idpf.org site - the link is here: http://www.idpf.org/oebps/oebps1.2/d.../oeb12-dtd.zip. The list has the suffix .ent, but it should be possible to open it in a decent text editor. [Edit - direct link here: http://openebook.org/dtds/oeb-1.2/oeb12.ent opens in a browser]
Last edited by andym; 09-07-2007 at 03:09 AM. |
|
|