Old 07-21-2006, 08:19 AM   #1
Riocaz
Fulfilled but not by iRex
Posts: 932
Karma: 286846
Join Date: May 2006
Location: London
Device: Far too many
RTF conversion.

Because it bugs me...

Anyone know of a tool which will strip font and size tags from an RTF file, but leave the bold and italic tags in place?

It would aid me in converting my RTF files to HTML, as they bloat the end file if you convert straight. For example, I stripped 1.4MB of unnecessary crud from one I was playing with yesterday and dropped the file size from 2.5MB to 1.1MB.

I know I saw such a tool when I was searching for an RTF/HTML conversion tool, but unfortunately I didn't grab it at the time, and now cannot find it.
Old 07-21-2006, 08:24 AM   #2
ElaHuguet
iLiad freak
Posts: 339
Karma: 243
Join Date: Apr 2006
Location: Mallorca, Spain
Device: iRex iLiad
There's a tool called... Tidy something-or-other (TidyHTML? TidyUI? I have it at home), which is really cool for cleaning up code and also works great for cleaning up general MS Word crud. It's free, so you can check it out. Just google it; I found it that way.
Old 07-21-2006, 08:38 AM   #3
Riocaz
Doh! Why didn't I think of that? I was so focussed on finding something to "fix" the input file, I didn't think of "fixing" the output (even though I was trying to do that manually).

That looks perfect, I will have a play and tell you all how it does.
Old 07-21-2006, 08:55 AM   #4
ElaHuguet
Hehehe... you're welcome, I found it easy to use.
Old 07-24-2006, 05:37 PM   #5
meisterz
Junior Member
Posts: 6
Karma: 10
Join Date: Jul 2006
Device: TH55
Where can I find this Tidy?? program?

Thanks
Old 07-24-2006, 06:07 PM   #6
branko
Connoisseur
Posts: 93
Karma: 549
Join Date: Jul 2006
Location: Amsterdam
Device: Palm Zire
It's the first link in Google. Do you know what Google is?
Old 07-25-2006, 09:27 AM   #7
meisterz
By the sarcasm I assume it is either tidyhtml http://www.tucows.com/preview/206197 or tidy ui http://www.forums.devnetwork.net/vie...e6b53b800f978c
Old 07-25-2006, 01:59 PM   #8
branko
Sarcasm?
Old 08-01-2006, 11:37 AM   #9
yokos
Pac-Man catched my iLiad.
Posts: 720
Karma: 2571
Join Date: Apr 2006
Location: Germany; next to Baltic Sea
Device: 1st gen iRex iLiad with 2nd ed. battery/case
If you are a fan of the almighty LaTeX, give rtf2LaTeX a try. It works fine. http://sourceforge.net/projects/rtf2latex2e
Old 08-01-2006, 11:47 AM   #10
rlauzon
Wizard
Posts: 1,017
Karma: 67827
Join Date: Jan 2005
Device: Opus/System76 Starling
Quote:
Originally Posted by Riocaz
Anyone know of a tool which will strip font and size tags from an RTF file, but leave the bold and italic tags in place?
I usually use OpenOffice.

1. I convert the RTF into an HTML file.
2. Reload the HTML file back into OpenOffice.
3. I use the source view to do a Find/Replace on all the offending tags.
4. Then I convert the HTML into a regular OpenOffice file to save it.
5. And finally, I export to PDF to put it on my iLiad.
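
The Find/Replace in step 3 is the part that lends itself to a script. Here is a minimal sketch in Python, assuming the offending markup is plain `<font ...>` tags and that `<b>` and `<i>` should survive untouched. The name `strip_font_tags` is made up for this example and is not part of OpenOffice or any other tool, and a regex like this will trip over attribute values that contain a `>` character.

```python
import re

# Matches opening and closing <font> tags, with or without attributes.
# Assumption: the unwanted formatting lives only in <font> tags.
FONT_TAG = re.compile(r"</?font\b[^>]*>", re.IGNORECASE)


def strip_font_tags(html):
    """Remove <font> tags but keep their contents and all other markup."""
    return FONT_TAG.sub("", html)


sample = '<p><font face="Arial" size="3">Plain <b>bold</b> and <i>italic</i></font></p>'
print(strip_font_tags(sample))
```

Only the tags themselves are deleted, so the text and any bold or italic markup nested inside them stay where they were.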
Old 08-02-2006, 06:13 PM   #11
jsc
Member
Posts: 12
Karma: 10
Join Date: Jul 2006
I just got rtf2latex2e compiled for OS X and used it to convert a Baen RTF. It helped some, but I have to say it's simply not very good. I had to deal with a great number of unbalanced environment tags (italics started but never ended) and a large section of boldface which was not visible in the RTF. This may be because the RTF file included some formatting badness, but I think the above suggestion to use OpenOffice to convert to xhtml is better.

Unfortunately, the XHTML generated by OpenOffice uses CSS heavily, so it's not always obvious what markup to substitute. Italics are not done with an i tag; they are done with a p tag with a CSS class. Still, it puts it in a format that's at least workable. The final issue is to replace double and single quotes with appropriate text quotes, which I'm working on a script to do heuristically (you can't just count on there being left-right quote pairs, since a quotation that runs over multiple paragraphs traditionally starts each paragraph with a text quote but does not end each one with a text quote).
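
One way to work out which markup to substitute is to read the italic class names out of the stylesheet and then rewrite the elements that use them. The following is a rough sketch in Python under several assumptions: the class names `P1` and `P2` are invented for the example, the functions `italic_classes` and `classes_to_i_tags` are hypothetical helpers, and each element is assumed to carry a single `class` attribute with no nested elements of the same kind. Real OpenOffice output may not be that tidy.

```python
import re


def italic_classes(css):
    """Return the class names whose CSS rule declares font-style: italic."""
    names = set()
    # Each match is (selector text, rule body) for a flat, un-nested rule.
    for selector, body in re.findall(r"([^{}]+)\{([^{}]*)\}", css):
        if re.search(r"font-style\s*:\s*italic", body, re.IGNORECASE):
            names.update(re.findall(r"\.([\w-]+)", selector))
    return names


# A <p> or <span> with exactly one class attribute and its content.
CLASSED = re.compile(
    r'<(p|span)\s+class="([\w-]+)"\s*>(.*?)</\1\s*>',
    re.IGNORECASE | re.DOTALL,
)


def classes_to_i_tags(html, classes):
    """Replace italic-class styling on <p>/<span> elements with <i> tags."""
    def repl(match):
        tag, cls, inner = match.group(1), match.group(2), match.group(3)
        if cls not in classes:
            return match.group(0)  # not an italic class: leave untouched
        if tag.lower() == "span":
            return "<i>" + inner + "</i>"
        return "<" + tag + "><i>" + inner + "</i></" + tag + ">"
    return CLASSED.sub(repl, html)


css = ".P1 { font-size: 12pt } .P2 { font-style: italic; font-size: 12pt }"
html = '<p class="P1">Plain</p><p class="P2">Slanted</p>'
print(classes_to_i_tags(html, italic_classes(css)))
```

Elements whose class is not italic are left exactly as they were, so the result can still be checked by hand against the original stylesheet.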
Old 08-02-2006, 06:33 PM   #12
NatCh
Gizmologist
Posts: 11,605
Karma: 926222
Join Date: Jan 2006
Location: Republic of Texas Embassy at Jackson, TN
Device: Nook STGR
Quote:
Originally Posted by jsc
The final issue is to replace double and single quotes with appropriate text quotes
Okay, I'm not trying to be obnoxious, really I'm not, but I can't think of another way to ask this.

Why is this such a deal? It doesn't bother me at all if it's a "" instead of “” -- either way I get that it's a quote.... Is it just a matter of preference, or am I missing something here?


As a suggestion to address this, wouldn't it be a “ if it has a non-whitespace character after it, and a ” otherwise? Maybe that helps with the find/replace.

I think I'd try searching for "<whitespace> and replace all those with ”<whitespace>, and then search for all the remaining " and replace with “
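
That two-pass find/replace can be sketched in a few lines of Python. This is only an illustration of the rule as described here, with a hypothetical function name `curl_double_quotes`. It also treats a quote at the very end of the text as a closing quote, which is an added assumption.

```python
import re

LEFT_QUOTE = "\u201c"   # left (opening) double quotation mark
RIGHT_QUOTE = "\u201d"  # right (closing) double quotation mark


def curl_double_quotes(text):
    """Apply the two-pass rule: closing quotes first, then opening quotes."""
    # Pass 1: a straight quote followed by whitespace (or the end of the
    # text) is taken to be a closing quote.
    text = re.sub(r'"(?=\s|$)', RIGHT_QUOTE, text)
    # Pass 2: every remaining straight quote is taken to be an opening quote.
    return text.replace('"', LEFT_QUOTE)


print(curl_double_quotes('He said "hello there" and left.'))
```

A closing quote followed directly by punctuation, as in `"hello".`, has no whitespace after it, so this rule would turn it into an opening quote.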

Last edited by NatCh; 08-02-2006 at 06:40 PM.
Old 08-03-2006, 06:51 AM   #13
Riocaz
JSC: I would suspect the original file. I had similar problems with size/justification/etc when converting them.

NatCh: If you are missing something then so am I. I find "zzz" ''zzz'' “zzz” almost indistinguishable. So it's a matter of personal preference IMO.
Old 08-03-2006, 12:48 PM   #14
NatCh
Quote:
Originally Posted by Riocaz
NatCh: If you are missing something then so am I. I find "zzz" ''zzz'' “zzz” almost indistinguishable. So it's a matter of personal preference IMO.
Thanks, Riocaz, that's the conclusion I was drawing too. Guess it comes down to "I likes what I likes," which is as it should be.
Old 08-08-2006, 12:28 PM   #15
jsc
NatCh, your suggestion about spaces before or after is correct, and I've been doing that, but there are situations where it does not apply. Especially with the first author I've been working on converting, who tends to use a lot of m-dashes to interject comments within speeches, you get a lot of ---"text and text"--- and you cannot just assume that the text is a quote or commentary.
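
Since the spacing rule cannot decide those cases, one option is to have the script flag them for a human instead of guessing. A small sketch in Python, assuming that "ambiguous" means a straight double quote with non-whitespace characters on both sides; the function name `ambiguous_quote_offsets` is hypothetical.

```python
import re

# A straight double quote with a non-whitespace character on each side,
# e.g. the quotes in ---"text and text"--- where spacing gives no hint.
AMBIGUOUS_QUOTE = re.compile(r'(?<=\S)"(?=\S)')


def ambiguous_quote_offsets(text):
    """Return the character offsets of quotes the spacing rule cannot classify."""
    return [match.start() for match in AMBIGUOUS_QUOTE.finditer(text)]


sample = 'a---"b c"---d'
for offset in ambiguous_quote_offsets(sample):
    # Show a little context around each flagged quote for manual review.
    print(offset, sample[max(0, offset - 5):offset + 6])
```

Quotes that have whitespace on one side are left to the simple spacing rule, and only the flagged offsets need a manual look.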

Why bother? Well, I brought up text quotes specifically in relation to the use of rtf2latex2e. If anyone is going to bother using LaTeX, then there is a higher probability that they have a higher interest in the niggling details of fine typography, making ebooks look like books and not just text files. And text quotes are just one such detail, along with the proper use of hyphens, n-dashes, and m-dashes, ligatures, proper spacing after sentences but not abbreviations, non-breaking spaces, widows and orphans, etc. Thankfully, LaTeX takes care of most of those things automatically, but not the quotes thing.

That's useful only if one cares. I'm not a type-fascist myself, but the iLiad screen is so nice, I thought I might expend the effort for at least a few books just so I have something worthy of the screen. But I find just the plain PDF output from OO reads just fine as well. And the manybooks.net output for iLiad looks really very good.
All times are GMT -4.