Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Reply
 
Thread Tools Search this Thread
Old 11-19-2011, 04:02 PM   #1
sovre
Zealot
sovre began at the beginning.
 
sovre's Avatar
 
Posts: 108
Karma: 10
Join Date: Dec 2010
Location: United States
Device: iPad Mini; iPhone; Kindle Paperwhite (10th gen)
hidden dashes?

I have scanned a few texts into RTF format. They look just fine when viewed in this format in Word, but if I try to use the document with some other software, I run into a problem. For example, say I copy the text into another text reading program: suddenly a huge number of words are shown as having dashes. Many of these look like normal short dashes, others I have never seen before--they look like "handle," because there is a small line extending vertically at the end of the dash.

My suspicion is somehow ABBYY is hiding these dashes from the scanned text when it is converted into an RTF file, but they are actually still there and so showing up when I try to work with the file in other contexts? Is there a way to prevent this from happening in my future scans and what would be the most convenient way to deal with the problem now?

Thanks for your help.
sovre is offline   Reply With Quote
Old 11-20-2011, 03:33 AM   #2
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 7,516
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
They are probably "soft hyphens" which are a way of indicating places where it is possible to break a word across lines, but should be hidden otherwise. As you see, not all applications know how to deal with them, and I find they are generally overkill. I don't know if you can prevent FineReader from adding them, but I guess you can remove them with search and replace (but note that some of the hyphens should stay).
Jellby is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
em-dashes in mobi AThirstyMind Kindle Formats 4 06-12-2011 09:13 AM
Finereader flagging em dashes proxy Workshop 0 11-25-2010 12:17 PM
txt to mobi - dashes becoming ? cybmole Calibre 5 10-14-2010 11:02 AM
Doing away with long dashes Logseman Sigil 8 03-22-2010 02:29 AM
BD and dashes problem Otter Sony Reader 1 09-25-2007 05:47 AM


All times are GMT -4. The time now is 05:53 AM.


MobileRead.com is a privately owned, operated and funded community.