|11-19-2011, 04:02 PM||#1|
Join Date: Dec 2010
Location: United States
Device: Kindle Paperwhite 3; iPad Mini
I have scanned a few texts into RTF format. They look just fine when viewed in this format in Word, but if I try to use the document with some other software, I run into a problem. For example, say I copy the text into another text reading program: suddenly a huge number of words are shown as having dashes. Many of these look like normal short dashes, others I have never seen before--they look like "handle," because there is a small line extending vertically at the end of the dash.
My suspicion is somehow ABBYY is hiding these dashes from the scanned text when it is converted into an RTF file, but they are actually still there and so showing up when I try to work with the file in other contexts? Is there a way to prevent this from happening in my future scans and what would be the most convenient way to deal with the problem now?
Thanks for your help.
|11-20-2011, 03:33 AM||#2|
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
They are probably "soft hyphens" which are a way of indicating places where it is possible to break a word across lines, but should be hidden otherwise. As you see, not all applications know how to deal with them, and I find they are generally overkill. I don't know if you can prevent FineReader from adding them, but I guess you can remove them with search and replace (but note that some of the hyphens should stay).
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|em-dashes in mobi||AThirstyMind||Kindle Formats||4||06-12-2011 09:13 AM|
|Finereader flagging em dashes||proxy||Workshop||0||11-25-2010 12:17 PM|
|txt to mobi - dashes becoming ?||cybmole||Calibre||5||10-14-2010 11:02 AM|
|Doing away with long dashes||Logseman||Sigil||8||03-22-2010 02:29 AM|
|BD and dashes problem||Otter||Sony Reader||1||09-25-2007 05:47 AM|