05-14-2013, 09:44 AM   #10
theducks
Well trained by Cats
Posts: 29,807
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by Hoods7070
@grannyGrumpy: Glad someone understands the sentiments behind my original question.

FWIW I generally search/replace all quotes to straight quotes, then use "smarten punctuation" in the Calibre ePub Modify plugin to fix them back to curly quotes. This does an excellent but not a 100% job, so I usually sort the quotes out BEFORE any other editing. As the code view fixed fonts (I use Courier New) show curly quotes as angled straight quotes it's relatively easy to spot the mistakes. Generally these occur with the use of the wrong angle in abbreviations/truncations like 'em, 'twas etc. - which can mostly be S/R-ed easily enough back in Sigil.

Unfortunately a S/R in Sigil does not even "see" some stuff, such as the little rectangles/or boxes that show up periodically. I have not figured out any logic as to when/why these things appear or what they might represent. Seems you just have to weed them out visually. (You can copy and paste these glyphs, but they don't register in a search.) Anyway, it'd be brilliant if there were an option in Sigil for these and any other extra-ordinary glyphs to be picked up and highlighted during a spellcheck.

Not sure if any Sigil developers follow these threads, but hope they do and would consider some changes that would minimize the need for Regex. The thought of defining a regex to root out evils is far beyond my minimal understanding of same.

Apropos - thanks @myducks for pointing to that excellent tutorial!
The squares/rectangles are glyphs that are missing or unsupported in the current font, or the result of a wrongly declared character encoding. When you have this problem there is USUALLY more than one affected character.
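For anyone who wants to hunt these down without eyeballing every page, here is a minimal Python sketch (not a Sigil feature). It assumes you have already read the chapter's text into a string; the function name `list_unusual_chars` is made up for illustration. It reports every character outside printable ASCII with its position, code point, and Unicode name, so the boxes become something you can identify.

```python
import unicodedata

def list_unusual_chars(text, allowed=""):
    """List every character outside printable ASCII.

    Returns tuples of (line number, column, 'U+XXXX', Unicode name).
    `allowed` is an optional string of characters to ignore,
    e.g. curly quotes you already know are fine.
    """
    findings = []
    # Split on "\n" only, so odd control characters are reported, not swallowed.
    for line_no, line in enumerate(text.split("\n"), start=1):
        for col, ch in enumerate(line, start=1):
            if " " <= ch <= "~" or ch in "\t\r" or ch in allowed:
                continue  # printable ASCII, tab, CR, or explicitly allowed
            # Control characters have no Unicode name, hence the fallback.
            name = unicodedata.name(ch, "<unnamed>")
            findings.append((line_no, col, f"U+{ord(ch):04X}", name))
    return findings

if __name__ == "__main__":
    sample = "It\u2019s fine\nbad\u0092char"
    for finding in list_unusual_chars(sample):
        print(finding)
```

An "<unnamed>" hit in the U+0080 to U+009F range is a typical sign of text that was really CP1252 but got read as something else.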
I have resorted to a HEX editor (like http://www.physics.ohio-state.edu/~p...dit/index.html ) to discover some of the character VALUES. THEN I spend some time looking up those values in various character maps until I find a logical usage (2 or 3 different values that make sense in the same chart usually give a clue to the proper encoding, e.g. CP1252). The above steps ARE tedious.
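The chart lookup can be roughed out in Python as well. This is only a sketch under assumptions: the function name `try_encodings` and the list of candidate encodings are my own picks, and you still have to judge by eye which result reads sensibly. It decodes the same raw bytes with each candidate so the outcomes can be compared side by side.

```python
def try_encodings(raw, candidates=("utf-8", "cp1252", "latin-1", "mac_roman")):
    """Decode the same bytes with several candidate encodings.

    Returns a dict mapping encoding name to the decoded text,
    or to None when the bytes are not valid in that encoding.
    """
    results = {}
    for enc in candidates:
        try:
            results[enc] = raw.decode(enc)
        except UnicodeDecodeError:
            results[enc] = None
    return results

if __name__ == "__main__":
    # Bytes as they might be copied out of a hex editor.
    raw = bytes.fromhex("93 48 65 6c 6c 6f 94 20 69 74 92 73")
    for enc, text in try_encodings(raw).items():
        print(f"{enc:10} {text!r}")
```

With these sample bytes, only the cp1252 line comes out as sensibly quoted English (curly double quotes and a curly apostrophe), which is the same "2 or 3 matches in one chart" clue, just quicker.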