View Single Post
Old 05-18-2016, 02:08 PM   #8
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,252
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
Quote:
Originally Posted by chaot View Post
The meaning of *nbsp* (non blanc spaces) is not clear to me. A *nbsp* must be syntactically somehow recognizable and different from the *plainspace* - or?
By *nbsp* I meant 'non-breaking-space', a.k.a. the unicode char \xa0 - also often found as the html entity  . The calibre Editor Check function will complain if your HTML contains entities. The Beautify function will always replace all html entities with their unicode char equivalent. The Editor visually differentiates normal space and unicode \xa0 by highlighting the latter (as it also does for the mdash and ndash unicode chars to distinguish them from a normal hyphen).

A Regex search for \s will match both a normal space and an \xa0 (and other 'whitespace' chars for that matter, e.g. newline). If you only want to search for '2 or more consecutive normal spaces' then perhaps a regex search for
Code:
[ ]{2,}
would be better than \s\s (there's a normal space between those square-brackets)
jackie_w is offline   Reply With Quote