Quote:
Originally Posted by chaot
The meaning of *nbsp* (non blanc spaces) is not clear to me. A *nbsp* must be syntactically somehow recognizable and different from the *plainspace* - or?
|
By *nbsp* I meant 'non-breaking-space', a.k.a. the unicode char \xa0 - also often found as the html entity
. The calibre Editor Check function will complain if your HTML contains entities. The Beautify function will always replace all html entities with their unicode char equivalent. The Editor visually differentiates normal space and unicode \xa0 by highlighting the latter (as it also does for the mdash and ndash unicode chars to distinguish them from a normal hyphen).
A Regex search for \s will match both a normal space and an \xa0 (and other 'whitespace' chars for that matter, e.g. newline). If you only want to search for '2 or more consecutive normal spaces' then perhaps a regex search for
would be better than \s\s (there's a normal space between those square-brackets)