Javascript Regular Expression For Word Boundaries, Tolerating In-word Hyphens And Apostrophes
I'm looking for a Regular Expression for JavaScript that will identify word boundaries in English, while accepting hyphens and apostrophes that appear inside words, but excluding t
Solution 1:
You can organize your word-boundary characters into two groups.
- Characters that cannot be alone.
- Characters that can be alone.
A regex that works with your example would be:
[\s.,'-]{2,}|[\s.]
Now all that's left is to keep adding all non-word characters into those two groups until it fits all of your needs. So you might start adding symbols and more punctuation to those character classes.
Solution 2:
You could write something like that:
(\s|[!-/]|[:-@]|[\[-`]|[\{-~])*\s(\s|[!-/]|[:-@]|[\[-`]|[\{-~])*Or the compact version:
(\s|[!-/:-@\[-`\{-~])*\s(\s|[!-/:-@\[-`\{-~])*The RegExp requires one \s (Space character) and selects als spaces and non alphanumeric chars before and after it.
https://regex101.com/r/bR8sV1/4
\smatches all spaces!-/every char from!to/:-@every char from:to@\[-``every char from[to``\{-~every char from{to~
Post a Comment for "Javascript Regular Expression For Word Boundaries, Tolerating In-word Hyphens And Apostrophes"