Template:LSL Regular Expressions
Revision as of 14:18, 19 October 2022 by Rider Linden (talk | contribs)
Wildcard | ||
---|---|---|
. | Matches any character | |
Anchors | ||
^ | Matches the beginning of the string. | |
$ | Matches the end of the string. | |
Repeats | ||
* | Matches the preceding atom 0 or more times. | |
+ | Matches the preceding atom 1 or more times. | |
? | Matches the preceding atom 0 or 1 times. | |
{n}
{n,} {n, m} |
Matches the preceding atom n, n or more, or between n and m times. | |
Sub-expressions | ||
(expression) | Text enclosed in parentheses is a marked sub-expression. Text matched as part of a sub-expressions is split out and may be repeated. | |
Alternation | ||
a | b | Match either a or b. | |
Character Sets | ||
[abc] | Matches any one of the enumerated characters. | |
[a-c] | Matches any character in the specified range. | |
[^abc] | Matches any character other than the enumerated characters. | |
[[:name:]] | Matches any character of the named class. | |
Any of the above character set definitions may be combined. | ||
Escape Sequences | ||
Specific Characters | ||
\e | ASCII 0x1B, ESC | |
\n | New line | |
\r | Carriage return | |
\t | Tab | |
\xdd | Matches an ASCII character with the code dd | |
Single character classes | ||
\d
\D |
Any decimal digit. | \d is equivalent to [[:digit:]] or [0-9]
\D is equivalent to [^[:digit:]] or [^0-9] |
\l
\L |
Any lower case character. | \l is equivalent to [[:lower:]] or [a-z]
\L is equivalent to [^[:lower:]] or [^a-z] |
\s
\S |
Any whitespace character. | \s is equivalent to [[:space:]] or [ \t\r\n]
\S is equivalent to [^[:space:]] or [^ \t\r\n] |
\u
\U |
Any upper case character. | \u is equivalent to [[:upper:]] or [A-Z]
\U is equivalent to [^[:upper:]] or [^A-Z] |
\w
\W |
Any "word" character.
Alphanumeric plus underscore |
\w is equivalent to [[:upper:][:lower:][:digit:]_] or [A-Za-z0-9_]
\W is equivalent to [^[:upper:][:lower:][:digit:]_] or [^A-Za-z0-9_] |
Word boundaries | ||
\< | Start of word. | |
\> | End of word | |
\b | ||
\B | Not a word boundary. | |
Named Character Classes | ||
alnum | Any alpha-numeric character. |
|
alpha | Any alphabetic character. |
|
blank | Any whitespace character that is not a line separator. | |
cntrl | Any control character |
|
digit
d |
Any decimal digit |
|
lower
l |
Any lower case character. |
|
Any printable character. | ||
punct | Any punctiation character. | |
space
s |
Any whitespace character. | |
upper
u |
Any upper case character. |
|
word
w |
Any control character |
|
xdigit
w |
Any hexadecimal digit character |
|