Template:LSL Regular Expressions
| Wildcard | ||
|---|---|---|
| . | Matches any character | |
| Anchors | ||
| ^ | Matches the beginning of the string. | |
| $ | Matches the end of the string. | |
| Repeats | ||
| * | Matches the preceding atom 0 or more times. | |
| + | Matches the preceding atom 1 or more times. | |
| ? | Matches the preceding atom 0 or 1 times. | |
| {n}
{n,} {n, m} |
Matches the preceding atom n, n or more, or between n and m times. | |
| Sub-expressions | ||
| (expression) | Text enclosed in parentheses is a marked sub-expression. Text matched as part of a sub-expressions is split out and may be repeated. | |
| Alternation | ||
| a | b | Match either a or b. | |
| Character Sets | ||
| [abc] | Matches any one of the enumerated characters. | |
| [a-c] | Matches any character in the specified range. | |
| [^abc] | Matches any character other than the enumerated characters. | |
| [[:name:]] | Matches any character of the named class. | |
| Any of the above character set definitions may be combined. | ||
| Escape Sequences | ||
| Specific Characters | ||
| \e | ASCII 0x1B, ESC | |
| \n | New line | |
| \r | Carriage return | |
| \t | Tab | |
| \xdd | Matches an ASCII character with the code dd | |
| Single character classes | ||
| \d
\D |
Any decimal digit. | \d is equivalent to [[:digit:]] or [0-9]
\D is equivalent to [^[:digit:]] or [^0-9] |
| \l
\L |
Any lower case character. | \l is equivalent to [[:lower:]] or [a-z]
\L is equivalent to [^[:lower:]] or [^a-z] |
| \s
\S |
Any whitespace character. | \s is equivalent to [[:space:]] or [ \t\r\n]
\S is equivalent to [^[:space:]] or [^ \t\r\n] |
| \u
\U |
Any upper case character. | \u is equivalent to [[:upper:]] or [A-Z]
\U is equivalent to [^[:upper:]] or [^A-Z] |
| \w
\W |
Any "word" character.
Alphanumeric plus underscore |
\w is equivalent to [[:upper:][:lower:][:digit:]_] or [A-Za-z0-9_]
\W is equivalent to [^[:upper:][:lower:][:digit:]_] or [^A-Za-z0-9_] |
| Word boundaries | ||
| \< | Start of word. | |
| \> | End of word | |
| \b | ||
| \B | Not a word boundary. | |
| Named Character Classes | ||
| alnum | Any alpha-numeric character. |
|
| alpha | Any alphabetic character. |
|
| blank | Any whitespace character that is not a line separator. | |
| cntrl | Any control character |
|
| digit
d |
Any decimal digit |
|
| lower
l |
Any lower case character. |
|
| Any printable character. | ||
| punct | Any punctiation character. | |
| space
s |
Any whitespace character. | |
| upper
u |
Any upper case character. |
|
| word
w |
Any control character |
|
| xdigit
w |
Any hexadecimal digit character |
|