首页 > 其他分享> 文章详细

正则表达式

2019-02-07 13:40:30 阅读：262 来源： 互联网

标签：search group 正则表达式 matches baz re match

1. Rules

1.1. Classical Regex

L(ϵ) = L("") = {""}

If c is a character, L(c) = {"c"}

If R₁, R₂ are r.e.s, L(R₁R₂) = {x₁x₂|x₁∈L(R₁), x₂∈L(R₂)}

L(R₁|R₂) = L(R₁)∪L(R₂)

L(R∗) = L(ϵ)∪L(R)∪L(RR)∪· · ·

L((R)) = L(R)

Precedence: *, concatenation, union

Grouping: parenthesises

1.2. Abbreviations

character lists: [a-zA-Z]

negative character lists: [^a-z]

character classes: .(dot), \d, \s

L(R+) = L(RR∗)

L(R?) = L(ϵ |R)

1.3. Extensions

“capture” parenthesizes expressions

m = re.match(r’\s*(\d+)\s*,\s*(\d+)\s*’, ’12,34’), have m.group(1) == ’12’, m.group(2) == ’34’

m.group(x) means the xth pair of parenthesis

lazy vs. greedy quantifiers

re.match(r’(\d+).*’, ’1234ab’) makes group(1) match ’1234’

re.match(r’(\d+?).*’, ’1234ab’) makes group(1) match ’1’

boundaries

re.search(r’(^abc|qef)’, L) matches abc only at beginning of string, and qef anywhere

re.search(r’(?m)(^abc|qef)’, L) matches abc only at beginning of string or of any line

(?m) enables regex to read multiline data

re.search(r’rowr(?=baz)’, L) matches an instance of ‘rowr’, but only if ‘baz’ follows (does not match baz)

(?=baz) means the pattern is followed by baz

re.search(r’(?<=rowr)baz’, L) matches an instance of ‘baz’, but only if immediately preceded by ‘rowr’ (does not match rowr)

(?<=rowr) means the pattern’s precedence is followed by rowr

non-linear patterns

re.search(r’(\S+),\1’, L) matches a word followed by the same word after a comma

标签：search,group,正则表达式,matches,baz,re,match
来源： https://www.cnblogs.com/truelycloud/p/10354715.html

本站声明： 1. iCode9 技术分享网（下文简称本站）提供的所有内容，仅供技术学习、探讨和分享；
2. 关于本站的所有留言、评论、转载及引用，纯属内容发起人的个人观点，与本站观点和立场无关；
3. 关于本站的所有言论和文字，纯属内容发起人的个人观点，与本站观点和立场无关；
4. 本站文章均是网友提供，不完全保证技术分享内容的完整性、准确性、时效性、风险性和版权归属；如您发现该文章侵犯了您的权益，可联系我们第一时间进行删除；
5. 本站为非盈利性的个人网站，所有内容不会用来进行牟利，也不会利用任何形式的广告来间接获益，纯粹是为了广大技术爱好者提供技术内容和技术思想的分享性交流网站。

ICode9

正则表达式