标签:search group 正则表达式 matches baz re match
1. Rules
1.1. Classical Regex
L(ϵ) = L("") = {""}
If c is a character, L(c) = {"c"}
If R1, R2 are r.e.s, L(R1R2) = {x1x2|x1∈L(R1), x2∈L(R2)}
L(R1|R2) = L(R1)∪L(R2)
L(R∗) = L(ϵ)∪L(R)∪L(RR)∪· · ·
L((R)) = L(R)
Precedence: *, concatenation, union
Grouping: parenthesises
1.2. Abbreviations
character lists: [a-zA-Z]
negative character lists: [^a-z]
character classes: .(dot), \d, \s
L(R+) = L(RR∗)
L(R?) = L(ϵ |R)
1.3. Extensions
“capture” parenthesizes expressions
m = re.match(r’\s*(\d+)\s*,\s*(\d+)\s*’, ’12,34’), have m.group(1) == ’12’, m.group(2) == ’34’
m.group(x) means the xth pair of parenthesis
lazy vs. greedy quantifiers
re.match(r’(\d+).*’, ’1234ab’) makes group(1) match ’1234’
re.match(r’(\d+?).*’, ’1234ab’) makes group(1) match ’1’
boundaries
re.search(r’(^abc|qef)’, L) matches abc only at beginning of string, and qef anywhere
re.search(r’(?m)(^abc|qef)’, L) matches abc only at beginning of string or of any line
(?m) enables regex to read multiline data
re.search(r’rowr(?=baz)’, L) matches an instance of ‘rowr’, but only if ‘baz’ follows (does not match baz)
(?=baz) means the pattern is followed by baz
re.search(r’(?<=rowr)baz’, L) matches an instance of ‘baz’, but only if immediately preceded by ‘rowr’ (does not match rowr)
(?<=rowr) means the pattern’s precedence is followed by rowr
non-linear patterns
re.search(r’(\S+),\1’, L) matches a word followed by the same word after a comma
标签:search,group,正则表达式,matches,baz,re,match 来源: https://www.cnblogs.com/truelycloud/p/10354715.html
本站声明: 1. iCode9 技术分享网(下文简称本站)提供的所有内容,仅供技术学习、探讨和分享; 2. 关于本站的所有留言、评论、转载及引用,纯属内容发起人的个人观点,与本站观点和立场无关; 3. 关于本站的所有言论和文字,纯属内容发起人的个人观点,与本站观点和立场无关; 4. 本站文章均是网友提供,不完全保证技术分享内容的完整性、准确性、时效性、风险性和版权归属;如您发现该文章侵犯了您的权益,可联系我们第一时间进行删除; 5. 本站为非盈利性的个人网站,所有内容不会用来进行牟利,也不会利用任何形式的广告来间接获益,纯粹是为了广大技术爱好者提供技术内容和技术思想的分享性交流网站。