在Python中分配和测试Regex？

while (read(line)) { if (m=matchregex(regex1,line)) { /* munch on the components extracted in regex1 by accessing m */ } else if (m=matchregex(regex2,line)) { /* munch on the components extracted in regex2 by accessing m */ } else if ... ... else { error("Unrecognized line format"); } }

for line in open(file,'r').read().splitlines(): if imps(regex1,line): # munch on contents of img elsif imps(regex2,line): # munch on contents of img else: error('Unrecognised line: {}'.format(line))

2条回答

网友

1楼 · 编辑于 2024-06-16 11:04:34

取决于代码的需要。你知道吗

我常用的一种选择是这样的：

# note, order is important here. The first one to match will exit the processing
parse_regexps = [
    (r"^foo", handle_foo),
    (r"^bar", handle_bar),
]

for regexp, handler in parse_regexps:
    m = regexp.match(line)
    if m:
        handler(line)  # possibly other data too like m.groups
        break
else:
    error("Unrecognized format....")

这样做的好处是将处理代码移动到清晰和明显的函数中，从而使测试和更改变得容易。你知道吗

网友

2楼 · 编辑于 2024-06-16 11:04:34

您可以使用continue：

for line in file:
    m = re.match(re1, line)
    if m:
       do stuff
       continue

    m = re.match(re2, line)
    if m:
       do stuff
       continue

    raise BadLine

另一个不太明显的选择是使用如下函数：

def match_any(subject, *regexes):
    for n, regex in enumerate(regexes):
        m = re.match(regex, subject)
        if m:
           return n, m
    return -1, None

然后：

for line in file:
    n, m = match_any(line, re1, re2)
    if n == 0:
       ....
    elif n == 1:
       ....
    else:
       raise BadLine

相关问题更多 >

编程相关推荐

热门问题

热门文章