Distinguishing two strings with a regular expression in Python

2 answers

User

Answer 1 · edited 2024-05-16 01:15:52

I think what you actually want is something like this:

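import re
from collections import defaultdict

regions_to_files = defaultdict(list)
for x in filenames:
    # Everything before the year is captured as the region name.
    matches = re.match(r'(?P<region>.*)_(?P<year>200[0-9]|201[0-7])_test\.csv', x)
    if matches:  # skip filenames that do not follow the pattern
        regions_to_files[matches.group('region')].append(x)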

Now all files related to mato_grosso are available in regions_to_files['mato_grosso'], and all files related to mato_grosso_do_sul are available in regions_to_files['mato_grosso_do_sul'].

To match the first filename:
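To see the grouping in action, here is a minimal self-contained sketch. The filenames list is made up for illustration, and using fullmatch (rather than match) is my own choice so that trailing characters after .csv are rejected; the original answer does not specify either.

```python
import re
from collections import defaultdict

# Hypothetical filenames, invented for this illustration.
filenames = [
    "mato_grosso_2000_test.csv",
    "mato_grosso_2017_test.csv",
    "mato_grosso_do_sul_2000_test.csv",
    "mato_grosso_do_sul_2018_test.csv",  # year outside 2000-2017
]

# The greedy .* takes everything up to the last "_<year>_test.csv",
# so "mato_grosso_do_sul" is captured whole as one region.
pattern = re.compile(r"(?P<region>.*)_(?P<year>200[0-9]|201[0-7])_test\.csv")

regions_to_files = defaultdict(list)
for name in filenames:
    m = pattern.fullmatch(name)
    if m is None:
        continue  # not a file in the expected naming scheme
    regions_to_files[m.group("region")].append(name)

for region, files in regions_to_files.items():
    print(region, files)
```

With this input, the 2018 file is skipped and the other three end up under two keys, one per region.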

# mato_grosso_2000_test.csv
re.match(r'mato_grosso_20(0[0-9]|1[0-7])_test\.csv', filename)

To match the second filename but not the first:

# mato_grosso_do_sul_2000_test.csv
re.match(r'mato_grosso_do_sul_20(0[0-9]|1[0-7])_test\.csv', filename)

The regular expression (0[0-9]|1[0-7]) matches the two-digit strings 00, 01, and so on up to 17.
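A quick way to check both claims (which two-digit strings the alternation accepts, and that each pattern matches only its own filename) is a sketch like the following. The variable names are hypothetical; the patterns are the ones from this answer, with the dot escaped.

```python
import re

# Enumerate every two-digit string and keep those the alternation accepts.
year_suffix = re.compile(r"(0[0-9]|1[0-7])")
accepted = [f"{n:02d}" for n in range(100) if year_suffix.fullmatch(f"{n:02d}")]
print(accepted[0], accepted[-1], len(accepted))  # 00 17 18

first = re.compile(r"mato_grosso_20(0[0-9]|1[0-7])_test\.csv")
second = re.compile(r"mato_grosso_do_sul_20(0[0-9]|1[0-7])_test\.csv")

name_a = "mato_grosso_2000_test.csv"
name_b = "mato_grosso_do_sul_2000_test.csv"

# Each pattern should match only its own filename.
results = {
    ("first", name_a): first.match(name_a) is not None,
    ("first", name_b): first.match(name_b) is not None,
    ("second", name_a): second.match(name_a) is not None,
    ("second", name_b): second.match(name_b) is not None,
}
for key, value in results.items():
    print(key, value)
```

The first pattern rejects the second filename because the text after mato_grosso_ is "do", not "20".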

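User

Answer 2 · edited 2024-05-16 01:15:52

You can use a regular expression with a negative lookahead assertion to find matches of "mato_grosso" that are not followed by "do_sul". For example: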

re.match('mato_grosso_(?!do_sul)', 'mato_grosso_2000_test.csv')

re.match('mato_grosso_(?!do_sul)', 'mato_grosso_do_sul_2000_test.csv')

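This finds a match for the first example but not for the second.

The Python re module documentation discusses regular expression syntax in detail. Search it for "negative lookahead" for more details.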
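As a rough sketch, the lookahead can be wrapped in a small helper for filtering a list of names. The function name is_plain_mato_grosso and the sample list are hypothetical, introduced only for this illustration.

```python
import re

# (?!do_sul) asserts that "do_sul" does not come next, without consuming text.
pattern = re.compile(r"mato_grosso_(?!do_sul)")

def is_plain_mato_grosso(filename):
    """Return True if filename starts with mato_grosso_ not followed by do_sul."""
    return pattern.match(filename) is not None

# Hypothetical sample names.
names = ["mato_grosso_2000_test.csv", "mato_grosso_do_sul_2000_test.csv"]
plain = [n for n in names if is_plain_mato_grosso(n)]
print(plain)
```

Because a lookahead is zero-width, the match itself covers only the "mato_grosso_" prefix; the year and the rest of the filename are not checked by this pattern.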
