Python Regex匹配任何内容

2条回答

网友

1楼 · 编辑于 2024-06-07 11:10:00

你可以试试这个：

import re
s = "Test.docx 04-05-2017.docx 04-04-17.pdf secondtest.pdf"

new_data = re.findall("[a-zA-Z]+\.[a-zA-Z]+|\d{1,}-\d{1,}-\d{4}\.[a-zA-Z]+", s)

输出：

^{pr2}$

网友

2楼 · 编辑于 2024-06-07 11:10:00

首先找到所有匹配的，然后分别从列表中删除它们。firstFindtheMatching方法首先使用re库查找匹配的名称：

def firstFindtheMatching(listoffiles):
    """
    :listoffiles: list is the name of the files to check if they match a format
    :final_string: any file that doesn't match the format 01-01-17.pdf (MM-DD-YY.pdf) is put in one str type output. (ALSO) I'm returning the listoffiles so in that you can see the whole output in one place but you really won't need that. 

    """
    import re
    matchednames = re.findall("\d{1,2}-\d{1,2}-\d{1,2}\.pdf", listoffiles)
    #connect all output in one string for simpler handling using sets
    final_string = ' '.join(matchednames)
    return(final_string, listoffiles)

输出如下：

^{pr2}$

如果你想重新生成结果，我用了下面的主函数。这样做的好处是可以向firstFindtheMatching()添加更多的regex。它能帮助你把事情分开。在

def main():

    filenames= "05-08-17.pdf Test.pdf 04-08-17.pdf 08-09-16.pdf 08-09-2016.pdf some-all-letters.pdf"
    [matchednames , alllist] = firstFindtheMatching(filenames)
    print(matchednames, alllist)
    notcommon = set(filenames.split()) - set(matchednames.split())
    print(notcommon)




if __name__ == '__main__':
    main()

相关问题更多 >

编程相关推荐

热门问题

热门文章

Python Regex匹配任何内容

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >