Python：如何从文件中提取字符串

-#- --length-- -----date/time------ path 3 97103164 Feb 7 2016 01:36:16 +05:30 taas/NN41_R11_Golden_Image 4 1896 Sep 27 2019 14:22:08 +05:30 taas/NN41_R11_Golden_Config 5 1876 Nov 27 2017 20:07:50 +05:30 taas/nfast_default.cfg

3条回答

网友

1楼 · 编辑于 2024-05-23 15:17:04

如果路径可以包含空格，那么这实际上是一个令人惊讶的复杂过程。
您需要使用maxsplit参数来split标识路径字段。你知道吗

with open("outlog.out") as f:
    for line in f:
         field = line.split(None,7)
         if "Golden_Image" in field:
            print(field)

网友

2楼 · 编辑于 2024-05-23 15:17:04

一次阅读全部内容，然后再进行搜索是没有效率的。相反，可以逐行读取文件，如果行符合条件，则可以提取路径，而无需进一步拆分和使用RegEx。你知道吗

使用以下正则表达式获取路径

\s+(?=\S*$).*

链接：https://regex101.com/r/zuH0Zv/1

如果工作代码为：

import re
data = "3     97103164 Feb 7 2016 01:36:16 +05:30 taas/NN41_R11_Golden_Image"
regex = r"\s+(?=\S*$).*"
test_str = "3     97103164 Feb 7 2016 01:36:16 +05:30 taas/NN41_R11_Golden_Image"
matches = re.search(regex, test_str)
print(matches.group().strip())

网友

3楼 · 编辑于 2024-05-23 15:17:04

在线上进行拆分，并检查拆分部分中是否存在“Golden\u Image”字符串。你知道吗

import re 
with open("outlog.out") as f:
    for line in f:
         if not "Golden_Image" in i:
             continue
         print re.search(r'\S*Golden_Image\S*', line).group()

或者

images = re.findall(r'\S*Golden_Image\S*', open("outlog.out").read())

示例：

>>> s = '''
-#- --length-- -----date/time------ path

 3     97103164 Feb 7 2016 01:36:16 +05:30 taas/NN41_R11_Golden_Image
 4         1896 Sep 27 2019 14:22:08 +05:30 taas/NN41_R11_Golden_Config
 5         1876 Nov 27 2017 20:07:50 +05:30 taas/nfast_default.cfg'''.splitlines()
>>> for line in s:
    for i in line.split():
        if  "Golden_Image" in i:
            print i


taas/NN41_R11_Golden_Image
>>>

相关问题更多 >

编程相关推荐

热门问题

热门文章