How do I retrieve only the second instance of a string in a text file?

with open(wave_cat,'r') as catID:
    for i, cat_line in enumerate(catID):
        if not len(cat_line.strip()) == 0:
            line = cat_line.split()
            #replen = re.sub('length:','length0:','length:')
            if line[0] == '#' and line[1] == 'event':
                num = long(line[2])
            elif line[0] == 'length:':
                Length = float(line[2])

3 answers

User

Answer 1 · edited 2024-04-26 07:16:32

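You're on the right track. Deferring the split until you actually need it may be slightly faster. Also, if you are scanning many files and only need the second length entry from each, breaking out of the loop as soon as you have seen it can save a lot of time.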

length_seen = 0
elements = []
with open(wave_cat,'r') as catID:
    for line in catID:
        line = line.strip()
        if not line:
            continue
        if line.startswith('# event'):
            element = {'num': int(line.split()[2])}
            elements.append(element)
            length_seen = 0
        elif line.startswith('length:'):
            length_seen += 1
            if length_seen == 2:
                element['length'] = float(line.split()[2])
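
The code above keeps reading so that it can collect every event, which means it never takes the early exit the answer mentions. A minimal sketch of that early exit, assuming a file with a single event in which each `length:` line has at least two numeric fields (the sample text and the name `second_length` are made up for illustration):

```python
import io

def second_length(lines):
    """Return the third field of the second 'length:' line, or None."""
    seen = 0
    for line in lines:
        line = line.strip()
        if line.startswith('length:'):
            seen += 1
            if seen == 2:
                # Stop here: the rest of the file is never read.
                return float(line.split()[2])
    return None

# Hypothetical sample; the real file layout is not shown in the question.
sample = io.StringIO(
    "# event 101\n"
    "length: 1.0 2.5\n"
    "\n"
    "length: 3.0 4.5\n"
    "length: 5.0 6.5\n"
)
result = second_length(sample)
print(result)
```

Because the function returns from inside the loop, an open file passed in place of `sample` is only read up to the second `length:` line.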

User

Answer 2 · edited 2024-04-26 07:16:32

Use a counter:

with open(wave_cat, 'r') as catID:
    ct = 0
    for cat_line in catID:
        line = cat_line.split()
        if not line:
            continue  # skip blank lines
        if line[:2] == ['#', 'event']:
            num = int(line[2])
            ct = 0  # new event: restart the 'length:' count
        elif line[0] == 'length:':
            ct += 1
            if ct == 2:
                # only the second 'length:' line of each event is kept
                Length = float(line[2])
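
The same counting idea can also be written without an explicit counter by slicing a generator of matching lines. This is only a sketch of an alternative, not the answerer's method; the helper name `nth_length` and the sample text are assumptions, and it only matches lines that start with `length:` with no leading whitespace.

```python
import io
from itertools import islice

def nth_length(lines, n=2):
    """Return the third field of the n-th 'length:' line (1-based), or None."""
    # Lazily yields only the lines of interest; nothing is read ahead.
    matches = (ln for ln in lines if ln.startswith('length:'))
    # islice(matches, n - 1, n) produces at most one item: the n-th match.
    hit = next(islice(matches, n - 1, n), None)
    return None if hit is None else float(hit.split()[2])

# Hypothetical sample; the real file layout is not shown in the question.
sample_text = (
    "# event 101\n"
    "length: 1.0 2.5\n"
    "length: 3.0 4.5\n"
    "length: 5.0 6.5\n"
)
second = nth_length(io.StringIO(sample_text))
print(second)
```

Like the counter version, this stops consuming the file as soon as the requested line has been found.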

User

Answer 3 · edited 2024-04-26 07:16:32

If the whole file can be read into memory, just run a regex against the file contents:

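import glob
import re

# Captures the last field of the second 'length:' line after each '# event' line.
pat = re.compile(r'^# event.*?^length:.*?^length:\s[\d.]+\s(\d+\.\d+)', re.S | re.M)
for fn in glob.glob('*.txt'):  # hypothetical pattern; substitute your own file list
    with open(fn) as f:
        try:
            nm = pat.findall(f.read())[1]  # index 1 = second match in the file
        except IndexError:
            nm = ''
        print(nm)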

If the files are large, use mmap:

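import glob
import mmap
import re

nth = 1  # zero-based index of the match to print (1 = second match)
# Bytes pattern, because an mmap object exposes bytes in Python 3.
pat = re.compile(rb'^# event.*?^length:.*?^length:\s[\d.]+\s(\d+\.\d+)', re.S | re.M)
for fn in glob.glob('*.txt'):  # hypothetical pattern; substitute your own file list
    with open(fn, 'rb') as f:
        # Read-only mapping; mmap raises ValueError on an empty file.
        mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
        for i, m in enumerate(pat.finditer(mm)):
            if i == nth:
                print(m.group(1).decode())
                break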
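
To see what the pattern actually captures, it helps to run it on a small string. The sketch below uses made-up sample text (the real file layout is not shown in the question) and assumes each `length:` line carries two numbers, the second with a decimal point, which is what the pattern requires.

```python
import re

pat = re.compile(
    r'^# event.*?^length:.*?^length:\s[\d.]+\s(\d+\.\d+)',
    re.S | re.M,
)

# Hypothetical sample with two events, each followed by two 'length:' lines.
sample = (
    "# event 101\n"
    "length: 1.0 2.5\n"
    "length: 3.0 4.5\n"
    "# event 102\n"
    "length: 7.0 8.5\n"
    "length: 9.0 10.25\n"
)

# One captured value per event: the last field of that event's
# second 'length:' line.
values = pat.findall(sample)
print(values)
```

Each match already corresponds to the second `length:` line of one event, so index 0 is the first event's value, and index 1 (as used with `[1]` and `nth = 1` above) is the second event's value.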
