How can I use Python to parse multiple lines that fall within a specific pattern in a text file?

dummy01234567890
0987654321dummy
-------start-------(It is possible to modify)
text line1
text line2
-------end---------(It is possible to modify)
12345678910
qwertyuiop
-------start-------(It is possible to modify)
text line3
text line4
-------end---------(It is possible to modify)
;p12309809128309123
dummyline1235567

2 answers

User

#1 · Edited 2024-04-27 00:36:26

You can do the following to get the desired result:

text = """dummy01234567890
    0987654321dummy 
       -start   -(It is possible to modify)
    text line1
    text line2
       -end    -(It is possible to modify)
    12345678910
    qwertyuiop        
       -start   -(It is possible to modify)
    text line3
    text line4
       -end    -(It is possible to modify)
    ;p12309809128309123
    dummyline1235567"""

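text_list = text.splitlines()
# Assumes the fixed layout above: every 6-line group has its two payload lines at offsets 3 and 4.
# Integer division (//) gives the number of complete 6-line groups.
print(['\n'.join([text_list[3 + i*6].strip(), text_list[4 + i*6].strip()]) for i in range(len(text_list) // 6)])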

This produces:

['text line1\ntext line2', 'text line3\ntext line4']
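The fixed offsets above only work while every block sits at the same position. As an alternative sketch (not the answerer's method), a regular expression can locate the marker lines instead. The function name `extract_blocks` is hypothetical, and the pattern assumes each marker line consists of optional whitespace, one or more dashes, the keyword, and then anything up to the end of the line:

```python
import re


def extract_blocks(text):
    """Return the lines between each start/end marker pair, joined by newlines."""
    # Marker line: optional indentation, dashes, the keyword, then the rest of the line.
    # re.DOTALL lets (.*?) span several lines; re.MULTILINE anchors ^ and $ per line.
    pattern = re.compile(
        r"^[ \t]*-+start\b[^\n]*\n(.*?)^[ \t]*-+end\b[^\n]*$",
        re.DOTALL | re.MULTILINE,
    )
    blocks = []
    for body in pattern.findall(text):
        # Strip indentation and skip blank lines inside the block.
        lines = [line.strip() for line in body.splitlines() if line.strip()]
        blocks.append("\n".join(lines))
    return blocks
```

Calling `extract_blocks(text)` on the sample string from this answer should yield the same two-element list, and the result no longer depends on how many lines sit before or between the blocks.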

User

#2 · Edited 2024-04-27 00:36:26

A finite-state machine adapts well to changes in the input and is simple enough for most needs.

state = 'init'
arrays = []
lines = []
with open('textfile.txt') as f:
    for line in f:
        # Assumes a marker line holds only dashes around the keyword, e.g. -------start-------
        word = line.strip().strip('-')
        if state == 'init':  # looking for a start marker
            if word != 'start':
                continue
            state = 'start'
            lines = []
        elif state == 'start':  # inside a block: collect lines until the end marker
            if word != 'end':
                lines.append(line.strip())
                continue
            # end marker reached: store the block and look for the next one
            arrays.append('\n'.join(lines))
            state = 'init'
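To try the state machine without a file on disk, the same logic can be wrapped in a function that accepts any iterable of lines. This is a minimal sketch: the name `parse_blocks` and its `start`/`end` parameters are hypothetical, and it assumes each marker line contains nothing but dashes around the keyword.

```python
def parse_blocks(lines, start="start", end="end"):
    """Collect the text between start/end marker lines with a two-state machine."""
    state = "init"
    current = []
    blocks = []
    for raw in lines:
        # Drop surrounding whitespace, then the dashes that frame a marker keyword.
        word = raw.strip().strip("-")
        if state == "init":
            if word == start:
                state = "start"
                current = []
        elif word == end:
            # Close the current block and go back to looking for a start marker.
            blocks.append("\n".join(current))
            state = "init"
        else:
            current.append(raw.strip())
    return blocks


sample = """dummy01234567890
-------start-------
text line1
text line2
-------end---------
12345678910
-------start-------
text line3
text line4
-------end---------
dummyline1235567"""

print(parse_blocks(sample.splitlines()))
# prints ['text line1\ntext line2', 'text line3\ntext line4']
```

A block that is opened but never closed before the input ends is discarded, because blocks are only stored when an end marker is seen.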
