pyparsing如何跳过缩进块的结尾？

identifier: some description text here which will wrap on to the next line. the follow-on text should be indented. it may contain identifier: and any text at all is allowed next_identifier: more description, short this time last_identifier: blah blah

5311 def checkSubIndent(s,l,t): 5312 curCol = col(l,s) 5313 if curCol > indentStack[-1]: 5314 indentStack.append( curCol ) 5315 else: -> 5316 raise ParseException(s,l,"not a subentry") 5317 ipdb> indentStack [1] ipdb> curCol 1

1条回答

网友

1楼 · 发布于 2024-04-24 08:04:18

使用indentedBlock时，传入的参数是块中每一行的表达式，因此它不应该是indentedBlock(ZeroOrMore(line_expression), stack)，而应该是indentedBlock(line_expression, stack)。Pyparsing包含一个用于“从这里到行尾的所有内容”的内置表达式，名为restOfLine，因此我们只将其用于缩进块中每一行的表达式：

import pyparsing as pp

NL = pp.LineEnd().suppress()

label = pp.ungroup(pp.Word(pp.alphas, pp.alphanums+'_') + pp.Suppress(":"))

indent_stack = [1]
# see corrected version below
#description = pp.Group((pp.Empty() 
#                    + pp.restOfLine + NL
#                    + pp.ungroup(pp.indentedBlock(pp.restOfLine, indent_stack))))

description = pp.Group(pp.restOfLine + NL
                       + pp.Optional(pp.ungroup(~pp.StringEnd() 
                                                + pp.indentedBlock(pp.restOfLine, 
                                                                   indent_stack))))

labeled_text = pp.Group(label("label") + pp.Empty() + description("description"))

我们使用ungroup删除由indentedBlock创建的额外级别的嵌套，但是我们还需要删除在indentedBlock中内部创建的每行嵌套。我们通过一个解析操作来执行此操作：

^{pr2}$

在这一点上，我们已经差不多完成了。以下是解析并转储的示例文本：

parsed_data = (pp.OneOrMore(labeled_text)).parseString(sample)    
print(parsed_data[0].dump())

['identifier', ['some description text here which will wrap', 'on to the next line. the follow-on text should be', 'indented. it may contain identifier: and any text', 'at all is allowed']]
- description: ['some description text here which will wrap', 'on to the next line. the follow-on text should be', 'indented. it may contain identifier: and any text', 'at all is allowed']
- label: 'identifier'

或使用此代码拉出“标签”和“说明”字段：

for item in parsed_data:
    print(item.label)
    print('..' + '\n..'.join(item.description))
    print()

identifier
..some description text here which will wrap
..on to the next line. the follow-on text should be
..indented. it may contain identifier: and any text
..at all is allowed

next_identifier
..more description, short this time

last_identifier
..blah blah

相关问题更多 >

编程相关推荐

热门问题

热门文章