NLTK分块和遍历结果

from nltk.chunk import RegexpParser grammar = ''' NP: {<DT>?<JJ>*<NN>*} V: {<V.*>}''' chunker = RegexpParser(grammar) token = [] ## Some tokens from my POS tagger chunked = chunker.parse(tokens) print chunked #How do I walk the tree? #for chunk in chunked: # if chunk.??? == 'NP': # print chunk

3条回答

网友

1楼 · 编辑于 2024-05-23 17:39:43

萨维诺的回答很好，但也值得注意的是，子树也可以通过索引访问，例如

for n in range(len(chunked)):
    do_something_with_subtree(chunked[n])

网友

2楼 · 编辑于 2024-05-23 17:39:43

这应该有效：

for n in chunked:
    if isinstance(n, nltk.tree.Tree):               
        if n.label() == 'NP':
            do_something_with_subtree(n)
        else:
            do_something_with_leaf(n)

网友

3楼 · 编辑于 2024-05-23 17:39:43

token中的小错误

from nltk.chunk import RegexpParser
grammar = '''
NP: {<DT>?<JJ>*<NN>*}
V: {<V.*>}'''
chunker = RegexpParser(grammar)
token = [] ## Some tokens from my POS tagger
//chunked = chunker.parse(tokens) // token defined in the previous line but used tokens in chunker.parse(tokens)
chunked = chunker.parse(token) // Change in this line
print chunked

相关问题更多 >

编程相关推荐

热门问题

热门文章

NLTK分块和遍历结果

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >