NLTK：如何用python获得循环中数组的特定内容？

import nltk from nltk.corpus.reader import TaggedCorpusReader reader = TaggedCorpusReader('cookbook', r'.*\.pos') train_sents=reader.tagged_sents() for sent in train_sents: tags = [tag[1] for (word, tag) in nltk.bigrams(sent) if word[0]=='ny'] #0 is for the word and 1 is for the tag, so tag[0] get you the word and #tag[1] the tag, the same with word[0] and word[1] fd = nltk.FreqDist(tags) fd.tabulate()

import nltk from nltk.corpus.reader import TaggedCorpusReader reader = TaggedCorpusReader('cookbook', r'.*\.pos') train_sents=reader.tagged_sents() for sent in train_sents: #i change the line here tags = [tag[1] for (word, tag) in nltk.bigrams(sent) if tag[1]=='DTDEF'] fd = nltk.FreqDist(tags) fd.tabulate()

import nltk from nltk.corpus.reader import TaggedCorpusReader reader = TaggedCorpusReader('cookbook', r'.*\.pos') train_sents=reader.tagged_sents() tags=[] count=0 for sent in train_sents: for (word,tag) in sent: #if tag is DTDEF i want to get the tag after it if tag=="DTDEF": tags[count]=tag[acutalIndex+1] count+=1 fd = nltk.FreqDist(tags) fd.tabulate()

2条回答

网友
1楼 · 编辑于 2024-05-29 04:37:46

感谢#CrazySqueak的帮助，我使用了他的代码并编辑了一些部分来获得：
import nltk from nltk.corpus.reader import TaggedCorpusReader reader = TaggedCorpusReader('cookbook', r'.*\.pos') train_sents=reader.tagged_sents() tags = [] foundit=False for sent in train_sents: #i change the line here for (word,tag) in nltk.bigrams(sent): if foundit: #If the entry is after 'DTDEF' tags.append(tag[1]) #Add it to the resulting list of tags, i change #tag [1] instead, if you use only tag, it will #store not only the tag but the word as well #of foundit foundit=False #I need to make it false again, cause it will store again even #if the tag is != of DTDEF if tag[1]=='DTDEF': #If the entry is 'DTDEF' foundit=True #Set the 'After DTDEF' flag. fd = nltk.FreqDist(tags) fd.tabulate()
再次感谢你的建议和回答。你知道吗

网友
2楼 · 编辑于 2024-05-29 04:37:46

我不是100%确定我能理解，但是如果您希望在一个特定条目之后获得列表中的所有条目，最简单的方法是：
foundthing=False result = [] for i in list: if foundthing: result.append(i) if i == "Thing I'm Looking For": foundthing = True
将此添加到代码中会导致：
import nltk from nltk.corpus.reader import TaggedCorpusReader reader = TaggedCorpusReader('cookbook', r'.*\.pos') train_sents=reader.tagged_sents() tags = [] foundit=False for sent in train_sents: #i change the line here for (word,tag) in nltk.bigrams(sent): if foundit: #If the entry is after 'DTDEF' tags.append(foundit) #Add it to the resulting list of tags. if tag[1]=='DTDEF': #If the entry is 'DTDEF' foundit=True #Set the 'After DTDEF' flag. fd = nltk.FreqDist(tags) fd.tabulate()
希望这有帮助。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章