Count the frequency of words in a list and sort by frequency

original list = ["the", "car",....]
newlst = []
frequency = []
for word in the original list:
    if word not in newlst:
        newlst.append(word)
        set frequency = 1
    else:
        increase the frequency
sort newlst based on frequency list
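
The pseudocode in the question can be made runnable roughly as follows. This is only a sketch that keeps the question's two parallel lists; the function name `count_and_sort` and the sample contents of `original_list` are assumptions for illustration, since the question elides the real data.

```python
def count_and_sort(original_list):
    """Return (word, frequency) pairs sorted by descending frequency."""
    newlst = []      # unique words, in order of first appearance
    frequency = []   # frequency[i] is the count for newlst[i]
    for word in original_list:
        if word not in newlst:
            newlst.append(word)
            frequency.append(1)
        else:
            frequency[newlst.index(word)] += 1
    # Pair each word with its count, then sort by count, highest first.
    pairs = list(zip(newlst, frequency))
    pairs.sort(key=lambda pair: pair[1], reverse=True)
    return pairs


# Hypothetical sample data; the question does not show the full list.
original_list = ["the", "car", "the", "road", "car", "the"]
result = count_and_sort(original_list)
print(result)
```

The membership test and `index` lookup both scan `newlst`, so this approach slows down as the number of distinct words grows; the answers below avoid that.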

3 Answers

User

#1 · edited 2024-05-13 18:59:26

# Read the words into a list.
with open("test.txt", "r") as f:
    words = f.read().split()
# Remove duplicate words and sort them alphabetically.
uniqWords = sorted(set(words))
for word in uniqWords:
    print(words.count(word), word)
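
The snippet above prints the words in alphabetical order, while the question asks for order by frequency. One possible adjustment, shown here on a hypothetical in-memory sample instead of `test.txt`, is to sort the unique words a second time using `words.count` as the key:

```python
# Hypothetical sample standing in for the contents of the file.
words = "the car the road car the".split()

# Alphabetical order first, so that ties stay alphabetical after the next sort.
uniq_words = sorted(set(words))
# Stable sort by descending count.
by_frequency = sorted(uniq_words, key=words.count, reverse=True)

for word in by_frequency:
    print(words.count(word), word)
```

Note that `words.count` rescans the whole list for every unique word, so this is best suited to small inputs.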

User

#2 · edited 2024-05-13 18:59:26

You can use

from collections import Counter

It is available in Python 2.7 and later; see the Python documentation for collections.Counter for more information.

1.

>>> c = Counter('abracadabra')
>>> c.most_common(3)
[('a', 5), ('b', 2), ('r', 2)]

Using a dict:

>>> d = {1: 'one', 2: 'one', 3: 'two'}
>>> c = Counter(d.values())
>>> c.most_common()
[('one', 2), ('two', 1)]

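However, you have to read the file first and convert its contents into a dict (or any other iterable of words) before counting.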

2. This is the example from the Python documentation, using re and Counter:

# Find the ten most common words in Hamlet
>>> import re
>>> words = re.findall(r'\w+', open('hamlet.txt').read().lower())
>>> Counter(words).most_common(10)
[('the', 1143), ('and', 966), ('to', 762), ('of', 669), ('i', 631),
 ('you', 554),  ('a', 546), ('my', 514), ('hamlet', 471), ('in', 451)]

User

#3 · edited 2024-05-13 18:59:26

Use this:

from collections import Counter
list1=['apple','egg','apple','banana','egg','apple']
counts = Counter(list1)
print(counts)
# Counter({'apple': 3, 'egg': 2, 'banana': 1})
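
Printing the Counter shows the counts, but the question also asks for the words sorted by frequency. A minimal sketch of one way to get that as a list, reusing `list1` from above; the names `pairs`, `words_by_frequency` and `tie_broken` are illustrative only:

```python
from collections import Counter

list1 = ['apple', 'egg', 'apple', 'banana', 'egg', 'apple']
counts = Counter(list1)

# most_common() returns (word, count) pairs, highest count first.
pairs = counts.most_common()
# Just the words, ordered by descending frequency.
words_by_frequency = [word for word, _ in pairs]
# Optional: break ties alphabetically rather than by first appearance.
tie_broken = sorted(counts.items(), key=lambda item: (-item[1], item[0]))

print(pairs)
print(words_by_frequency)
```

Sorting on the tuple `(-count, word)` is a design choice for cases where a deterministic alphabetical order among equally frequent words matters.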
