前K个常用词卡在一个部分

import heapq class Solution: # def topKFrequent(self, words: List[str], k: int) -> List[str]: def topKFrequent(self, words, k): results = [] wordTable = {} for word in words: if (wordTable.get(word) is None): wordTable[word] = 1 continue wordTable[word] = (wordTable.get(word)) + 1 heap = [] # print(wordTable) heapSize = 0 for word in wordTable.keys(): node = [wordTable[word], word] if(heapSize<k): heapq.heappush(heap,node) heapSize += 1 continue if(heapSize>=k): if (heap[0][0]< node[0]): heapq.heappushpop(heap,node) heapSize -= 1 continue if heap[0][0] == node[0] and heap[0][1]>node[1]: heapq.heappop(heap) heapq.heappush(heap,node) heapSize -= 1 continue # heap.sort(key = lambda x: x.freq, reverse=True); print(heap) for i in reversed(range(k)): results.append(heap[i][1]) return results

2条回答

网友

1楼 · 编辑于 2024-06-13 03:42:28

这一行应该对你有帮助

from collections import Counter

topk = lambda words, k: [t[0] for t in Counter(list(sorted(words))).most_common(k)]

print(topk(["i", "love", "leetcode", "i", "love", "coding"], k=2))
print(topk(["the", "day", "is", "sunny", "the", "the", "the", "sunny", "is", "is"], k=4))

# Output
['i', 'love']
['the', 'is', 'sunny', 'day']

第一步是使用 list(sorted(words))
计数器将list转换为频率。它是内置的 heapq
most_common(k)顾名思义，它给了你最大的帮助常用词。但请注意，我们已经对它们进行了排序按词典编纂
最后一个外部for循环只需使用第一个 most_common(k)函数返回的元组的值

网友

2楼 · 编辑于 2024-06-13 03:42:28

使用functools.cmp_to_key

from functools import cmp_to_key

def cmp(a, b):
    if a[0] == b[0]:
         return -1 if a[1] < b[1] else 1
    return -1 if a[0] > b[0] else 1
return sorted(heap, key=cmp_to_key(cmp))

相关问题更多 >

编程相关推荐

热门问题

热门文章