基于两个属性对字典列表进行深度筛选

entry1 = {'location': 'LOC1', 'pallet-id': '123456', ...} entry2 = {'location': 'LOC1', 'pallet-id': '123456', ...} entry3 = {'location': 'LOC1', 'pallet-id': '123456', ...} entry4 = {'location': 'LOC1', 'pallet-id': '123456', ...}

entry1 = {'location': 'LOC1', 'pallet-id': '5555', ...} entry2 = {'location': 'LOC1', 'pallet-id': '5555', ...} entry3 = {'location': 'LOC2', 'pallet-id': '5555', ...} entry4 = {'location': 'LOC1', 'pallet-id': '5555', ...}

1条回答

网友

1楼 · 发布于 2024-05-23 17:48:06

嵌套循环在这里是个坏主意，因为它会导致二次时间复杂性。不过，您可以在线性时间内完成：

from collections import Counter
from operator import itemgetter

pal = itemgetter('pallet-id')
pal_loc = itemgetter('pallet-id', 'location')

# unique pallet-id, location combos
pallocs = set(map(pal_loc, entries))
# set([('5555', 'LOC1'), ('5555', 'LOC2'), ('123456', 'LOC1')])

# count pallet-id occurrences in the unique combos
count = Counter(pl[0] for pl in pallocs) 
# Counter({'5555': 2, '123456': 1})

# filter the entries for pallet-ids with counts greater than 1
filtered_entries = [e for e in entries if count[pal(e)] > 1]

相关问题更多 >

编程相关推荐

热门问题

热门文章