如何在Python中从列表中删除重复的词典？

[ { "ID" : "0001", "Organization" : "SolarUSA", "Matchcode" : "SolarUSA, Something Street, Somewhere State, Whatev Zip", "Owner" : "Timothy Black", }, { "ID" : "0002", "Organization" : "SolarUSA", "Matchcode" : "SolarUSA, Something Street, Somewhere State, Whatev Zip", "Owner" : "Johen Wilheim", }, { "ID" : "0003", "Organization" : "Zapotec", "Matchcode" : "Zapotec, Something Street, Somewhere State, Whatev Zip", "Owner" : "Simeon Yurrigan", } ]

3条回答

网友

1楼 · 编辑于 2024-04-28 05:40:39

使用itertools.groupby()按键值对词典进行分组，然后从每个组中获取第一个项。

import itertools

data =[ {
    "ID" : "0001",
    "Organization" : "SolarUSA",
    "Matchcode" : "SolarUSA, Something Street, Somewhere State, Whatev Zip",
    "Owner" : "Timothy Black",
   }, {
    "ID" : "0002",
    "Organization" : "SolarUSA",
    "Matchcode" : "SolarUSA, Something Street, Somewhere State, Whatev Zip",
    "Owner" : "Johen Wilheim",
   }, {
    "ID" : "0003",
    "Organization" : "Zapotec",
    "Matchcode" : "Zapotec, Something Street, Somewhere State, Whatev Zip",
    "Owner" : "Simeon Yurrigan",
   } ]


print [g.next() for k,g in itertools.groupby(data, lambda x: x['Matchcode'])]

给出结果

[{'Owner': 'Timothy Black',  
  'Organization': 'SolarUSA', 
  'ID': '0001',  
  'Matchcode': 'SolarUSA, Something Street, Somewhere State, Whatev Zip'},

 {'Owner': 'Simeon Yurrigan', 
  'Organization': 'Zapotec', 
  'ID': '0003', 
  'Matchcode':'Zapotec, Something Street, Somewhere State, Whatev Zip'}]

我相信这就是你要找的。

编辑：我更喜欢独特的解决方案。它更短更具描述性。

网友

2楼 · 编辑于 2024-04-28 05:40:39

对于现在已消除歧义的问题，此答案不正确。

所有的听写都有相同的键吗？如果是的话，编写一个函数
```
the_keys = ["foo", "bar"]
def as_values(d):
    return tuple(d[k] for k in the_keys)

unique_values = unique_everseen(list_of_dicts, key=as_values)
```
其中unique_everseen定义于http://docs.python.org/2/library/itertools.html
如果dict不那么一致，请使用更通用的键，例如我发布到https://stackoverflow.com/a/2704866/192839的FrozenDict

网友

3楼 · 编辑于 2024-04-28 05:40:39

正如可以使用tuple来获得与list等价的哈希表一样，也可以使用frozenset来获得与dict等价的哈希表。唯一的技巧是需要将d.items()而不是d传递给构造函数。

>>> d = {'a': 1, 'b': 2}
>>> s = frozenset(d.items())
>>> hash(s)
-7588994739874264648
>>> dict(s) == d
True

然后你可以使用你最喜欢的解决方案，你已经看到。将它们转储到set中，或者如果需要保留顺序，则使用OrderedSet或unique_everseen配方。例如：

>>> unique_sets = set(frozenset(d.items()) for d in list_of_dicts)
>>> unique_dicts = [dict(s) for s in unique_sets]

或者，保持顺序并使用键值：

>>> sets = (frozenset(d.items()) for d in list_of_dicts)
>>> unique_sets = unique_everseen(sets, key=operator.itemgetter(key))
>>> unique_dicts = [dict(s) for s in unique_sets]

当然，如果列表或dict嵌套在其中，则必须递归转换，就像对列表列表的转换一样。

相关问题更多 >

编程相关推荐

热门问题

热门文章