删除列表中符合条件的字典

data = [ { 'id': '16e26a4a9f97fa4f', 'received_on': '2019-11-01 11:05:51', 'customer_group': 'Life-time Buyer' }, { 'id': '16db0dd4a42673e2', 'received_on': '2019-10-09 14:12:29', 'customer_group': 'Lead' }, { 'id': '16db0dd4199f5897', 'received_on': '2019-10-09 14:12:29', 'customer_group': 'Lead' } ]

[ { 'id': '16e26a4a9f97fa4f', 'received_on': '2019-11-01 11:05:51', 'customer_group': 'Life-time Buyer' }, { 'id': '16db0dd4199f5897', 'received_on': '2019-10-09 14:12:29', 'customer_group': 'Lead' } ]

3条回答

网友

1楼 · 编辑于 2024-05-14 22:14:23

有个主意：

import random

data = [
    {
        'id': '16e26a4a9f97fa4f',
        'received_on': '2019-11-01 11:05:51',
        'customer_group': 'Life-time Buyer'
    },
    {
        'id': '16db0dd4a42673e2',
        'received_on': '2019-10-09 14:12:29',
        'customer_group': 'Lead'
    },
    {
        'id': '16db0dd4199f5897',
        'received_on': '2019-10-09 14:12:29',
        'customer_group': 'Lead'
    }
]


r_data = data.copy()
random.shuffle(r_data)
unique_data = {(elem['received_on'],elem['customer_group']):elem['id'] 
                for elem in data}
new_data = [{'id':val, 'received_on':key[0],'customer_group':key[1]} 
                for key,val in unique_data.items()]
new_data = sorted(new_data,key = lambda x:data.index(x)) #if you need sorted
print(new_data)

输出：

[{'id': '16e26a4a9f97fa4f', 'received_on': '2019-11-01 11:05:51', 'customer_group': 'Life-time Buyer'}, {'id': '16db0dd4199f5897', 'received_on': '2019-10-09 14:12:29', 'customer_group': 'Lead'}]

网友

2楼 · 编辑于 2024-05-14 22:14:23

这里有一种获取第一个唯一datetime的方法，如果您想要随机项，您可以像here中那样首先无序排列列表

data = [
    {
        'id': '16e26a4a9f97fa4f',
        'received_on': '2019-11-01 11:05:51',
        'customer_group': 'Life-time Buyer'
    },
    {
        'id': '16db0dd4a42673e2',
        'received_on': '2019-10-09 14:12:29',
        'customer_group': 'Lead'
    },
    {
        'id': '16db0dd4199f5897',
        'received_on': '2019-10-09 14:12:29',
        'customer_group': 'Lead'
    }
]

datetime = set()
result = []
for d in data:
    dt = d['received_on']
    if dt not in datetime:
        result.append(d)
        datetime.add(dt)
result

输出：

[{'id': '16e26a4a9f97fa4f',
  'received_on': '2019-11-01 11:05:51',
  'customer_group': 'Life-time Buyer'},
 {'id': '16db0dd4a42673e2',
  'received_on': '2019-10-09 14:12:29',
  'customer_group': 'Lead'}]

网友

3楼 · 编辑于 2024-05-14 22:14:23

利用上面的一些想法，我还想将customer_group作为received_on之外的另一个条件。我得到了预期的结果。你知道吗

conditions, result = [], []
for d in data:
    condition = (d['received_on'], d['customer_group'])
    if condition not in conditions:
        result.append(d)
        conditions.append(condition)
print(len(result))

相关问题更多 >

编程相关推荐

热门问题

热门文章