给定特定条件时，对嵌套字典列表中的值求和

cars = [{'id': 2, 'car': {'car_make':'Acura', 'car_model':'TL', 'car_year':2005},'total_sales':589}, {'id': 30, 'car': {'car_make':'Acura', 'car_model':'TL', 'car_year':2004}, 'total_sales':167}, {'id': 31, 'car': {'car_make':'Acura', 'car_model':'Integra', 'car_year':2008}, 'total_sales':200}, {'id': 71, 'car': {'car_make':'BMW', 'car_model':'5 Series', 'car_year':2011},'total_sales':824}, {'id': 72, 'car': {'car_make':'BMW', 'car_model':'5 Series', 'car_year':2001}, 'total_sales':6}]

total_sales = {{'car_make': 'Acura', 'car_model': 'TL', 'total_sales': 756}, {'car_make': 'Acura', 'car_model': 'Integra', 'total_sales': 200}, {'car_make': 'BMW', 'car_model': '5 Series', 'total_sales': 830}}

total_sales = {'car_make':{}} for car in cars: if car['car']['car_make'] in total_sales['car_make'] and car['car']['car_model'] in total_sales['car_model']: total_sales['total_sales'] = total_sales['total_sales'] + car['total_sales'] else: total_sales['car_make'] = car['car']['car_make'] total_sales['car_model'] = car['car']['car_model'] total_sales['total_sales'] = car['total_sales'] print(total_sales)

{'car_make': 'Acura', 'car_model': 'TL', 'total_sales': 589} {'car_make': 'Acura', 'car_model': 'TL', 'total_sales': 756} {'car_make': 'Acura', 'car_model': 'Integra', 'total_sales': 200} {'car_make': 'BMW', 'car_model': '5 Series', 'total_sales': 824} {'car_make': 'BMW', 'car_model': '5 Series', 'total_sales': 830}

3条回答

网友

1楼 · 编辑于 2024-06-16 08:32:10

您可以使用collections.defaultdict：

cars = [{'id': 2, 'car': {'car_make': 'Acura', 'car_model': 'TL', 'car_year': 2005}, 'total_sales': 589}, {'id': 30, 'car': {'car_make': 'Acura', 'car_model': 'TL', 'car_year': 2004}, 'total_sales': 167}, {'id': 31, 'car': {'car_make': 'Acura', 'car_model': 'Integra', 'car_year': 2008}, 'total_sales': 200}, {'id': 71, 'car': {'car_make': 'BMW', 'car_model': '5 Series', 'car_year': 2011}, 'total_sales': 824}, {'id': 72, 'car': {'car_make': 'BMW', 'car_model': '5 Series', 'car_year': 2001}, 'total_sales': 6}]
from collections import defaultdict
d = defaultdict(int)
for i in cars:
   d[(i['car']['car_make'], i['car']['car_model'])] += i['total_sales']

r = [{'car_make':a, 'car_model':b, 'total_sales':c} for (a, b), c in d.items()]

输出：

[{'car_make': 'Acura', 'car_model': 'TL', 'total_sales': 756}, 
 {'car_make': 'Acura', 'car_model': 'Integra', 'total_sales': 200}, 
 {'car_make': 'BMW', 'car_model': '5 Series', 'total_sales': 830}]

网友

2楼 · 编辑于 2024-06-16 08:32:10

根据你的字典变得有多疯狂，以及你想获得这种类型的统计信息的粒度，你可能想研究重构你的结构，以使用pandas，这为数据操作、切片打开了更多的大门，然而对于初学者来说，它确实有一点学习曲线，所以我将避免通常的建议“嘿，使用这个库”。如果您感兴趣，可以阅读一些文档，https://pandas.pydata.org/docs/

要直接回答您的问题，您可能需要重新构建total sales字典。您可以使用一个“total_sales_by_make_and_model”字典，然后使用cars列表中列出的make键作为该字典的键--这会使它更清晰一些，但是我不确定这与您的整体“业务”逻辑有什么关系

我还建议将代码扩展一点，以将键名存储在变量中，至少在看到求和出错的地方之前是这样

cars = [
    {'id': 2, 'car': {'car_make':'Acura', 'car_model':'TL', 'car_year':2005},'total_sales':589},
    {'id': 30, 'car': {'car_make':'Acura', 'car_model':'TL', 'car_year':2004}, 'total_sales':167},
    {'id': 31, 'car': {'car_make':'Acura', 'car_model':'Integra', 'car_year':2008}, 'total_sales':200},
    {'id': 71, 'car': {'car_make':'BMW', 'car_model':'5 Series', 'car_year':2011},'total_sales':824},
    {'id': 72, 'car': {'car_make':'BMW', 'car_model':'5 Series', 'car_year':2001}, 'total_sales':6}
]

total_sales_by_make_and_model = {}

for car in cars:
    car_make = car['car']['car_make']
    car_model = car['car']['car_model']
    total = car['total_sales']

    # if the make key doesn't exist, create it
    if car_make not in total_sales_by_make_and_model:
        total_sales_by_make_and_model[car_make] = {}

    # use get to set the value to "total" if the key doesn't exist
    # otherwise this will increment the current value by "total"
    total_sales_by_make_and_model[car_make][car_model] = total_sales_by_make_and_model[car_make].get(car_model, 0) + total

此结果将使您能够灵活地在make键中按模型进行查询。例如：

print(total_sales_by_make_and_model['Acura'])
print(total_sales_by_make_and_model['BMW']['5 Series'])

输出：

{'TL': 756, 'Integra': 200}
830

如果您想获得每个品牌的总销售额，可以使用一个简单的循环遍历每个键并对值求和：

for make in total_sales_by_make_and_model:
    total = sum(total_sales_by_make_and_model[make].values())
    print(f"Total sales for '{make}': {total}")

输出：

Total sales for 'Acura': 956
Total sales for 'BMW': 830

编辑：必须修复几个给出错误金额的主要打字错误：）

网友

3楼 · 编辑于 2024-06-16 08:32:10

以下是一种方法：

models=set([(i['car']['car_make'], i['car']['car_model']) for i in cars])

d={i:0 for i in models}

for i in cars:
    d[(i['car']['car_make'], i['car']['car_model'])]+=i['total_sales']
    
total_sales=[{'car_make':key[0], 'car_model':key[1], 'total_sales':val} for key, val in d.items()]

print(total_sales)

输出

[{'car_make': 'BMW', 'car_model': '5 Series', 'total_sales': 830}, {'car_make': 'Acura', 'car_model': 'TL', 'total_sales': 756}, {'car_make': 'Acura', 'car_model': 'Integra', 'total_sales': 200}]

相关问题更多 >

编程相关推荐

热门问题

热门文章