如何从一个文件创建twoleveldictionary？

3条回答

网友

1楼 · 编辑于 2024-06-16 14:13:10

一种方法是使用pandas，如果需要使用表格数据，这是一个好主意：

>>> import pandas as pd
>>> df = pd.read_csv('path/to/your.csv', delimiter=';', index_col='country')
>>> df.to_dict()
{'company1': {'finland': 30, 'sweden': 20, 'norway': 10},
 'company2': {'finland': 30, 'sweden': 30, 'norway': 20},
 'company3': {'finland': 40, 'sweden': 50, 'norway': 70}}

网友

2楼 · 编辑于 2024-06-16 14:13:10

@fsimonjetz的answer非常棒，如果你已经在这个项目中与熊猫合作。如果您没有这样做，那么仅将其用于此任务是一种巨大的过度使用，因为我们可以用简单的逻辑解析和转换数据

import csv

from collections import defaultdict

output = defaultdict(dict)

with open('path/to/your.csv') as f:
    reader = csv.DictReader(f, delimiter=';')
    companies = reader.fieldnames[1:]
    for line in reader:
        country = line['country']
        for company in companies:
            output[company][country] = line[company]
            # or output.setdefault(company, {})[country] = line[company]
            # if you want 'output' to be a "normal" dict instead of defaultdict

print(dict(output))  # or just print(output) if you don't mind seeing OrderedDict
                     # repr

输出

{'company1': {'finland': '30', 'sweden': '20', 'norway': '10'}, 
 'company2': {'finland': '30', 'sweden': '30', 'norway': '20'}, 
 'company3': {'finland': '40', 'sweden': '50', 'norway': '70'}}

网友

3楼 · 编辑于 2024-06-16 14:13:10

我认为使用OrderedICT会有很大帮助。你可以用类似的方式来做：

import csv
from collections import OrderedDict

with open('file.csv') as f:
    reader = csv.reader(f, delimiter=';')
    list_companies = next(reader)  # ['country', 'company1', 'company2', ...]
    companies_dict = OrderedDict()
    for company in list_companies[1:]:  # We forget about 'country'
        companies_dict[company] = {}  # We initialize the companies' dicts in order
    for country_values in reader:  # For every line after the first one
        country = country_values[0]  # We get the country at the beginning of every line
        for countries_dict, value in zip(companies_dict.values(), country_values[1:]):
            countries_dict[country] = value  # And set the value for every company in order

    print(dict(companies_dict))
    # {'company1': {'finland': '30', 'sweden': '20', 'norway': '10'}, ...}

zip函数对您来说可能是新函数，它是一个生成器，基本上接受两个（或更多）iterable，并将相同位置的元素作为一个集合放在一起。例如，zip(['finland', 'sweden' , 'england'], [30, 30, 40]) == [('finland', 30), ('sweden', 30), ('england', 40)]

这可能并不完全符合你的目的，但我相信这是实现你想要的目标的一个足够好的方法

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何从一个文件创建twoleveldictionary？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >