从.npy fi生成pandas数据帧

import pandas as pd import numpy as np import os targetdir = '../test_dir/' filenames = [] successful = [] unsuccessful = [] for dirs, subdirs, files in os.walk(targetdir): for name in files: filenames.append(name) path_to_use = os.path.join(dirs, name) if path_to_use.endswith('.npy'): try: file_dict = np.load(path_to_use).item() df = pd.from_dict(file_dict) #df = pd.read_pickle(path_to_use) successful.append(path_to_use) except: unsuccessful.append(path_to_use) continue print str(len(successful)) + " files were loaded successfully!" print "The following files were not loaded:" for item in unsuccessful: print item + "\n" print df

1条回答

网友

1楼 · 发布于 2024-06-12 17:49:37

假设加载.npy后，项（np.load(path_to_use).item()）看起来与此类似

{'user_c': 'id_003', 'user_a': 'id_001', 'user_b': 'id_002'}

所以，如果你需要使用上面的字典来产生如下的数据帧

  user_name user_id
0    user_c  id_003
1    user_a  id_001
2    user_b  id_002

你可以使用

df = pd.DataFrame(list(x.item().iteritems()), columns=['user_name','user_id'])

如果你有下面这样的字典列表

users = [{'u_name': 'user_a', 'u_id': 'id_001'}, {'u_name': 'user_b', 'u_id': 'id_002'}]

你可以简单地使用

df = pd.DataFrame(users)

提出一个类似的数据帧

     u_id  u_name
0  id_001  user_a
1  id_002  user_b

好像你有一本类似的字典

data = {
    'Center': [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]],
    'Vpeak': [1.1, 2.2],
    'ID': ['id_001', 'id_002']
}

在这种情况下，您可以简单地使用

df = pd.DataFrame(data)  # df = pd.DataFrame(file_dict.item()) in your case

提出一个类似的数据帧

    Center          ID      Vpeak
0   [0.1, 0.2, 0.3] id_001  1.1
1   [0.4, 0.5, 0.6] id_002  2.2

如果dict中有ndarray，请执行与下面类似的预处理；并使用它创建df

for key in data:
    if isinstance(data[key], np.ndarray):
        data[key] = data[key].tolist()

df = pd.DataFrame(data)

相关问题更多 >

编程相关推荐

热门问题

热门文章