在python中解析json数据时如何解析多个索引值并创建csv文件

"EmployeeId","type","KeyColumn","Start","End","Country","Target","CountryId","TargetId" "Emp1","Metal","1212121212","2000-06-17","9999-12-31","","AMAZON","1","" "Emp1","Metal","1212121212","2000-06-17","9999-12-31","","FLIPKART","2",""

for i in range(len(json_file['enty'])): temp = {} temp['EmployeeId'] = json_file['enty'][i]['id'] temp['type'] = json_file['enty'][i]['type'] for key in json_file['enty'][i]['data']['attributes'].keys(): try: temp[key] = json_file['enty'][i]['data']['attributes'][key]['values'][0]['value'] except: temp[key] = None for key in json_file['enty'][i]['data']['attributes'].keys(): if(key == 'Employee'): for j in range(len(json_file['enty'][i]['data']['attributes']['Employee']['group'])): for key in json_file['enty'][i]['data']['attributes']['Employee']['group'][j].keys(): try: temp[key] = json_file['enty'][i]['data']['attributes']['Employee']['group'][j][key]['values'][0]['value'] except: temp[key] = None temp_df = pd.DataFrame([temp]) df = pd.concat([df, temp_df], sort=True) # Rearranging columns df = df[['EmployeeId', 'type'] + [col for col in df.columns if col not in ['EmployeeId', 'type']]] # Writing the dataset df[columns_list].to_csv("Test22.csv", index=False, quotechar='"', quoting=1)

1条回答

网友

1楼 · 发布于 2024-04-25 14:48:52

修改代码、添加一个循环、更改索引以及修改range参数的长期解决方案：

df = pd.DataFrame()

num = max([len(v) for k,v in json_file['data'][0]['data1'].items()])
for i in range(num):
    temp = {}
    temp['Empid'] = json_file['data'][0]['Empid']
    temp['Empname'] = json_file['data'][0]['Empname']
    for key in json_file['data'][0]['data1'].keys():
        if key not in temp:
            temp[key] = []
        try:
            for j in range(len(json_file['data'][0]['data1'][key])):
                temp[key].append(json_file['data'][0]['data1'][key][j]['relative']['id']) 
        except:
            temp[key] = None                    
    temp_df = pd.DataFrame([temp])
    df = pd.concat([df, temp_df],ignore_index=True)
for i in json_file['data'][0]['data1'].keys():
    df[i] = pd.Series([x for y in df[i].tolist() for x in y]).drop_duplicates()

现在：

print(df)

是：

  Empid Empname    XXXX   YYYYY
0  1234     ABC  Naveen   Kumar
1  1234     ABC     NaN  Rajesh

相关问题更多 >

编程相关推荐

热门问题

热门文章