Why do I get the error "ValueError: cannot reindex from a duplicate axis"?

path1 = '/home/Desktop/computed_2d_blaze/'
path2 = '/home/Desktop/computed_1d/'
path3 = '/home/Desktop/sn_airmass_seeing/'

dir1 = [x for x in os.listdir(path1) if '.ares' in x]
dir2 = [x for x in os.listdir(path2) if '.ares' in x]
dir3 = [x for x in os.listdir(path3) if '.ares' in x]

lst = []
lst1 = []
for file1, file2, file3 in zip(dir1, dir2, dir3):
    df1 = pd.read_table(path1+file1, skiprows=0, usecols=(0,1,2,3,4,8),
                        names=['wave','num','stlines','fwhm','EWs','MeasredWave'],
                        delimiter=r'\s+')
    df2 = pd.read_table(path2+file2, skiprows=0, usecols=(0,1,2,3,4,8),
                        names=['wave','num','stlines','fwhm','EWs','MeasredWave'],
                        delimiter=r'\s+')
    df1 = df1.groupby('wave').mean().reset_index()
    df1 = df1.sort_values('wave').reset_index(drop=True)
    df2 = df2.sort_values('wave').reset_index(drop=True)
    dfs = pd.merge(df1, df2, on='wave', how='inner')
    dfs['delta_ew'] = (dfs.EWs_x - dfs.EWs_y)
    dfs = dfs.filter(items=['wave','delta_ew'])
    lst.append(dfs)
    df3 = pd.read_table(path3+file3, skiprows=0, usecols=(0,1,2),
                        names=['seeing','airmass','snr'], delimiter=r'\s+')
    lst1.append(df3)

[df.set_index('wave', inplace=True) for df in lst]
df = pd.concat(lst, axis=1, join='inner')
x = pd.concat(lst1, axis=1, join='inner')

for z in df.index:
    t = x.loc[0, 'airmass']
    s = df.loc[z, 'delta_ew']
    dfs = pd.concat([s,t], axis=1, names=['delta_ew','airmass'])
    dfs = dfs[np.abs(dfs.delta_ew - dfs.delta_ews.mean()) <= (dfs.delta_ews.mad())]

2 Answers

User

Answer 1 · edited 2024-06-06 15:24:27

This error is usually raised when you join on, or assign to, a column while the index contains duplicate values.
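To make the cause concrete, here is a minimal sketch with made-up values (the numbers and the Series `s` are hypothetical, not taken from the question's files). Selecting a repeated column name from one row yields a Series in which every entry carries the same label, and any operation that has to reindex such a Series fails. The exact wording of the message varies between pandas versions.

```python
import pandas as pd

# Hypothetical data: every entry shares the label 'delta_ew', which is
# what picking a repeated column name out of a single row produces.
s = pd.Series([0.10, 0.25, 0.40], index=['delta_ew'] * 3)

# Two quick checks for duplicate labels before joining or assigning.
has_unique_index = s.index.is_unique
duplicate_count = int(s.index.duplicated().sum())
print(has_unique_index, duplicate_count)

# Reindexing against a duplicated axis is refused with a ValueError.
try:
    s.reindex(['delta_ew'])
    raised = False
except ValueError as exc:
    raised = True
    print(type(exc).__name__, exc)
```

Checking `index.is_unique` on each object before a `concat` or `merge` is usually enough to spot this situation early.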

The error is raised by the line dfs = pd.concat([s,t],axis=1,names=['delta_ew','airmass']). I believe I have found a solution to your problem: just add ignore_index=True to the concat call.

Like this:

dfs = pd.concat([s, t], axis=1, names=['delta_ew', 'airmass'], ignore_index=True)

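This re-creates the index along the concatenation axis. With axis=1 that axis is the columns, so the resulting columns are labeled 0, 1, ...; rows are still aligned on each object's own index, so duplicate row labels may also have to be reset before concatenating.

Note: "index" here refers to both the row labels and the column names.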
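As a hedged illustration of resetting the row labels, the sketch below uses hypothetical stand-ins for `s` and `t` (the values are invented). It drops the duplicated labels with `reset_index(drop=True)` so both Series share a clean 0..n-1 index, then names the resulting columns through `keys`, since `names` in `pd.concat` labels the levels of a hierarchical index rather than the columns.

```python
import pandas as pd

# Hypothetical stand-ins for the two Series from the question.
s = pd.Series([0.10, 0.25, 0.40], index=['delta_ew'] * 3)
t = pd.Series([1.05, 1.20, 1.35], index=['airmass'] * 3)

# Drop the duplicated labels so rows are paired by position,
# then use keys= to name the two resulting columns.
pairs = pd.concat(
    [s.reset_index(drop=True), t.reset_index(drop=True)],
    axis=1,
    keys=['delta_ew', 'airmass'],
)
print(pairs)
```

This pairs the values purely by position, so it assumes both Series are already in the same order and have the same length.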

User

Answer 2 · edited 2024-06-06 15:24:27

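I finally solved the problem. I used a dictionary instead of concat, because what I was struggling with was merging two pandas Series into a new DataFrame. I first convert the values of the Series t and s into a dictionary, then turn the dictionary into a DataFrame, and that works fine for me.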

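for z in df.index:
    # .values returns plain arrays, discarding the duplicated labels
    t = x.loc[0, 'airmass'].values
    s = df.loc[z, 'delta_ew'].values
    # pair each delta_ew with its airmass; equal delta_ew values share one key
    dic = dict(zip(s, t))
    q = pd.DataFrame(list(dic.items()), columns=['ew', 'airmass'])
    # keep rows within one mean absolute deviation of the mean
    # (Series.mad() was removed in pandas 2.0, so it is computed explicitly)
    dev = (q.ew - q.ew.mean()).abs()
    q = q[dev <= dev.mean()]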
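One caveat with the dictionary route is that rows with equal `ew` values collapse into a single key. A possible variant, sketched here under the assumption that the two arrays have equal length and matching order, builds the DataFrame directly from the arrays instead. The helper name `pair_and_clip` and the sample numbers are hypothetical.

```python
import numpy as np
import pandas as pd

def pair_and_clip(ew, airmass):
    """Pair two equal-length arrays by position and keep the rows whose
    ew lies within one mean absolute deviation of the mean ew."""
    q = pd.DataFrame({
        'ew': np.asarray(ew, dtype=float),
        'airmass': np.asarray(airmass, dtype=float),
    })
    dev = (q['ew'] - q['ew'].mean()).abs()
    return q[dev <= dev.mean()]

# Hypothetical values: two rows share ew == 0.1 and both survive,
# while the outlier at 5.0 is filtered out.
ew = [0.1, 0.1, 0.2, 5.0]
airmass = [1.0, 1.1, 1.2, 1.3]
kept = pair_and_clip(ew, airmass)
print(kept)
```

Because no dictionary is involved, repeated `ew` values keep their own `airmass` entries.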
