Python数据帧填充不存在

df1 Client NumberOfProducts ID A 1 2 A 5 1 B 1 2 B 6 1 C 9 1 C 9 0 D 2.5 NaN D 2.5 NaN

1条回答

网友

1楼 · 发布于 2024-04-20 10:33:49

用途：

clients = ['A','B','C','D']
N = 2

#test only values from list and also filter only 2 rows for each client if necessary
df = df[df['Client'].isin(clients)].groupby('Client').head(N)

#create helper counter and reshape by unstack
df1 = df.set_index(['Client',df.groupby('Client').cumcount()]).unstack()
#set first if only 1 row per client - replace second NumberOfProducts by first 
df1[('NumberOfProducts',1)] = df1[('NumberOfProducts',1)].fillna(df1[('NumberOfProducts',0)])
# ... replace second ID by first subtracted by 1
df1[('ID',1)] = df1[('ID',1)].fillna(df1[('ID',0)] - 1)
#add missing clients by reindex
df1 = df1.reindex(clients)
#replace NumberOfProducts by constant 2.5
df1['NumberOfProducts'] = df1['NumberOfProducts'].fillna(2.5)
print (df1)
       NumberOfProducts        ID     
                      0    1    0    1
Client                                
A                   1.0  5.0  2.0  1.0
B                   1.0  6.0  2.0  1.0
C                   9.0  9.0  1.0  0.0
D                   2.5  2.5  NaN  NaN

#last reshape to original
df2 = df1.stack().reset_index(level=1, drop=True).reset_index()
print (df2)
  Client  NumberOfProducts   ID
0      A               1.0  2.0
1      A               5.0  1.0
2      B               1.0  2.0
3      B               6.0  1.0
4      C               9.0  1.0
5      C               9.0  0.0
6      D               2.5  NaN
7      D               2.5  NaN

相关问题更多 >

编程相关推荐

热门问题

热门文章

Python数据帧填充不存在

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >