使用相应的过去索引值和特定的唯一列值填充新的数据帧列

index col1 teststat col2 0 a 32.0 32 1 a 32.0 432 2 b 433.0 433 3 c 4.0 4 4 a 432.0 56 5 c 4.0 64 6 a 56.0 4 7 b 433.0 535 8 c 64.0 643 9 c 643.0 356 10 b 535.0 32 11 b 32.0 535 12 a 4.0 34

for vals in list(df['col1'].unique()): if vals=='a': idxa = df.index[df['col1']=='a'] if vals=='b': idxb = df.index[df['col1']=='b'] if vals=='c': idxc = df.index[df['col1']=='c']

for i in range(len(idxa)): if i==0: df.loc[idxa[i],'test_stat']=df.loc[idxa[i],'col2'] else: df.loc[idxa[i],'test_stat']=df.loc[idxa[i-1],'col2'] for i in range(len(idxb)): if i==0: df.loc[idxb[i],'test_stat']=df.loc[idxb[i],'col2'] else: df.loc[idxb[i],'test_stat']=df.loc[idxb[i-1],'col2'] for i in range(len(idxc)): if i==0: df.loc[idxc[i],'test_stat']=df.loc[idxc[i],'col2'] else: df.loc[idxc[i],'test_stat']=df.loc[idxc[i-1],'col2']

1条回答

网友

1楼 · 发布于 2024-06-06 11:25:52

一种方法是将groupby与shift一起使用。你知道吗

df['teststat'] = df.groupby('col1')['col2'].shift(1).fillna(df['col2'])

print(df[['col1', 'teststat', 'col2']])

    col1    teststat    col2
0      a        32.0      32
1      a        32.0     432
2      b       433.0     433
3      c         4.0       4
4      a       432.0      56
5      c         4.0      64
6      a        56.0       4
7      b       433.0     535
8      c        64.0     643
9      c       643.0     356
10     b       535.0      32
11     b        32.0     535
12     a         4.0      34

编辑

对于您的附加问题：

Let's say, i want another column 'teststat2' which gives the difference between last 2 values for a particular value in 'col1'.

你可以做以下的事情。你知道吗

df['teststat2'] = df['col2'] - df['teststat']
df.loc[df['teststat2'] == 0, 'teststat2'] = df['col2']
print(df)

    col1    teststat    col2    teststat2
0      a        32.0      32         32.0
1      a        32.0     432        400.0
2      b       433.0     433        433.0
3      c         4.0       4          4.0
4      a       432.0      56       -376.0
5      c         4.0      64         60.0
6      a        56.0       4        -52.0
7      b       433.0     535        102.0
8      c        64.0     643        579.0
9      c       643.0     356       -287.0
10     b       535.0      32       -503.0
11     b        32.0     535        503.0
12     a         4.0      34         30.0

相关问题更多 >

编程相关推荐

热门问题

热门文章