How can I duplicate groups of rows in Pandas based on a number in a column?

Input:

+------------------------------+
| aid | bid | x1  | x2 | count |
+------------------------------+
| 1   | 1   | tim | 6  | 3     |
| 1   | 2   | tim | 6  | 3     |
| 1   | 3   | tim | 6  | 3     |
| 2   | 1   | bob | 6  | 2     |
| 2   | 2   | bob | 6  | 2     |
| 2   | 3   | bob | 6  | 2     |
+------------------------------+

Desired output (each aid group repeated count times):

+------------------------------+
| aid | bid | x1  | x2 | count |
+------------------------------+
| 1   | 1   | tim | 6  | 3     |
| 1   | 2   | tim | 6  | 3     |
| 1   | 3   | tim | 6  | 3     |
| 1   | 1   | tim | 6  | 3     |
| 1   | 2   | tim | 6  | 3     |
| 1   | 3   | tim | 6  | 3     |
| 1   | 1   | tim | 6  | 3     |
| 1   | 2   | tim | 6  | 3     |
| 1   | 3   | tim | 6  | 3     |
| 2   | 1   | bob | 6  | 2     |
| 2   | 2   | bob | 6  | 2     |
| 2   | 3   | bob | 6  | 2     |
| 2   | 1   | bob | 6  | 2     |
| 2   | 2   | bob | 6  | 2     |
| 2   | 3   | bob | 6  | 2     |
+------------------------------+

import pandas as pd

# 'aid' and 'bid' need nine entries to match the other columns and the output below
df = pd.DataFrame({'aid': [1,1,1,2,2,2,3,3,3],
                   'bid': [1,2,3,1,2,3,1,2,3],
                   'x1': ['tim']*3 + ['bob']*3 + ['ray']*3,
                   'x2': [1,0,0,0,1,0,0,0,1],
                   'count': [3,3,3,2,2,2,4,4,4]})[['aid', 'bid', 'x1', 'x2', 'count']]

   aid  bid   x1  x2  count
0    1    1  tim   1      3
1    1    2  tim   0      3
2    1    3  tim   0      3
3    2    1  bob   0      2
4    2    2  bob   1      2
5    2    3  bob   0      2
6    3    1  ray   0      4
7    3    2  ray   0      4
8    3    3  ray   1      4

pd.concat([frame
           for count, frame in df.groupby('count', as_index=False, sort=False)
           for _ in range(count)]).sort_values('aid').reset_index(drop=True)

    aid  bid   x1  x2  count
0     1    1  tim   1      3
1     1    2  tim   0      3
2     1    3  tim   0      3
3     1    1  tim   1      3
4     1    2  tim   0      3
5     1    3  tim   0      3
6     1    1  tim   1      3
7     1    2  tim   0      3
8     1    3  tim   0      3
9     2    3  bob   0      2
10    2    1  bob   0      2
11    2    2  bob   1      2
12    2    2  bob   1      2
13    2    1  bob   0      2
14    2    3  bob   0      2
15    3    2  ray   0      4
16    3    1  ray   0      4
17    3    2  ray   0      4
18    3    3  ray   1      4
19    3    1  ray   0      4
20    3    2  ray   0      4
21    3    3  ray   1      4
22    3    1  ray   0      4
23    3    2  ray   0      4
24    3    3  ray   1      4
25    3    1  ray   0      4
26    3    3  ray   1      4

2 Answers

User

#1 · Edited 2024-04-20 12:49:10

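# DataFrame.append was removed in pandas 2.0, so collect the pieces and concat once
pieces = []
for n, fr in df.groupby('count'):
    pieces.extend([fr] * n)  # repeat each group n times
out = pd.concat(pieces)

Sorting by aid (mergesort is stable, so each block keeps its row order) gives:

In [5]: out.sort_values('aid', kind='mergesort')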
Out[5]: 
   aid  bid   x1  x2  count
0    1    1  tim   6      3
1    1    2  tim   6      3
2    1    3  tim   6      3
0    1    1  tim   6      3
1    1    2  tim   6      3
2    1    3  tim   6      3
0    1    1  tim   6      3
1    1    2  tim   6      3
2    1    3  tim   6      3
3    2    1  bob   6      2
4    2    2  bob   6      2
5    2    3  bob   6      2
3    2    1  bob   6      2
4    2    2  bob   6      2
5    2    3  bob   6      2

User

#2 · Edited 2024-04-20 12:49:10

You can use a list comprehension with pd.concat:

df = pd.DataFrame({'aid': [1,1,1,2,2,2], 'bid': [1,2,3,1,2,3], 'x1': ['tim']*3 + ['bob']*3, 'x2': [6]*6, 'count': [3,3,3,2,2,2]})[['aid', 'bid', 'x1', 'x2', 'count']]

>>> pd.concat([frame 
               for count, frame in df.groupby('count', as_index=False, sort=False) 
               for _ in range(count)]).sort_values('aid').reset_index(drop=True)
    aid  bid   x1  x2  count
0     1    1  tim   6      3
1     1    2  tim   6      3
2     1    3  tim   6      3
3     1    1  tim   6      3
4     1    2  tim   6      3
5     1    3  tim   6      3
6     1    1  tim   6      3
7     1    2  tim   6      3
8     1    3  tim   6      3
9     2    1  bob   6      2
10    2    2  bob   6      2
11    2    3  bob   6      2
12    2    1  bob   6      2
13    2    2  bob   6      2
14    2    3  bob   6      2
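
Note that sort_values uses quicksort by default, which is not a stable sort, so on larger frames the rows inside each repeated block can come out shuffled, as in the 27-row output in the question. Below is a minimal sketch that avoids the sort entirely. It assumes count is constant within each aid group and that the rows of each aid are the unit to repeat; the names pieces and result are only illustrative.

```python
import pandas as pd

# Nine-row example from the question (three aid groups).
df = pd.DataFrame({
    'aid': [1, 1, 1, 2, 2, 2, 3, 3, 3],
    'bid': [1, 2, 3] * 3,
    'x1': ['tim'] * 3 + ['bob'] * 3 + ['ray'] * 3,
    'x2': [1, 0, 0, 0, 1, 0, 0, 0, 1],
    'count': [3, 3, 3, 2, 2, 2, 4, 4, 4],
})

# Group by 'aid' in first-seen order and repeat each block 'count' times.
# Assumption: 'count' is the same for every row of a given 'aid',
# so the first row's value is representative.
pieces = [
    frame
    for _, frame in df.groupby('aid', sort=False)
    for _ in range(int(frame['count'].iloc[0]))
]

# No sort afterwards, so the row order inside every block is preserved.
result = pd.concat(pieces, ignore_index=True)
print(result)
```

Grouping by aid rather than count also keeps two different aid groups that happen to share the same count from being merged into one block.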
