在python数据帧中划分多个列，其中分子和分母列都将根据picklis而变化

county _tcount _tvote _f_npb_18_count _f_npb_18_vote countycode 35 San Benito 28194 22335 2677 1741 36 San Bernardino 912653 661838 108724 61832 countycode _f_npb_30_count _f_npb_30_vote 35 384 288 36 76749 53013

[ county _tcount _tvote _f_npb_18_count _f_npb_18_vote \ countycode 35 NaN NaN NaN NaN NaN 36 NaN NaN NaN NaN NaN]

3条回答

网友

1楼 · 编辑于 2024-05-16 06:41:51

我的首选是通过设置索引并使用filter分开计数和投票数据帧来组织它。然后使用join

d1 = df.set_index('county', append=True)
counts = d1.filter(regex='.*_\d+_count$').rename(columns=lambda x: x.replace('_count', ''))
votes = d1.filter(regex='.*_\d+_vote$').rename(columns=lambda x: x.replace('_vote', ''))

d1[['_tcount', '_tvote']].join(votes / counts)

                           _tcount  _tvote  _f_npb_18  _f_npb_30
countycode county                                               
35         San Benito        28194   22335   0.650355   0.750000
36         San Bernardino   912653  661838   0.568706   0.690732

网友

2楼 · 编辑于 2024-05-16 06:41:51

像这样的东西怎么样

cols = my_df.columns
for i in range(2, 6):
    print(u'Creating new col %s', cols[i])
    my_df['new_{0}'.format(cols[i]) = my_df[cols[i]] / my_df[cols[i-1]

网友

3楼 · 编辑于 2024-05-16 06:41:51

我认为您可以除以由^{}创建的numpy arrays，因为这样就不会对齐列名。上次按构造函数新建DataFrame：

arr = county_select_frame.values
df1 = pd.DataFrame(arr[:,5::2] / arr[:,4::2], columns = county_select_frame.columns[5::2])

样品：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章