Speeding up searches in a Pandas DataFrame

for names_A in name_list:
    for names_B in name_list:
        res = df.query('Source == "{}" & Target == "{}"'.format(names_A, names_B))
        if len(res.index.tolist()) > 0:
            res.to_csv('nets.csv', mode='a', header=False)

2 Answers

User

#1 · Edited 2024-06-16 14:25:49

IIUC (many thanks to @cᴏʟᴅsᴘᴇᴇᴅ and @Bharath for pointing out the mistake!):

res = df.loc[df['Source'].isin(name_list) & df['Target'].isin(name_list)]
res.to_csv(...)
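
To illustrate the filter above, here is a minimal sketch. The rows mirror the small table shown in the second answer, while the contents of name_list are an assumption made for illustration, since the question does not show them.

```python
import pandas as pd

# Sample rows matching the table in the second answer.
df = pd.DataFrame({
    "Source": ["a", "b", "c", "d", "e", "b"],
    "Target": ["z", "c", "x", "y", "b", "a"],
    "Interaction": ["physical"] * 6,
})
name_list = ["a", "b", "c"]  # assumed list of names to match

# Keep rows whose Source AND Target both appear in name_list.
mask = df["Source"].isin(name_list) & df["Target"].isin(name_list)
res = df.loc[mask]
print(res)
```

This builds one boolean mask over the whole frame instead of running one query per pair of names, so the result can be written with a single to_csv call.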


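User

#2 · Edited 2024-06-16 14:25:49

You're halfway there. Many thanks to MaxU, whose post I borrowed the data from.

Step 1
Indexing is a good option, but we only need to index the first two columns: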

df = df.set_index(['Source', 'Target'])
df

              Interaction
Source Target            
a      z         physical
b      c         physical
c      x         physical
d      y         physical
e      b         physical
b      a         physical

Step 2
Generate all possible (Source, Target) combinations of the names and store them in c.
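
One way to build these combinations is itertools.product from the standard library. This is only a sketch, assuming c should hold (Source, Target) tuples drawn from name_list; the list contents are made up for illustration.

```python
import itertools

name_list = ["a", "b", "c"]  # assumed names, not from the question

# Every ordered (Source, Target) pair, including pairs like ("a", "a").
c = list(itertools.product(name_list, name_list))
print(len(c))  # 3 names give 3 * 3 = 9 pairs
```

The number of pairs grows with the square of the list length, so for very long lists the isin approach from the first answer avoids materializing them.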

Step 3
Index into the DataFrame, then save:

df = df.loc[df.index.intersection(c)].reset_index()
df

  Source Target Interaction
0      b      a    physical
1      b      c    physical

df.to_csv('nets.csv')

This approach is an option if you have two or more name_lists to draw combinations from. If all the elements come from a single name_list, go with MaxU's answer instead.
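
As a rough sketch of the two-list case, the three steps can be combined as follows. The names sources and targets and their contents are hypothetical, and the rows are the sample data shown in step 1.

```python
import itertools
import pandas as pd

df = pd.DataFrame({
    "Source": ["a", "b", "c", "d", "e", "b"],
    "Target": ["z", "c", "x", "y", "b", "a"],
    "Interaction": ["physical"] * 6,
})
sources = ["a", "b"]  # hypothetical list of allowed Source names
targets = ["c", "z"]  # hypothetical list of allowed Target names

# Step 1: index on the two lookup columns.
indexed = df.set_index(["Source", "Target"])

# Step 2: all (Source, Target) pairs across the two lists.
pairs = pd.MultiIndex.from_tuples(
    list(itertools.product(sources, targets)),
    names=["Source", "Target"],
)

# Step 3: keep only the pairs that exist in the frame.
res = indexed.loc[indexed.index.intersection(pairs)].reset_index()
print(res)
```

Giving both indexes the same level names keeps the Source and Target column names intact after reset_index.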
