我有2个CSV文件:
CSV1版本:
"Hypervisor","IP","ABCD","Operating System","Domain","Memory","No. CPU","Availability (%)","Last Collection Time","lol"
"lglac125.lss.com","10.247.52.125","VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599031E9"
"lglac126.lss.com","10.247.52.126","VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9"
"lglac127.lss.com","10.247.52.127","VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","0.0","1.558599031E9"
"lglac128.lss.com","10.247.52.128","VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9"
"lglac129.lss.com","10.247.52.129","VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9"
第2版:
"Hypervisor","IP","Arrays","Operating System","Domain","Memory","No. CPU","Availability (%)","Last Collection Time","DummyColumn"
"lglac125.lss.com","10.247.52.125",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599031E9","A"
"lglac126.lss.com","10.247.52.126",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9","B"
"lglac127.lss.com","10.247.52.127",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","0.0","1.558599031E9","C"
"lglac128.lss.com","10.247.52.128",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9","D"
"lglac129.lss.com","10.247.52.129",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9","E"
"DummyRow","10.247.52.129",,"VMware ESXi 5.5.0 build-9919047","lss.com","524278.03125","4.0","100.0","1.558599931E9","F"
我试图将每列的所有条目(如果在csv2中可用)与相应的行进行比较。如果有任何条目丢失或更改,我需要升起一个标志。这两个文件中都有可能添加或删除任何列。因此我需要首先检查csv2中是否有x列,然后匹配csv1中相同列的条目。你知道吗
我已经为此挣扎了三天,但还是没有找到解决办法。我非常感谢你的帮助。你知道吗
IIUC公司
假设导入熊猫的csv1、csv2为
df1
、df2
。在列上使用intersection
查找匹配的列并对其排序。将其传递给df1
和df2
。最后,eq
在df1
和df2
的匹配列的子集上注意:我将
df1.loc[0, 'IP']
改为10.247.52.124
,以便在df1
的第0行的一个值中显示False
,以便演示从这个
df1_mask
,您可以将它插入df1
以检查NaN
。任何NaN
要么是原始值NaN
,要么在df1
和df2
之间更改注意:您的
df1
有lol
列,但没有值,因此它原来是NaN
或者你可以查看
df2
您可以用^{} 和} 来尝试
indicator=True
和^{both
:这将显示数据帧之间的不常见数据
相关问题 更多 >
编程相关推荐