我正在组织用于标记的AWS资源,并已将数据捕获到CSV文件中。CSV文件的输出示例如下。我试图确保每个资源的\u id都有一个标记\u key的数据集,我需要确保它存在。此数据集是
标记\u键
Application
Client
Environment
Name
Owner
Project
Purpose
我刚接触熊猫,我只把CSV文件读作数据帧
import pandas as pd
file_name = "z.csv"
df = pd.read_csv(file_name, names=['resource_id', 'resource_type', 'tag_key', 'tag_value'])
print (df)
CSV文件
vol-00441b671ca48ba41,volume,Environment,Development
vol-00441b671ca48ba41,volume,Name,Database Files
vol-00441b671ca48ba41,volume,Project,Application Development
vol-00441b671ca48ba41,volume,Purpose,Web Server
i-1234567890abcdef0,instance,Environment,Production
i-1234567890abcdef0,instance,Owner,Fast Company
我希望输出如下
vol-00441b671ca48ba41,volume,Environment,Development
vol-00441b671ca48ba41,volume,Name,Database Files
vol-00441b671ca48ba41,volume,Project,Application Development
vol-00441b671ca48ba41,volume,Purpose,Web Server
vol-00441b671ca48ba41,volume,Client,
vol-00441b671ca48ba41,volume,Owner,
vol-00441b671ca48ba41,volume,Application,
i-1234567890abcdef0,instance,Environment,Production
i-1234567890abcdef0,instance,Owner,Fast Company
i-1234567890abcdef0,instance,Application,
i-1234567890abcdef0,instance,Client,
i-1234567890abcdef0,instance,Name,
i-1234567890abcdef0,instance,Project,
i-1234567890abcdef0,instance,Purpose,
一种方法是使用多索引
from_product
和renindex
:输出:
举个稍微简单一点的例子。我有数据帧df:
返回
有要求的b:1、2、3、4和5。你知道吗
然后我们需要找出我们已经拥有了什么。我们这样做:
退货:
现在我们需要列出缺少的列并添加这些列:
返回:
相关问题 更多 >
编程相关推荐