但我没能适应下面的情景
Pandas - Python, deleting rows based on Date column
我有一个输出.csv包含以下列的文件
Customer, Alertkey, Node, Alertgroup, FirstOccurrence,
TKT_Flag, X733SpecificProb, TKT_TicketNumber, TKT_Keyword
文件将每隔7天从数据库中以最近7天的数据递增更新一次
所以理想情况下,我必须从文件中删除前7天的数据。在
我可以写下面,但得到类型错误“TypeError:string index must be integers”
^{pr2}$会有上百万条线。我复制了2016年1月1日的前几行。但从2016年1月1日到现在。每周它会追加并删除前7天的记录,即第一次删除1月1日至1月6日的记录,依此类推
Customer,Alertkey,Node,Alertgroup,FirstOccurrence,TKT_Flag,X733SpecificProb,TKT_TicketNumber,TKT_Keyword
Cust1,Cust1_11_53_Services_Warning,Node_Cust1,ITM_K53_SERVICEMON,2016-01-01 00:12:59,1005,TOLPUKC_OS:25223174,INC000014799786,CGMIDDLEWARE_MEDIUM_CONNECTDIRECT
Cust1,Cust1_11_53_Services_Warning,Node1_Cust1,ITM_K53_SERVICEMON,2016-01-01 00:12:59,1005,TOLPUKC_OS:25223175,INC000014799785,CGMIDDLEWARE_MEDIUM_CONNECTDIRECT
Cust2,Cust2_21_NT_System_CPU_Critical,Cust2_Node8,ITM_NT_System,2016-01-01 00:15:48,101,PARPFRC_OS:21192843,INC000000628410,WINDOWS_MEDIUM_DEFPRODUCTSILVER
Cust3,Cust3_10352_LZ_TDW_DISK_Critica,Cust3_Node22,ITM_Linux_Disk,2016-01-01 00:17:05,200,TOLPUKC_OS:25223370,INC000001412280,CGMOM_HIGH_DEFPRODUCT
Cust6,Cust6_11_53_Services_Warning,Cust6_Node700,ITM_K53_SERVICEMON,2016-01-01 00:22:36,22,TOLPUKC_OS:25223601,INC000002250120,CGIOWINTELIMOC_MEDIUM_DEFPRODUCT
替换此项:
df=df[(parser.parse(df['FirstOccurrence']) < SevenDays)].drop(df.columns)
有:
df = df.drop(df[(parser.parse(df['FirstOccurrence']) < SevenDays)].index, inplace=True)
试试这个希望这对你有帮助。在
相关问题 更多 >
编程相关推荐