Saving a pandas groupby object to a CSV file

import pandas as pd

dff = pd.DataFrame({
    'SKU': ['001', '002', '003'],
    'revenue_contribution_in_percentage': [0.2, 0.5, 0.3],
    'BuyPrice': [2, 3, 4],
    'SellPrice': [5, 6, 6],
    'margin': [3, 3, 2],
    'Avg_per_week': [3, 2, 5],
    'StockOnHand': [4, 10, 20],
    'StockOnOrder': [0, 0, 0],
    'Supplier': ['ABC', 'ABC', 'ABZ'],
    'SupplierLeadTime': [5, 5, 5],
    'cumul_value': [0.4, 0.6, 1],
    'class_mention': ['A', 'A', 'B'],
    'std_week': [1, 2, 1],
    'review_time': [2, 2, 2],
    'holding_cost': [0.35, 0.35, 0.35],
    'aggregate_order_placement_cost': [200, 230, 210],
})

# Note: 'periods' is not among the columns of dff defined above,
# so this column selection fails with a KeyError.
groups = [
    group.reset_index().set_index(['SKU'])[[
        'revenue_contribution_in_percentage', 'BuyPrice', 'SellPrice',
        'margin', 'Avg_per_week', 'StockOnHand', 'StockOnOrder',
        'Supplier', 'SupplierLeadTime', 'cumul_value', 'class_mention',
        'std_week', 'review_time', 'holding_cost',
        'aggregate_order_placement_cost', 'periods',
    ]]
    for _, group in dff.groupby('Supplier')
]
df_group = pd.DataFrame(groups).sum()
group_to_excel = df_group.to_csv('results.csv')

   SKU  revenue_contribution_in_percentage  BuyPrice  SellPrice  margin  \
0  001                                 0.2         2          5       3
1  002                                 0.5         3          6       3

   Avg_per_week  StockOnHand  StockOnOrder  Supplier  SupplierLeadTime  \
0             3            4             0       ABC                 5
1             2           10             0       ABC                 5

   cumul_value  class_mention  std_week  review_time  holding_cost  \
0          0.4              A         1            2          0.35
1          0.6              A         2            2          0.35

   aggregate_order_placement_cost
0                             200
1                             230

   SKU  revenue_contribution_in_percentage  BuyPrice  SellPrice  margin  \
0  003                                 0.3         4          6       2

   Avg_per_week  StockOnHand  StockOnOrder  Supplier  SupplierLeadTime  \
0             5           20             0       ABZ                 5

   cumul_value  class_mention  std_week  review_time  holding_cost  \
0            1              B         1            2          0.35

   aggregate_order_placement_cost
0                             210

1 answer

User

#1 · Posted 2024-05-23 22:51:58

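You don't need groupby, because you aren't aggregating anything. What you really want is to slice dff by each unique supplier and export each slice to its own file. Try this: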

cols = [
    'SKU',
    'revenue_contribution_in_percentage',
    'BuyPrice',
    'SellPrice',
    'margin',
    'Avg_per_week',
    'StockOnHand',
    'StockOnOrder',
    'Supplier',
    'SupplierLeadTime',
    'cumul_value',
    'class_mention',
    'std_week',
    'review_time',
    'holding_cost',
    'aggregate_order_placement_cost'
]

for supplier in dff['Supplier'].unique():
    sub_dff = dff[dff['Supplier'] == supplier][cols]
    sub_dff.to_csv(f'{supplier}_data.csv')
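
For comparison, the same per-supplier export can also be written by iterating over a groupby object, since each iteration yields the supplier key together with its slice of the frame. The following is only a minimal sketch: the helper name export_by_supplier, the out_dir parameter, the temporary output directory, and the reduced three-column stand-in for dff are assumptions made for illustration and are not part of the original question or answer.

```python
import tempfile
from pathlib import Path

import pandas as pd


def export_by_supplier(frame, out_dir):
    """Write one CSV per unique Supplier into out_dir; return {supplier: path}.

    Hypothetical helper, named here for illustration only.
    """
    out_dir = Path(out_dir)
    paths = {}
    # Iterating a groupby yields (key, sub-frame) pairs, one per supplier.
    for supplier, group in frame.groupby('Supplier'):
        path = out_dir / f'{supplier}_data.csv'
        # index=False leaves the original row index out of the file.
        group.to_csv(path, index=False)
        paths[supplier] = path
    return paths


# Reduced stand-in for dff, keeping only a few of the original columns.
dff = pd.DataFrame({
    'SKU': ['001', '002', '003'],
    'BuyPrice': [2, 3, 4],
    'Supplier': ['ABC', 'ABC', 'ABZ'],
})

# A temporary directory is assumed here so the sketch does not clutter
# the working directory.
out_dir = tempfile.mkdtemp()
paths = export_by_supplier(dff, out_dir)
```

When reading the files back, passing dtype={'SKU': str} to pd.read_csv keeps SKU values such as '001' from being parsed as the integer 1.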
