Need advice on speeding up data cleaning in Python code

from datetime import datetime

import pandas as pd


def date_split(calendar):
    calendar_total = pd.DataFrame()
    num = calendar.shape[0] - 1
    i = 0
    # the original post capped this loop at i <= 10000
    while i <= num:
        tem = calendar.iloc[i, 1]
        # extract year, month and day from the date column
        listdate = datetime.strptime(tem, '%Y-%m-%d')
        new_calendar = {}
        new_calendar['Year'] = listdate.year
        new_calendar['Month'] = listdate.month
        new_calendar['Date'] = listdate.day
        # add the other columns
        new_calendar['listId'] = calendar.iloc[i, 0]
        new_calendar['available'] = calendar.iloc[i, 2]
        new_calendar['price'] = calendar.iloc[i, 3]
        # convert the dict to a one-row DataFrame
        row = pd.DataFrame(new_calendar, index=[i])
        # DataFrame.append was removed in pandas 2.0, so use pd.concat
        calendar_total = pd.concat([calendar_total, row])
        i = i + 1
    return calendar_total

1 Answer

User

#1 · Posted 2024-04-26 05:10:59

This is how I would extract the year, month and day from an existing DataFrame into a new one:

import pandas as pd

# sample data: 20 years of consecutive daily dates
df = pd.DataFrame({'date': pd.date_range("19970202", periods=365*20)})

# the .dt accessor extracts each component for the whole column at once
df2 = pd.DataFrame({'year': df['date'].dt.year, 'month': df['date'].dt.month, 'day': df['date'].dt.day})

print(df)
print(df2)

I haven't tested this on a large dataset (1.3 million rows?), but it may speed things up for you.
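For illustration, here is a rough sketch of how the same `.dt` idea might be applied to the `date_split` function from the question. It assumes the column order implied by the question's `iloc` calls (listing id, date string, available, price); the sample column names and values below are made up for the demo and are not from the original post.

```python
import pandas as pd


def date_split(calendar: pd.DataFrame) -> pd.DataFrame:
    # Assumed column positions (taken from the question's iloc indices):
    # 0 = listing id, 1 = date string 'YYYY-MM-DD', 2 = available, 3 = price
    dates = pd.to_datetime(calendar.iloc[:, 1], format="%Y-%m-%d")

    # Build every output column in one vectorized step, with no Python loop
    return pd.DataFrame({
        "Year": dates.dt.year,
        "Month": dates.dt.month,
        "Date": dates.dt.day,
        "listId": calendar.iloc[:, 0],
        "available": calendar.iloc[:, 2],
        "price": calendar.iloc[:, 3],
    })


# Hypothetical sample data, only to show the call
calendar = pd.DataFrame({
    "listing_id": [1, 1, 2],
    "date": ["2016-01-04", "2016-12-31", "2017-02-28"],
    "available": ["t", "f", "t"],
    "price": ["$85.00", None, "$120.00"],
})

result = date_split(calendar)
print(result)
```

Parsing the whole column once and avoiding a per-row DataFrame build is usually where most of the time is saved, though the actual gain on your data would need to be measured.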
