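I am trying to load an Excel file in pandas, but I get the following error:
AttributeError: 'int' object has no attribute 'strip'
To make this easier to follow, here is the header row and one data row from the Excel file: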
Row ID, Order ID, Order Date, Ship Date, Ship Mode, Customer ID, Customer Name, Segment, Country/Region, City, State, Postal Code, Region, Product ID, Category, Sub-Category, Product Name, Sales, Quantity, Discount, Profit
1, CA-2018-152156, 08-11-2018, 11-11-2018, Second Class, CG-12520, Claire Gute, Consumer, United States, Henderson, Kentucky, 42420, NorthEAST, FUR-BO-10001798, Finance, Bookcases, Bush Somerset Collection Bookcase, 261.96, 2, 0, 41.9136
This is the full code:
import pandas as pd

local_path = '../../data/RetailStore.xlsx'
out_path = '../../out/hyperstore.csv'


def load_retail_data(local_path, sheet_name):
    # Read the sheet, using the row at zero-based index 4 as the header
    return pd.read_excel(
        local_path,
        header=4,
        sheet_name=sheet_name,
        parse_dates=True
    )


def clean_headers(data_frame: pd.DataFrame) -> pd.DataFrame:
    # Normalize the column labels one step at a time
    data_frame = data_frame.rename(columns=lambda x: x.strip())
    data_frame = data_frame.rename(columns=lambda x: x.replace('\n', ' '))
    data_frame = data_frame.rename(columns=lambda x: x.replace("'", ' '))
    # Collapse double spaces into a single space
    data_frame = data_frame.rename(columns=lambda x: x.replace('  ', ' '))
    return data_frame


def filter_ship_mode(df):
    # ColumnsStore is defined elsewhere; it is not shown in the question
    return df[(df[ColumnsStore.ship_mode] != 'Standard Class') & (df[ColumnsStore.ship_mode] != 'Second Class')]


def calc_retail_data(local_path, sheet_name):
    retail_data = load_retail_data(local_path, sheet_name)
    retail_clean_headers = clean_headers(retail_data)
    retail_filtered = filter_ship_mode(retail_clean_headers)
    return retail_filtered


if __name__ == "__main__":
    df_retail_data = calc_retail_data(local_path, 'Orders')
    df_retail_data.to_csv(out_path, index=False)
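You can convert the column headers to strings before cleaning them. The error suggests that at least one column label read from the sheet is an int, and an int has no strip() method.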
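A minimal sketch of that first suggestion, assuming the failure comes from the `x.strip()` call in `clean_headers`. The sample frame and its integer label `2018` are hypothetical and only stand in for whatever non-string header the real sheet contains. Whitespace runs are collapsed with `split`/`join` here, which is a slight simplification of the chained `replace` calls in the question.

```python
import pandas as pd


def clean_headers(data_frame: pd.DataFrame) -> pd.DataFrame:
    def clean(label):
        # Cast to str first so that int labels no longer break .strip()
        text = str(label).strip().replace('\n', ' ').replace("'", ' ')
        # Collapse any run of whitespace into a single space
        return ' '.join(text.split())

    return data_frame.rename(columns=clean)


# Hypothetical frame with one integer column label (2018)
df = pd.DataFrame([[1, 'Second Class', 5]],
                  columns=[' Row ID', 'Ship\nMode ', 2018])
cleaned = clean_headers(df)
print(list(cleaned.columns))
```

Note that after this step every label is a string, so a column that used to be addressed as `df[2018]` would have to be addressed as `df['2018']`.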
Alternatively, skip the labels that are ints and clean only the string ones.
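The second suggestion could look roughly like this: a sketch in which non-string labels are returned unchanged and only string labels are cleaned. The helper name `clean_label` and the sample frame are hypothetical, not part of the original code.

```python
import pandas as pd


def clean_label(label):
    # Leave non-string labels (for example ints) untouched
    if not isinstance(label, str):
        return label
    return label.strip().replace('\n', ' ').replace("'", ' ')


# Hypothetical frame with one integer column label (2018)
df = pd.DataFrame([[1, 'First Class', 5]],
                  columns=[' Row ID', 'Ship Mode\n', 2018])
cleaned = df.rename(columns=clean_label)
print(list(cleaned.columns))
```

This variant keeps the original label types, which may matter if later code looks columns up by their integer label.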