如何替换数据帧python中的所有单词

2024-05-16 11:30:18 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试将AGL账单转换为dataframe,以便将所需的值放入excel电子表格中。你知道吗

我一直在尝试在行中.replace()个字符,没有任何内容,所以只剩下数字(尝试删除数据帧中的所有单词)。另一个问题是每个单元格中都有多个单词和数字。你知道吗

Here is the current database:

from tabula import read_pdf
import openpyxl
from openpyxl import load_workbook
import pandas as pd
import numpy as np

df1 = tabula.read_pdf('C:/Users/Blake/Desktop/Python/AGL_Bill.pdf',guess=False, pages=2)
df1.columns = ['Description', 'Blank', 'Values']




df1.drop(labels=None, axis=None, index=[0,1,3,4,7,8,25,26,19,15,16,20,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62], columns=None, level=None, inplace=True, errors='raise')
df1.drop(labels=None, axis=1, columns=['Values'], level=None, inplace=True, errors='raise')





df1['Description'].str.replace('kWh', '')



print (df1)

df1.to_csv('Tableone.csv', encoding='utf-8')


wb2 = load_workbook('C:/Users/Blake/Desktop/ETemplate.xlsx')


wb2.create_sheet('DATA')
wb2.save('C:/Users/Blake/Desktop/Template.xlsx')`

Tags: columnsfromimportnonepdf数字单词users
1条回答
网友
1楼 · 发布于 2024-05-16 11:30:18

如果您试图用空字符替换字符-然后RegEx使用数字,每个单元格-将它们连接在一起。你知道吗

进口re

import pandas as pd

data={'1':'Some dumb data $200.22 for me','2':'Some more really dumb data $5.23'}
df=pd.DataFrame.from_dict(data,orient='index')
df.columns=['Data']

def Num_Only(val):
    return ' '.join(re.findall('[\d\.]+',val))

df['New']=''
df.New=df.Data.apply(lambda x: Num_Only(x))
Which should output a new Dataframe ... like this

输出现在是。。。我已经把那美元去掉了,因为它没有用。你知道吗

1.   Some dumb data $200.22 for me  200.22
2   Some more really dumb data $5.23    5.23

希望这能让你走

相关问题 更多 >