如何将.findall（）函数用于遍历列的所有行的excel文件？

import openpyxl import re import sys import pandas as pd reload(sys) sys.setdefaultencoding('utf8') wb = openpyxl.load_workbook('/Users/ap/info1.xlsx') ws = wb.get_sheet_by_name('Companies') w={'Name': [],'Affiliation': [], 'Email':[]} for row in ws.iter_rows('C{}:C{}'.format(ws.min_row,ws.max_row)): for cells in row: a=re.findall(r'Mr.(.*?)Affiliation:',aa, re.DOTALL) a1="".join(a).replace('\n',' ') b=re.findall(r'Affiliation:(.*?)E-mail',aa,re.DOTALL) b1="".join(b).replace('\n',' ') c=re.findall(r'E-mail(.*?)Mobile',aa,re.DOTALL) c1="".join(c).replace('\n',' ') w['Name'].append(q1) w['Affiliation'].append(r1) w['Email'].append(s1) print cell.value df=pd.DataFrame(data=w) df.to_excel(r'/Users/ap/info2.xlsx')

1条回答

网友

1楼 · 发布于 2024-05-15 13:24:40

我会用这个，它代替了“E”-邮件：。。。，然后拆分并分配给右列

df['Name'] = np.nan
df['Affiliation'] = np.nan
df['Email'] = np.nan
df['Mobile'] = np.nan

for i in range(0, len(df)):
    full_value = df['Companies'].loc[i]
    full_value = full_value.replace('Affiliation:', ';').replace('E-mail:', ';').replace('Mobile:', ';')
    full_value = full_value.split(';')
    df['Name'].loc[i] = full_value[0]
    df['Affiliation'].loc[i] = full_value[1]
    df['Email'].loc[i] = full_value[2]
    df['Mobile'].loc[i] = full_value[3]

del df['Companies']
print(df)

相关问题更多 >

编程相关推荐

热门问题

热门文章