从一批文件中提取文本，并将其写入Excel fi

<tr> <td width=25%> Arnold Ed </td> <td width=15%> 18 Feb 1959 </td> </tr> <tr> <td width=15%> 男性 </td> <td width=15%> 02 March 2002 </td> </tr> <tr> <td width=15%> Guangxi </td> </tr>

from bs4 import BeautifulSoup import xlwt list_open = open("c:\\file list.txt") read_list = list_open.read() line_in_list = read_list.split("\n") for each_file in line_in_list: page = open(each_file) soup = BeautifulSoup(page.read()) all_texts = soup.find_all("td") for a_t in all_texts: a = a_t.renderContents() #"print a" here works ok book = xlwt.Workbook(encoding='utf-8', style_compression = 0) sheet = book.add_sheet('namelist', cell_overwrite_ok = True) sheet.write (0, 0, a) book.save("C:\\details.xls")

list_open = open("c:\\file list.txt") read_list = list_open.read() line_in_list = read_list.split("\n") book = xlwt.Workbook(encoding='utf-8', style_compression = 0) sheet = book.add_sheet('namelist', cell_overwrite_ok = True) for i,each_file in enumerate(line_in_list): page = open(each_file) soup = BeautifulSoup(page.read()) all_texts = soup.find_all("td") for j,a_t in enumerate(all_texts): a = a_t.renderContents() sheet.write (i, j, a) book.save("C:\\details.xls")

1条回答

网友

1楼 · 发布于 2024-04-18 15:23:04

您没有将最后四行放入for循环。我想这就是为什么它只把最后一段文本写入Excel文件。你知道吗

编辑

book = xlwt.Workbook(encoding='utf-8', style_compression = 0)
sheet = book.add_sheet('namelist', cell_overwrite_ok = True)

for i, each_file in enumerate(line_in_list):
    page = open(each_file)
    soup = BeautifulSoup(page.read())

    all_texts = soup.find_all("td")

    for j, a_t in enumerate(all_texts):
        a = a_t.renderContents()                   
        sheet.write(i, j, a)

book.save("C:\\details.xls")

相关问题更多 >

编程相关推荐

热门问题

热门文章