How do I compare a .csv file and an .xlsx file and print the specific fields that don't match?

1 answer

User

#1 · Posted 2024-06-11 20:06:23

To read all the values in a column from the csv, do the following:

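from collections import defaultdict
from csv import DictReader as csv_DictReader
from pathlib import Path

csv_file = defaultdict(list)  # maps each column name to the list of its values
filepath = Path("whatever/myfile.csv")  # a Path, so that .open() is available
with filepath.open(encoding="cp1252", newline="") as file:
    reader = csv_DictReader(file)  # the first row is used as the column names
    for row in reader:
        for (k, v) in row.items():
            csv_file[k].append(v)
csv_column = csv_file['employeeID']  # Tell it what column to read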

To read all the values in a column from Excel, do the following:

from openpyxl import load_workbook

filepath = "whatever/myfile.xlsx"
excel_file = load_workbook(filepath)
excel_sheet = excel_file.active  # the sheet that was active when the file was saved
excel_columns = {}  # maps each column letter to the list of its cell values
for column in "ABC": # Tell it what columns to read
    if column not in excel_columns:
        excel_columns[column] = []
    # Starts at row 1, so a header row (if there is one) is included
    for row in range(1, excel_sheet.max_row + 1):
        cell_name = f"{column}{row}"
        excel_columns[column].append(excel_sheet[cell_name].value)

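Both files have now been read in full, and what is left are two structures: csv_column, a list of the values in the chosen csv column, and excel_columns, a dict that maps each column letter to the list of its cell values.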

All you have to do now is compare the results.
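One possible way to do the comparison is sketched below. It is not part of the original project: the names normalize and find_mismatches and the sample data are made up for illustration. It assumes the rows in both files are in the same order, and that converting every value to a stripped string is an acceptable way to line up the csv values (always strings) with the Excel values (which openpyxl may return as numbers, dates or None). If Excel stores an ID as a float such as 1001.0, this simple normalization will still report it as different from "1001".

```python
from itertools import zip_longest


def normalize(value):
    """Turn a cell value into a stripped string; None becomes an empty string."""
    return "" if value is None else str(value).strip()


def find_mismatches(csv_values, excel_values):
    """Return (row_number, csv_value, excel_value) for every row that differs."""
    mismatches = []
    # zip_longest pads the shorter list with None, so extra rows are reported too
    pairs = zip_longest(csv_values, excel_values)
    for row_number, (csv_value, excel_value) in enumerate(pairs, start=1):
        if normalize(csv_value) != normalize(excel_value):
            mismatches.append((row_number, csv_value, excel_value))
    return mismatches


# Made-up sample data standing in for csv_column and one Excel column
csv_sample = ["1001", "1002", "1003"]
excel_sample = [1001, 1005, 1003, 1004]

for row_number, csv_value, excel_value in find_mismatches(csv_sample, excel_sample):
    print(f"row {row_number}: csv={csv_value!r} excel={excel_value!r}")
```

With the real data, the call would look something like find_mismatches(csv_column, excel_columns["A"][1:]). The [1:] skips the first Excel row on the assumption that it is a header, because DictReader already consumes the header row of the csv.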

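Suggestion: print both csv_column and excel_columns to check what the code above actually gives you. To be honest, I just copied this code from a project I did last year and have forgotten half of it, so I can't be completely sure what the output looks like. It did work in that project, though.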
