How do I compare two files and get only the lines that are not in the second file?

file 1:
col1 col2
john kerry
adam lord
joe hitch

file 2:
col1 col2
john kerry
bob abram

3 Answers

User

#1 · edited 2024-05-26 16:27:28

If the files have the same format, I don't think you need the csv module. How about this solution:

with open('file2') as f2:
    exclude_names = frozenset(f2)  # a set makes membership tests fast
with open('file1') as f1, open('output', 'w') as out:
    for name in f1:
        if name not in exclude_names:  # keep only lines absent from file2
            out.write(name)

The same idea also works with the csv module's reader and writer objects.
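The csv-based snippet from this answer is not available here, so the following is only a minimal sketch of how such a variant might look. The function names, the space delimiter, and the file paths are assumptions made for illustration, not part of the original answer.

```python
import csv

def rows_not_in_second(lines1, lines2, delimiter=' '):
    """Return rows of lines1 (as lists of fields) that do not occur in lines2.

    Both arguments are iterables of text lines, for example open file objects.
    The space delimiter is an assumption based on the sample data.
    """
    # csv.reader yields lists; tuples are hashable and can be stored in a set.
    exclude = {tuple(row) for row in csv.reader(lines2, delimiter=delimiter)}
    return [row for row in csv.reader(lines1, delimiter=delimiter)
            if tuple(row) not in exclude]

def write_difference(path1, path2, out_path, delimiter=' '):
    """Write the rows of path1 that are missing from path2 to out_path."""
    # newline='' is the mode recommended by the csv module documentation.
    with open(path1, newline='') as f1, open(path2, newline='') as f2:
        rows = rows_not_in_second(f1, f2, delimiter)
    with open(out_path, 'w', newline='') as out:
        csv.writer(out, delimiter=delimiter).writerows(rows)
```

Calling `write_difference('file1', 'file2', 'output')` would write the surviving rows in their original order. A header row that appears in both files is filtered out like any other shared row.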

User

#2 · edited 2024-05-26 16:27:28

# compares the rows pairwise, position by position
results = [i for i, j in zip(reader1, reader2) if i != j]

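If order does not matter, use a set difference instead: set(map(tuple, reader1)) - set(map(tuple, reader2)). The rows have to be converted to tuples first, because csv.reader yields lists and lists are not hashable.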
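To make the difference between the two approaches concrete, here is a small sketch that runs both on in-memory lines. The sample rows, the space delimiter, and the helper names `pairwise_diff` and `set_diff` are assumptions for illustration.

```python
import csv

def pairwise_diff(lines1, lines2):
    """Rows of lines1 that differ from the row at the same position in lines2."""
    reader1 = csv.reader(lines1, delimiter=' ')
    reader2 = csv.reader(lines2, delimiter=' ')
    # zip stops at the shorter input, so trailing rows of the longer
    # file are never compared.
    return [i for i, j in zip(reader1, reader2) if i != j]

def set_diff(lines1, lines2):
    """Rows of lines1 that appear nowhere in lines2 (order is lost)."""
    reader1 = csv.reader(lines1, delimiter=' ')
    reader2 = csv.reader(lines2, delimiter=' ')
    # Lists are unhashable, so each row is converted to a tuple first.
    return set(map(tuple, reader1)) - set(map(tuple, reader2))

file1 = ["john kerry", "adam lord", "joe hitch"]
file2 = ["john kerry", "bob abram"]

print(pairwise_diff(file1, file2))  # [['adam', 'lord']]
print(set_diff(file1, file2))       # both 'adam lord' and 'joe hitch'
```

The pairwise version misses `joe hitch` because file2 has no third row to compare it with, which suggests the set difference is the better fit when the files are not aligned line by line.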

User

#3 · edited 2024-05-26 16:27:28

I would use a set difference:

with open('file1') as f1, open('file2') as f2:
    data1 = set(f1)
    lines_not_in_f2 = data1.difference(f2)

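If the formatting of the two files may differ slightly, you may need to wrap each file object in a generator that yields tuples, so that rows are compared field by field rather than as raw lines.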
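The generator-based snippet from this answer is not available here. As a rough sketch of the idea, assuming whitespace-separated fields, it could look like the following; `as_tuples` and `rows_missing_from_second` are hypothetical names.

```python
def as_tuples(lines):
    """Yield each non-blank line as a tuple of whitespace-separated fields."""
    for line in lines:
        fields = tuple(line.split())
        if fields:  # skip blank lines
            yield fields

def rows_missing_from_second(lines1, lines2):
    """Return the set of rows in lines1 that do not occur in lines2."""
    data1 = set(as_tuples(lines1))
    # difference() iterates over the generator, so the second file
    # is consumed row by row.
    return data1.difference(as_tuples(lines2))
```

With real files this would be called as `rows_missing_from_second(f1, f2)` inside a `with open('file1') as f1, open('file2') as f2:` block. Splitting on whitespace makes `john  kerry` and `john kerry` compare equal even though the raw lines differ.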

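The advantage of this approach is that the whole of f2 never has to be read into memory. The drawback is that the output names are unordered, because they are stored in a set.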
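If the original order matters, one possible variation (not part of the answer above) keeps the lines of the first file as dictionary keys, since dicts preserve insertion order in Python 3.7 and later, and removes keys while streaming through the second file. The function name is hypothetical.

```python
def ordered_lines_not_in_second(lines1, lines2):
    """Return lines of lines1 absent from lines2, in their original order."""
    # dict.fromkeys keeps first-seen order; duplicate lines collapse to one.
    remaining = dict.fromkeys(lines1)
    for line in lines2:
        # Drop the line if present; lines2 is read one line at a time.
        remaining.pop(line, None)
    return list(remaining)
```

Like the set version, this still holds the first file in memory and compares raw lines, so a final line without a trailing newline will not match the same text followed by a newline.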
