How do I sort this CSV file?

with open(outfname, "rb") as unsorted_file:
    csv_f = csv.reader(unsorted_file)
    header = next(csv_f, None)
    sorted_data = sorted(csv_f, key=operator.itemgetter(0))

with open(outfname, 'wb') as sorted_file:
    csv_f = csv.writer(sorted_file, quoting=csv.QUOTE_ALL)
    if header:
        csv_f.writerow(header)
    csv_f.writerows(sorted_data)

3 Answers

User

#1 · edited 2024-06-16 14:14:25

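This happens because, compared as strings, "10_..." sorts before "1_..." (the character '0' comes before '_'). You want to compare integers, not strings. You can get that behavior by building an integer for each row from the characters that precede the underscore. Suppose you have the string s (you can obtain it with itemgetter, as you do now). Then the following lambda, passed to sorted as key, does what you want.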

key=lambda s: int(s[:s.index('_')])

The function is simple: it returns the integer formed by the characters of s up to, but not including, the first underscore.
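As a rough sketch of how that key might be applied to the rows that csv.reader yields (each row is a list of strings, so the first column has to be picked out before slicing): the helper name `leading_int` and the sample rows below are made up for illustration, and the code assumes the first column always looks like `<digits>_<rest>`.

```python
def leading_int(row):
    """Return the integer before the first underscore in the row's first column."""
    first = row[0]
    # str.index raises ValueError if the field contains no underscore
    return int(first[:first.index('_')])

# Hypothetical rows, shaped like what csv.reader yields (lists of strings)
rows = [['10_somefile', '0', '0'],
        ['1_somefile', '0', '0'],
        ['2_somefile', '0', '0']]

sorted_rows = sorted(rows, key=leading_int)
print([row[0] for row in sorted_rows])  # ['1_somefile', '2_somefile', '10_somefile']
```

In the question's code, this would take the place of key=operator.itemgetter(0).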

User

#2 · edited 2024-06-16 14:14:25

Assuming every filename field starts with a number, the simplest approach is to parse that integer out of the filename and sort by it.

# Assume this is the data of the CSV after reading it in
filenames = ['10_somefile,0,0',
 '1_somefile,0,0',
 '2_somefile,0,0',
 '3_somefile,0,0',
 '4_somefile,0,0',
 '5_somefile,0,0',
 '6_somefile,0,0',
 '7_somefile,0,0',
 '8_somefile,0,0',
 '9_somefile,0,0']

# Here, we treat the first part of the filename (the number before the underscore) as the sort key.
sorted_data = sorted(filenames, key=lambda l: (int(l.partition('_')[0])))

If you print sorted_data, it should look like this:

['1_somefile,0,0', '2_somefile,0,0', '3_somefile,0,0', 
 '4_somefile,0,0', '5_somefile,0,0', '6_somefile,0,0', 
 '7_somefile,0,0', '8_somefile,0,0', '9_somefile,0,0', '10_somefile,0,0']
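The example above sorts plain strings. Applied to the file from the question, the same idea might look like the sketch below. It is written for Python 3 (text mode with newline='' instead of the "rb"/"wb" modes in the question), the function name `sort_csv_by_leading_int` is hypothetical, and it assumes the first row is a header and that no row is empty.

```python
import csv

def sort_csv_by_leading_int(path):
    """Sort a CSV file in place by the integer prefix of its first column."""
    # newline='' lets the csv module handle line endings itself (Python 3)
    with open(path, newline='') as unsorted_file:
        reader = csv.reader(unsorted_file)
        header = next(reader, None)  # assumption: the first row is a header
        # partition('_')[0] is everything before the first underscore
        rows = sorted(reader, key=lambda row: int(row[0].partition('_')[0]))

    with open(path, 'w', newline='') as sorted_file:
        writer = csv.writer(sorted_file, quoting=csv.QUOTE_ALL)
        if header:
            writer.writerow(header)
        writer.writerows(rows)
```

All rows are read into memory before the file is reopened for writing, so reusing the same path for input and output is safe here.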

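User

#3 · edited 2024-06-16 14:14:25

The key passed to sorted returns the first element of each row as a string, which puts "10_..." before "1_...". You need a "natural sort" rather than this plain string sort.

See: How to correctly sort a string with a number inside?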
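One common way to build a natural sort key is to split each string into digit and non-digit chunks and compare the digit chunks as integers. The sketch below is an illustration of that idea, not code taken from the linked question; `natural_key` is a name chosen here.

```python
import re

def natural_key(text):
    """Split text into alternating non-digit/digit chunks; digit chunks become ints."""
    # With a capturing group, re.split keeps the digit runs, and they always
    # land at odd indices: ['', '10', '_somefile'] for '10_somefile'
    parts = re.split(r'(\d+)', text)
    return [int(part) if i % 2 else part for i, part in enumerate(parts)]

names = ['10_somefile', '1_somefile', '2_somefile']
print(sorted(names, key=natural_key))  # ['1_somefile', '2_somefile', '10_somefile']
```

For rows from csv.reader, the key would be applied to the first column, for example key=lambda row: natural_key(row[0]). Unlike int(...) on the prefix, this also orders names that contain numbers in the middle or have no leading number at all.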
