在分割CSV文件时从前端和后端删除双引号

import csv divisor = 500000 outfileno = 1 outfile = None with open('testM.txt', 'r') as infile: infile_iter = csv.reader(infile) header = next(infile_iter) for index, row in enumerate(infile_iter): if index % divisor == 0: if outfile is not None: outfile.close() outfilename = 'big-{}.csv'.format(outfileno) outfile = open(outfilename, 'w') outfileno += 1 writer = csv.writer(outfile) writer.writerow(header) writer.writerow(row) if outfile is not None: outfile.close()

2条回答

网友

1楼 · 编辑于 2024-06-06 20:08:55

快速浏览CSV模块将有您的问题的答案。你知道吗

https://docs.python.org/3/library/csv.html#csv.QUOTE_NONE

网友

2楼 · 编辑于 2024-06-06 20:08:55

您可以使用Pandas修复输入并使逻辑更加简单。你知道吗

import csv
import pandas as pd

filename='big-'
for count, chunk in enumerate(pd.read_csv(filename, delimiter=",", quoting=csv.QUOTE_NONE, encoding='utf-8', iterator=True, chunksize=50000)):
    #fix the 1 and N columns to remove the doublequotes char
    chunk[chunk.columns[0]]=chunk[chunk.columns[0]].str[1:]
    chunk[chunk.columns[-1]]=chunk[chunk.columns[-1]].str[:-1]
    #change these columns datatypes if necessary/useful
    #put in the rest of your logic here (saving files etc..)
    chunk.to_csv(file_name+'{}'.format(count))

*警告：我尚未测试整个解决方案。所以你的里程数可能会有所不同。你知道吗

感谢@code mocker的报价。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章