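Delete blank rows from CSV?
I have a large CSV file in which some rows are entirely blank. How do I use Python to delete these blank rows?
Following your suggestions, this is the code I have so far: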
import csv
# open input csv for reading
inputCSV = open(r'C:\input.csv', 'rb')
# create output csv for writing
outputCSV = open(r'C:\OUTPUT.csv', 'wb')
# prepare output csv for appending
appendCSV = open(r'C:\OUTPUT.csv', 'ab')
# create reader object
cr = csv.reader(inputCSV, dialect = 'excel')
# create writer object
cw = csv.writer(outputCSV, dialect = 'excel')
# create writer object for append
ca = csv.writer(appendCSV, dialect = 'excel')
# add pre-defined fields
cw.writerow(['FIELD1_','FIELD2_','FIELD3_','FIELD4_'])
# delete existing field names in input CSV
# ???????????????????????????
# loop through input csv, check for blanks, and write all changes to append csv
for row in cr:
    if row or any(row) or any(field.strip() for field in row):
        ca.writerow(row)
# close files
inputCSV.close()
outputCSV.close()
appendCSV.close()
Is this OK, or is there a better way to do it?
11 Answers
9
Delete empty rows from a .csv file using Python
import csv
...
with open('demo004.csv', newline='') as in_file, open('demo005.csv', 'w', newline='') as out_file:
    writer = csv.writer(out_file)
    for row in csv.reader(in_file):
        # keep the row only if at least one field has non-whitespace content
        if any(field.strip() for field in row):
            writer.writerow(row)
Thanks
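The question also asks how to discard the input file's existing field names while writing a pre-defined header. One way this might be done is sketched below, assuming the first input row is the header to drop. The function name `copy_without_blank_rows` and the in-memory `io.StringIO` buffers (standing in for files opened with `newline=''`) are illustrative and not part of the original posts.

```python
import csv
import io


def copy_without_blank_rows(in_file, out_file, new_header):
    """Copy CSV rows from in_file to out_file, replacing the header
    and skipping rows whose fields are all empty or whitespace."""
    reader = csv.reader(in_file)
    writer = csv.writer(out_file)
    # Assumption: the first input row holds the old field names; discard it.
    next(reader, None)
    writer.writerow(new_header)
    for row in reader:
        if any(field.strip() for field in row):
            writer.writerow(row)


# In-memory buffers stand in for real files opened with newline=''.
source = io.StringIO("old1,old2\r\n1,2\r\n\r\n , \r\n3,4\r\n", newline="")
target = io.StringIO(newline="")
copy_without_blank_rows(source, target, ["FIELD1_", "FIELD2_"])
result = target.getvalue()
```

Writing the header and the data through a single writer also avoids opening the output file twice (once for writing, once for appending), as the code in the question does.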
11
I'm surprised nobody here has mentioned pandas. Here is a possible solution.
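import pandas as pd
# read_csv skips completely blank lines by default (skip_blank_lines=True)
df = pd.read_csv('input.csv')
# rows that contain only delimiters are kept; df.dropna(how='all') would drop them
df.to_csv('output.csv', index=False)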
39
Using the csv module:
import csv
...
with open(in_fnam, newline='') as in_file:
    with open(out_fnam, 'w', newline='') as out_file:
        writer = csv.writer(out_file)
        for row in csv.reader(in_file):
            if row:
                writer.writerow(row)
If you also want to remove rows in which every field is empty, change the if row: line to:
if any(row):
If you also want to treat fields that contain only whitespace as empty, replace it with:
if any(field.strip() for field in row):
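To make the difference between the three checks concrete, here is a small sketch that applies each of them to the same sample data. The sample content and the variable names are made up for illustration, and `io.StringIO` stands in for a file opened with `newline=''`.

```python
import csv
import io

# Sample data: a data row, a truly blank line, a row of empty fields,
# a row whose fields contain only spaces, and another data row.
sample = io.StringIO("a,b\r\n\r\n,\r\n , \r\nc,d\r\n", newline="")
rows = list(csv.reader(sample))

# "if row" drops only the truly blank line (parsed as an empty list).
kept_if_row = [r for r in rows if r]
# "if any(row)" also drops the row whose fields are all empty strings.
kept_if_any = [r for r in rows if any(r)]
# The strip() variant additionally drops the whitespace-only row.
kept_if_stripped = [r for r in rows if any(f.strip() for f in r)]
```

Here `kept_if_row` keeps four rows, `kept_if_any` keeps three, and `kept_if_stripped` keeps only the two data rows.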
Note that in Python 2.x and earlier, the csv module expects binary files, so you need to open the files with the 'b' flag. In 3.x, doing so results in an error.
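The Python 3 side of that note can be checked with a short sketch. It assumes Python 3 and uses `io.BytesIO` and `io.StringIO` as stand-ins for files opened in binary and text mode. Only the reader case is shown, and the variable names are illustrative.

```python
import csv
import io

# io.BytesIO stands in for a file opened with 'rb'.
binary_source = io.BytesIO(b"a,b\r\n")
try:
    next(csv.reader(binary_source))
    outcome = "read succeeded"
except csv.Error as exc:
    # On Python 3 the reader rejects bytes and asks for text mode.
    outcome = "csv.Error: {}".format(exc)

# io.StringIO stands in for a file opened in text mode with newline=''.
text_source = io.StringIO("a,b\r\n", newline="")
first_row = next(csv.reader(text_source))
```

On Python 3, `outcome` records a `csv.Error`, while the text-mode read yields the expected row.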