Python3：处理CSV输出中UTF8不兼容字符

import csv, os, os.path rfile = open(nonbillabletest2.csv,'r',newline='') dataread= csv.reader(rfile) trash=next(rfile) #ignores the header line in csv: #Process the target CSV by creating an output with a unique filename per CompanyName for line in dataread: [CompanyName,Specifics] = line #Check that a target csv does not exist if os.path.exists('test leads '+CompanyName+'.csv') < 1: wfile= open('test leads '+CompanyName+'.csv','a') datawrite= csv.writer(wfile, lineterminator='\n') datawrite.writerow(['CompanyName','Specifics']) #write new header row in each file created datawrite.writerow([CompanyName,Specifics]) wfile.close() rfile.close()

1条回答

网友

1楼 · 发布于 2024-05-29 01:48:47

所以nonbillabletest2.csv不是用UTF-8编码的。在

你可以：

把它修好。确保它正确编码为UTF-8，如您所期望的那样。这可能是您所指的“SQL解决方案”。在

事先删除所有非ascii字符（对于纯粹主义者来说，这会破坏数据，但根据您所说的，这似乎是您可以接受的）

import csv, os, string
rfile = open('nonbillabletest2.csv', 'rb')
rbytes = rfile.read()
rfile.close()

contents = ''
for b in rbytes:
  if chr(b) in string.printable + string.whitespace:
    contents += chr(b)

dataread = csv.reader(contents.split('\r\n'))
....

相关问题更多 >

编程相关推荐

热门问题

热门文章