I am trying to save some data from a table into a CSV file.
import requests
import csv
from bs4 import BeautifulSoup

# Main function
def getContent(link):
    # Request content
    result1 = requests.get(link)
    # Save source in var
    src1 = result1.content
    # Activate soup
    soup = BeautifulSoup(src1, 'lxml')
    # Look for table
    table = soup.find('table')
    # Save in csv
    with open('averageheight.csv', 'w', newline='') as f:
        writer = csv.writer(f)
        for tr in table('tr'):
            row = [t.get_text(strip=True) for t in tr(['td', 'th'])]
            writer.writerow(row)

# LINKS
getContent('https://en.wikipedia.org/wiki/Average_human_height_by_country')
The error I get is:
File "c:/Users/Agent 1/Desktop/Datapackages/Average Height/process.py", line 31, in <module>
getContent('https://en.wikipedia.org/wiki/Average_human_height_by_country')
File "c:/Users/Agent 1/Desktop/Datapackages/Average Height/process.py", line 27, in getContent
writer.writerow(row)
File "C:\Users\Agent 1\AppData\Local\Programs\Python\Python38-32\lib\encodings\cp1252.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u2044' in position 24: character maps to <undefined>
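I ran your code on my machine and did not get an error. However, you should consider passing
encoding='utf-8'
to open(...) in the with statement. Your traceback shows Python encoding the file with the cp1252 codec, which cannot represent the character '\u2044'; UTF-8 can. Use this modified line:
with open('averageheight.csv','w',newline='',encoding='utf-8') as f: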
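A minimal sketch of that fix, runnable without network access. It assumes made-up sample rows in place of the scraped Wikipedia table; the `write_rows` helper and the demo file name are hypothetical and not part of the original script. It first checks that cp1252 really cannot encode '\u2044', then writes and reads the file back as UTF-8:

```python
import csv
import os
import tempfile

# Check the cause of the traceback: cp1252 has no mapping for U+2044 (FRACTION SLASH).
try:
    '\u2044'.encode('cp1252')
    cp1252_ok = True
except UnicodeEncodeError:
    cp1252_ok = False


def write_rows(path, rows):
    """Hypothetical helper: write rows to a CSV file with an explicit UTF-8 encoding."""
    # encoding='utf-8' makes the output independent of the platform default codec.
    with open(path, 'w', newline='', encoding='utf-8') as f:
        writer = csv.writer(f)
        writer.writerows(rows)


# Made-up sample data; the last cell contains the character from the traceback.
rows = [['Country', 'Height'], ['Example', '5 ft 7 1\u20442 in']]
path = os.path.join(tempfile.gettempdir(), 'averageheight_demo.csv')
write_rows(path, rows)

# Read the file back with the same encoding to confirm the round trip.
with open(path, newline='', encoding='utf-8') as f:
    read_back = list(csv.reader(f))
```

In the original script, the same change amounts to adding encoding='utf-8' to the existing open(...) call; the rest of getContent can stay as it is.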