这可能已经得到了回答,如果是这样,请指示我的解决方案页面与链接。你知道吗
我拥有的是一个包含100 largest countries by total area (land and water surface)详细信息的文件:
('1','Russia','17,098,242(6,601,668)','Asia/Europe','Azerbaijan, Belarus, China, Estonia, Finland, Georgia, Kazakhstan, Latvia, Lithuania, Mongolia, North Korea, Norway, Poland, Ukraine')
('2','Canada','9,984,670(3,855,100)','North America','United States')
('3','United States(incl. overseas territories)','9,857,348(3,805,943)','North America','Canada, Mexico')
('4','China','9,596,961(3,705,407)','Asia','Afghanistan, Bhutan, India, Kazakhstan, Kyrgyzstan, Laos, Mongolia, Myanmar, Nepal, North Korea, Pakistan, Russia, Tajikistan, Vietnam')
('5','Brazil','8,515,770(3,287,957)','South America','Argentina, Bolivia, Colombia, France (French Guiana), Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela'),
....
....
是的,输入文件在行首和行尾都有(&;)。你知道吗
任何帮助都将不胜感激。你知道吗
到目前为止,我试图通过写下:
onlyCountries = 'allcountries.txt'
print([x.split(',')[1] for x in open(onlyCountries)])
但这给我的输出是:
["'Russia'", "'Canada'", "'United States(incl. overseas territories)'", "'China'", "'Brazil'"...]
注意,我从上面给出的输入文件示例中得到的额外双引号?我想得到如下输出:
['Russia','Canada','United States','China','Brazil',....]
你可以用熊猫来做这个:
输出:
相关问题 更多 >
编程相关推荐