Python: converting a comma-delimited .csv file to a dictionary

def createDictionary(filename):
    f = open(filename, 'r')
    dic = {}
    for line in f:
        #line = line.strip()
        data = line.split(',')
        print data
        dic[data[0]] = data[1]
    print dic

going to be late\rg2cu', 'glad to see you\rg2e', 'got to eat\rg2g', 'got to go\rg2g2tb', 'got to go to the bathroom\rg2g2w', 'got to go to work\rg2g4aw', 'got to go for a while\rg2gb', 'got to go bye\rg2gb2wn', 'got to go back to work now\rg2ge', 'got to go eat\rg2gn', 'got to go now\rg2gp', 'got to go pee\rg2gpc', 'got 2 go parents coming\rg2gpp', 'got to go pee pee\rg2gs', 'got to go sorry\rg2k', 'good to know\rg2p', 'got to pee\rg2t2s', 'got to talk to someone\rg4u', 'good for you\rg4y', 'good for you\rg8', 'gate\rg9', 'good night\rga', 'go ahead\rgaalma', 'go away and leave me alone\rgafi', 'get away from it\rgafm', 'Get away from me\rgagp', 'go and get pissed\rgaj'

def createDictionary(filename):
    f = open(filename, 'r')
    dic = {}
    for line in f:
        line = line.strip()
        data = line.split(',')
        print data
        dic[data[0]] = data[1]
    print dic

if __name__ == "__main__":
    x = createDictionary("textToEnglish.csv")
    print x

2 Answers

User
Answer 1 · edited 2024-04-20 03:08:12

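Just add a return to your function. Also, because the first column of the CSV contains duplicate values, the length of the dictionary will differ from the number of rows in the CSV. Dictionary keys must be unique, so when a key is reused, the later value replaces the earlier one.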
def createDictionary(filename):
    dic = {}
    # the with block closes the file even if an error occurs
    with open(filename, 'r') as f:
        for line in f:
            # line = line.strip()
            data = line.split(',')
            print(data)
            dic[data[0]] = data[1]
    return dic

if __name__ == "__main__":
    x = createDictionary("textToEnglish.csv")
    print(type(x))  # <class 'dict'>
    print(len(x))   # 4255
    for k, v in x.items():
        print(k, v)
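Try not to print the whole dictionary at once, especially with this many values, since that adds memory overhead. Instead, iterate over the keys and values with a for loop, as shown in the code.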
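The overwrite behaviour described in this answer can be seen in a minimal sketch. It uses the standard `csv` module rather than a manual `split(',')`; the helper name `build_dictionary` and the sample rows are made up for illustration and do not come from the original file.

```python
import csv
import io


def build_dictionary(lines):
    """Map column 0 to column 1; a repeated key keeps only its last value."""
    result = {}
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        result[row[0].strip()] = row[1].strip()
    return result


# Hypothetical sample data: the key "g2g" appears twice.
sample = io.StringIO("g2g,got to go\ng2k,good to know\ng2g,got to go now\n")
mapping = build_dictionary(sample)

print(len(mapping))    # 2 entries from 3 rows
print(mapping["g2g"])  # got to go now
```

If every value for a repeated key needs to be kept, one option is to store a list per key (for example with `collections.defaultdict(list)`) instead of a single string.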

User
Answer 2 · edited 2024-04-20 03:08:12

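There is nothing wrong with the other solutions offered, but you can simplify and greatly improve your solution by using one of Python's excellent libraries.
Pandas is a library for working with data in Python, and many data scientists favor it.
Pandas has a simplified CSV interface for reading and parsing files. It can be used to return a list of dictionaries, each holding one row of the file. The keys are the column names, and the values are the contents of each cell.
In your case: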
import pandas

def createDictionary(filename):
    # DataFrame.from_csv was removed in pandas 1.0; read_csv replaces it
    my_data = pandas.read_csv(filename, sep=',', index_col=False)
    # transpose so each row becomes one dict keyed by column name
    list_of_dicts = [item for item in my_data.T.to_dict().values()]
    return list_of_dicts

if __name__ == "__main__":
    x = createDictionary("textToEnglish.csv")
    print(type(x))     # <class 'list'>
    print(len(x))      # 4255
    print(type(x[0]))  # <class 'dict'>
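For comparison, the same list-of-dictionaries shape can be sketched with the standard library alone. This is an alternative to pandas, not what the answer itself uses. The column names `abbreviation` and `meaning` and the sample rows are assumptions, since the file's header is not shown in the question.

```python
import csv
import io


def rows_as_dicts(lines, fieldnames):
    """Return one dict per CSV row, keyed by the supplied column names."""
    return [dict(row) for row in csv.DictReader(lines, fieldnames=fieldnames)]


# Hypothetical two-column sample without a header row.
sample = io.StringIO("g2g,got to go\ng2k,good to know\n")
records = rows_as_dicts(sample, ["abbreviation", "meaning"])

print(type(records).__name__)  # list
print(records[0])  # {'abbreviation': 'g2g', 'meaning': 'got to go'}
```

Unlike the single dictionary in the first answer, this shape keeps every row, so duplicate values in the first column are not lost.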
