读入csv文件 - 问答 - Python中文网

读入csv文件

2024-04-19 18:09:29 发布

您现在位置：Python中文网/ 问答频道 /正文

男 | 程序猿一只，喜欢编程写python代码。

我目前正在读取一个大的csv文件（大约1亿行），使用https://docs.python.org/2/library/csv.html中描述的命令行，例如：

import csv
with open('eggs.csv', 'rb') as csvfile:
     spamreader = csv.reader(csvfile, delimiter=' ', quotechar='|')
     for row in spamreader:
          process_row(row)

我怀疑这是相当慢的，因为每一行都是单独读入的（需要对硬盘进行大量的读取调用）。有没有办法一次读入整个csv文件，然后对其进行迭代？虽然文件本身的大小很大（例如5Gb），但我的机器有足够的ram将其保存在内存中。你知道吗

Tags：文件 csv csvfile 命令行 https org import docs

3条回答

网友

1楼 · 编辑于 2024-04-19 18:09:29

my machine has sufficient ram to hold that in memory.

那么，在迭代器上调用list：

spamreader = list(csv.reader(csvfile, delimiter=' ', quotechar='|'))

网友

2楼 · 编辑于 2024-04-19 18:09:29

import pandas as pd
df =pd.DataFrame.from_csv('filename.csv')

这将把它作为一个熊猫数据帧读入，这样你就可以用它做各种有趣的事情

网友

3楼 · 编辑于 2024-04-19 18:09:29

是的，有一种方法可以一次读取整个文件：

with open('eggs.csv', 'rb', 5000000000) as ...:
    ...

引用：https://docs.python.org/2/library/functions.html#open

相关问题更多 >

编程相关推荐

热门问题

热门文章