Pandas to_sql（）性能为什么这么慢？

docker run --rm -it --link=memsql:memsql memsql/quickstart simple-benchmark Creating database simple_benchmark Warming up workload Launching 10 workers Workload will take approximately 30 seconds. Stopping workload 42985000 rows inserted using 10 threads 1432833.3 rows per second

1条回答

网友

1楼 · 发布于 2024-05-16 00:28:13

如果有人遇到类似情况：

我删除了SQlalchemy，并对Pandas的to_sql()函数使用了（不推荐使用的）MySQL风格。加速比超过120%。我不建议使用这个，但它目前对我有效。在

import MySQLdb

import mysql.connector
from sqlalchemy import create_engine
from pandas.util.testing import test_parallel

engine = MySQLdb.connect("127.0.0.1","root","","netflow_test")

# engine = create_engine('mysql+mysqlconnector://root@localhost:3306/netflow_test', echo=False)

# @test_parallel(num_threads=8)
def commit_flows(netflow_df2):
    % time netflow_df2.to_sql(name='netflow_ids', flavor='mysql', con=engine, if_exists = 'append', index=False, chunksize=50000)
commit_flows(netflow_df2)

{MySQL如何在MySQL中找到类似的查询mysql.conf版)我会更快的。我应该可以在这里每秒超过50000行。在

^{pr2}$

126s之前。38.2秒。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

Pandas to_sql（）性能为什么这么慢？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >