Python3.6连接到大型数据帧的MS SQL Server

import pandas as pd from sqlalchemy import create_engine import numpy as np df = pd.DataFrame(np.random.randint(0,100,size=(100, 1)), columns=list('test_col')) address = 'mssql+pyodbc://uid:pw@server/path/database?driver=SQL Server' engine = create_engine(address) connection = engine.raw_connection() cursor = connection.cursor() # Attempt 1 <- This failed to even create a table at the cursor_execute statement so my issues could be way in the beginning here but I know that I have a connection to the SQL Server because I can use pd.to_sql() to create tables successfully (just incredibly slowly for my tables of interest) create_statement = """ DROP TABLE test_table CREATE TABLE test_table (test_col) """ cursor.execute(create_statement) test_insert = ''' INSERT INTO test_table (test_col) values ('abs'); ''' cursor.execute(test_insert) Attempt 2 <- From iabdb WordPress blog I came across def chunker(seq, size): return (seq[pos:pos + size] for pos in range(0, len(seq), size)) records = [str(tuple(x)) for x in take_rates.values] insert_ = """ INSERT INTO test_table ("A") VALUES """ for batch in chunker(records, 2): # This would be set to 1000 in practice I hope print(batch) rows = str(batch).strip('[]') print(rows) insert_rows = insert_ + rows print(insert_rows) cursor.execute(insert_rows) #conn.commit() # don't know when I would need to commit conn.close() # Attempt 3 # From a related Stack Exchange Post create the table but first drop if it already exists command = """DROP TABLE IF EXISTS test_table CREATE TABLE test_table # these columns are from my real dataset "Serial Number" serial primary key, "Dealer Code" text, "FSHIP_DT" timestamp without time zone, ;""" cursor.execute(command) connection.commit() # stream the data using 'to_csv' and StringIO(); then use sql's 'copy_from' function output = io.StringIO() # ignore the index take_rates.to_csv(output, sep='~', header=False, index=False) # jump to start of stream output.seek(0) contents = output.getvalue() cur = connection.cursor() # null values become '' cur.copy_from(output, 'Config_Take_Rates_TEST', null="") connection.commit() cur.close()

2条回答

网友

1楼 · 编辑于 2024-04-20 04:12:22

“DROP TABLE IF EXISTS test_TABLE”看起来像是无效的tsql语法。你可以这样做：

if (object_id('test_table') is not null) 
DROP TABLE test_table

网友

2楼 · 编辑于 2024-04-20 04:12:22

如果只需要替换现有表，请截断它并使用bcp实用程序上载该表。快多了。在

from subprocess import call

command = "TRUNCATE TABLE test_table"
take_rates.to_csv('take_rates.csv', sep='\t', index=False)
call('bcp {t} in {f} -S {s} -U {u} -P {p} -d {db} -c -t "{sep}" -r "{nl}" -e {e}'.format(t='test_table', f='take_rates.csv', s=server, u=user, p=password, db=database, sep='\t', nl='\n')

您需要安装bcp实用程序（yum install mssql tools on CentOS/RedHat）。在

相关问题更多 >

编程相关推荐

热门问题

热门文章