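<p>This is a very basic programming question and has nothing to do with Snowflake, so you should do more research before asking. That said, here is a simple answer:</p>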
<pre><code>#!/usr/bin/env python
# Connect to Snowflake
# See https://docs.snowflake.net/manuals/user-guide/python-connector-example.html#connecting-
...
con =
...
# Scan the file to build the table/column info
tables = {}
with open("zz.csv") as f:
    for line in f:
        if not line.strip():
            continue  # skip blank lines
        # Each line is expected to hold "table_name,column_name"
        tname, cname = line.split(",")
        tables.setdefault(tname.strip(), []).append(cname.strip())
# Run a duplicate-check query for each table
for tname, clist in tables.items():
    query = """select {columns} from {table}
    group by {columns}
    having count(*) > 1
    """.format(columns=",".join(clist), table=tname)
    print("Running query: {0}".format(query))
    cur = con.cursor()
    cur.execute(query)
    for rec in cur:
        print("DUPLICATED RECORD: {0}".format(rec))
</code></pre>
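<p>If you want to check the generated SQL before connecting to anything, the parsing and query-building steps can be pulled out into plain functions. The following is only a minimal sketch: the helper names <code>load_table_columns</code> and <code>build_duplicate_query</code> and the sample table and column names are made up for illustration, and it assumes each input line has the form <code>table,column</code>.</p>

```python
import csv
import io


def load_table_columns(csv_text):
    """Group column names by table from 'table,column' lines (hypothetical helper)."""
    tables = {}
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) < 2:
            continue  # skip blank or malformed lines
        tname, cname = row[0].strip(), row[1].strip()
        tables.setdefault(tname, []).append(cname)
    return tables


def build_duplicate_query(tname, clist):
    """Build a query that returns column-value combinations occurring more than once."""
    columns = ", ".join(clist)
    return "select {c} from {t} group by {c} having count(*) > 1".format(
        c=columns, t=tname
    )


# Sample input; the table and column names here are assumptions for illustration.
sample = "orders,id\norders,customer_id\nusers,email\n"
queries = {
    tname: build_duplicate_query(tname, clist)
    for tname, clist in load_table_columns(sample).items()
}
for query in queries.values():
    print(query)
```

<p>Each resulting string can then be passed to <code>cur.execute(query)</code> as in the script above. Note that the names are interpolated directly into the SQL, so this is only reasonable when the CSV file comes from a trusted source.</p>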