擅长:python、mysql、java
<p>我想你可以这样做。下面是一个简单的示例,说明您正在尝试执行的操作:</p>
<pre><code>import pandas as pd
import numpy as np
# sample data frame
df = pd.DataFrame({'col1': ['a','b'], 'col2':['b','a'], 'col3':[1,1]})
col1 col2 col3
0 a b 1
1 b a 1
# take first two columns from where to remove duplicates
df2 = df.iloc[:,:2]
# sort the columns based on their corresponding values and create a new df
df3 = pd.DataFrame(np.sort(df2.values), axis=1), df2.index, df2.columns)
# finally drop duplicates
result = pd.concat([df3, df['col3']], axis=1).drop_duplicates(subset=['col1','col2'])
col1 col2 col3
0 a b 1
</code></pre>