如何迭代一个数据帧中唯一行的列值，该数据帧具有排序的数值索引，并且在pandas中有重复项？

网友

1楼 · 编辑于 2024-04-25 02:01:27

首先按掩码删除重复的索引并按arange指定位置，然后用iloc选择：

arr = np.arange(len(df.index))
a = arr[~df.index.duplicated()]
print (a)
[0 2]

for i in a:
    cell_value = df['a'].iloc[i]
    print(type(cell_value))

<class 'numpy.int64'>
<class 'numpy.int64'>

无循环解决方案-将^{}与^{}一起使用，并使用~反转掩码：

^{pr2}$

网友

2楼 · 编辑于 2024-04-25 02:01:27

如果按照您的评论，相同的索引意味着相同的数据，这看起来是一个XY Problem。在

你也不需要一个循环。在

假设您想删除重复的行并只提取第一列（即3，5），下面的内容就足够了。在

res = df.drop_duplicates().loc[:, 'a']

# 1    3
# 2    5
# Name: a, dtype: int64

要返回类型：

^{pr2}$

网友

3楼 · 编辑于 2024-04-25 02:01:27

尝试np.unique：

_, i = np.unique(df.index, return_index=True)
df.iloc[i, df.columns.get_loc('a')].tolist() 

[3, 5]

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何迭代一个数据帧中唯一行的列值，该数据帧具有排序的数值索引，并且在pandas中有重复项？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >