Why doesn't testing "NaN == NaN" work for dropping rows from a pandas DataFrame?

comments_values = marked_results.comments.unique()
array(['VP', 'TEST', nan], dtype=object)

# Ah, gotcha! So now I've tried:
marked_results.comments == comments_values[2]
# but still all the results are False!!!

2 Answers

User

Answer 1 · edited 2024-06-16 14:36:30

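You need to test for NaN with the math.isnan() function (or numpy.isnan). You cannot check for NaN with the equality operator, because NaN compares unequal to every value, including itself.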

>>> from math import isnan
>>> a = float('NaN')
>>> a
nan
>>> a == 'NaN'
False
>>> isnan(a)
True
>>> a == float('NaN')
False

From the function's help:

isnan(...)
    isnan(x) -> bool

    Check if float x is not a number (NaN).
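As a minimal sketch of the point above, using only the standard library: the list `values` mirrors the array from the question, and the helper `is_missing` is a hypothetical name introduced here for illustration. Note that math.isnan() raises a TypeError on strings, so the helper checks the type first.

```python
import math

nan = float("nan")

# NaN never compares equal to anything, including itself.
print(nan == nan)
print(math.isnan(nan))

# Values shaped like the question's array: two strings and one NaN.
values = ["VP", "TEST", nan]


def is_missing(value):
    """Return True only for a float NaN (hypothetical helper).

    math.isnan() raises TypeError for strings, so check the type first.
    """
    return isinstance(value, float) and math.isnan(value)


# Keep everything that is not NaN.
kept = [v for v in values if not is_missing(v)]
print(kept)
```

The type check is what makes this safe on mixed data like an object column, where strings and NaN sit side by side.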

User

Answer 2 · edited 2024-06-16 14:36:30

You should use isnull and notnull to test for NaN (with pandas dtypes these are more robust than the numpy functions); see "values considered missing" in the docs.
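A short sketch of how isnull and notnull behave on a column like the one in the question. The frame here is made-up sample data, and the variable names `eq_mask`, `null_mask` and `kept` are assumptions for illustration only.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data mirroring the question's 'comments' column.
df = pd.DataFrame({"comments": ["VP", "VP", "VP", "TEST", np.nan, np.nan]})

# Comparing with == never matches NaN, so this mask is False everywhere.
eq_mask = df["comments"] == np.nan
print(eq_mask.any())

# isnull() flags the missing entries, even though the column holds strings.
null_mask = df["comments"].isnull()
print(null_mask.tolist())

# notnull() is the inverse; use it as a boolean mask to keep the present rows.
kept = df[df["comments"].notnull()]
print(kept["comments"].tolist())
```

Filtering with the notnull() mask returns a new frame and leaves `df` itself unchanged.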

Using the Series method dropna on the column won't affect the original DataFrame, but it does what you want:

In [11]: df
Out[11]:
  comments
0       VP
1       VP
2       VP
3     TEST
4      NaN
5      NaN

In [12]: df.comments.dropna()
Out[12]:
0      VP
1      VP
2      VP
3    TEST
Name: comments, dtype: object

The DataFrame method dropna has a subset argument (to drop rows which have NaN in specific columns):

In [13]: df.dropna(subset=['comments'])
Out[13]:
  comments
0       VP
1       VP
2       VP
3     TEST

In [14]: df = df.dropna(subset=['comments'])
