Ignoring non-numeric string values in a pandas DataFrame

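User

Reply #1 · edited 2024-05-15 22:36:39

Pandas has a few tools for converting columns like this, but they may not do exactly what you need. pd.to_numeric converts mixed columns like yours, but turns non-numeric strings into NaN. This means you end up with a float column rather than an integer one, because NaN is a float value and a plain integer column cannot hold it. That usually doesn't matter much, but it's worth being aware of.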

import pandas as pd

df = pd.DataFrame({'mixed_types': [12331, '345', 'text']})

pd.to_numeric(df['mixed_types'], errors='coerce')
Out[7]: 
0    12331.0
1      345.0
2        NaN
Name: mixed_types, dtype: float64
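
If the float result is a problem, one option (not part of the original answer, so treat it as a sketch) is to cast the coerced column to pandas' nullable integer dtype, 'Int64'. This assumes a pandas version that supports nullable integers and that every numeric value in the column is a whole number; the name as_int is just illustrative.

```python
import pandas as pd

df = pd.DataFrame({'mixed_types': [12331, '345', 'text']})

# Coerce to numbers first (float64 with NaN), then cast to the nullable
# integer dtype so the missing entry is stored as <NA> instead of forcing floats.
as_int = pd.to_numeric(df['mixed_types'], errors='coerce').astype('Int64')
print(as_int)
```

The cast raises an error if any value has a fractional part, so it only suits columns that are meant to be integers.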

If you want to drop all the rows that became NaN:

# Replace the column with the converted values
df['mixed_types'] = pd.to_numeric(df['mixed_types'], errors='coerce')

# Drop NA values, listing the converted columns explicitly
#   so NA values in other columns aren't dropped
df.dropna(subset = ['mixed_types'])
Out[11]: 
   mixed_types
0      12331.0
1        345.0
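
If you would rather leave the original column untouched, a variation on the same idea is to use the coerced result only as a row filter. This is a minimal sketch; the extra label column and the names is_numeric and numeric_rows are made up for illustration.

```python
import pandas as pd

# 'label' is a hypothetical second column, added only to show that
# the other columns are carried along unchanged.
df = pd.DataFrame({'mixed_types': [12331, '345', 'text'],
                   'label': ['a', 'b', 'c']})

# True where the value can be parsed as a number
is_numeric = pd.to_numeric(df['mixed_types'], errors='coerce').notna()

# Keep only those rows; the stored values are not converted
numeric_rows = df[is_numeric]
print(numeric_rows)
```

Here the string '345' stays a string in numeric_rows, since only the mask was computed from the converted values.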

User

Reply #2 · edited 2024-05-15 22:36:39

You can use df._get_numeric_data() directly.
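
The leading underscore marks _get_numeric_data as a private method, so it may be safer to rely on the public select_dtypes instead. The sketch below assumes a small made-up frame (the column names ints and floats are illustrative). It also shows that this kind of selection works on whole columns by dtype: a mixed object column is left out entirely rather than filtered value by value.

```python
import pandas as pd

df = pd.DataFrame({'ints': [1, 2, 3],
                   'floats': [0.5, 1.5, 2.5],
                   'mixed_types': [12331, '345', 'text']})

# Keep only the columns whose dtype is numeric; 'mixed_types' has
# dtype object, so the whole column is excluded.
numeric_only = df.select_dtypes(include='number')
print(numeric_only.columns.tolist())
```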

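User

Reply #3 · edited 2024-05-15 22:36:39

You can use pd.to_numeric with errors='coerce' to replace non-numeric values with NaN, and apply it to every column. After that you can use dropna or fillna, whichever you prefer.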

import pandas as pd

df = pd.read_csv('file.csv')
df = df.apply(pd.to_numeric, errors='coerce')
df = df.dropna()
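
To see the difference between the two follow-up options without a CSV file, here is a small sketch on an in-memory frame. The data and the names converted, dropped and filled are assumptions for illustration, and 0 is an arbitrary fill value.

```python
import pandas as pd

# Hypothetical stand-in for the contents of file.csv
df = pd.DataFrame({'a': [1, '2', 'x'],
                   'b': ['4.5', 'oops', 6]})

# Convert every column; anything that cannot be parsed becomes NaN
converted = df.apply(pd.to_numeric, errors='coerce')

dropped = converted.dropna()   # keep only rows that are numeric in every column
filled = converted.fillna(0)   # or keep every row and substitute a default
print(dropped)
print(filled)
```

Note that dropna() with no arguments removes a row if any column is NaN, so a single bad cell discards the whole row; pass subset=[...] to limit the check to specific columns.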
