Changing code to work with NumPy arrays instead of a DataFrame

    import numpy as np
    import pandas as pd

    df = pd.DataFrame(data=np.array([[np.nan, 2, 3], [4, 5, 6], [7, 8, 9]]),
                      columns=["col1", "col2", "col3"])
    list_of_NA_features = ["col1"]

    for feature in list_of_NA_features:
        for index, row in df.iterrows():
            if pd.isnull(row[feature]):
                missing_value = 5  # for simplicity, let's put 5 instead of a function
                # .loc replaces the removed .ix indexer
                df.loc[index, feature] = missing_value

1 Answer

User
#1 · Posted on 2024-04-19 00:24:46

Setup:

    import numpy as np

    # Set up a NumPy array with the same contents as your DataFrame
    a = np.array([[np.nan, 2., 3.],
                  [4., 5., 6.],
                  [7., 8., 9.]])

    # list_of_NA_features now contains the column index in the NumPy array
    list_of_NA_features = [0]

Solution:

    # Now you can see how those operations can be carried out on a NumPy array.
    # I'm just saying you can do this on a NumPy array in the way you did it on
    # a DataFrame. I'm not saying this is the best way of doing what you are
    # trying to do.
    for feature in list_of_NA_features:
        for index, row in enumerate(a):
            if np.isnan(row[feature]):
                missing_value = 5
                a[index, feature] = missing_value

    Out[167]:
    array([[ 5.,  2.,  3.],
           [ 4.,  5.,  6.],
           [ 7.,  8.,  9.]])
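As a possible alternative to the row-by-row loop, the same replacement can probably be done with a boolean mask. This is only a minimal sketch: it assumes every listed column gets the same constant fill value (5, as in the question) rather than a value computed per row, and the name `column` is introduced here purely for illustration.

```python
import numpy as np

# Same array and column list as in the setup above
a = np.array([[np.nan, 2., 3.],
              [4., 5., 6.],
              [7., 8., 9.]])
list_of_NA_features = [0]
missing_value = 5  # assumption: one constant fill value for all columns

for feature in list_of_NA_features:
    column = a[:, feature]                    # basic slice: a view into `a`
    column[np.isnan(column)] = missing_value  # writes through to `a`

print(a)
```

Because `a[:, feature]` is a view, assigning through the mask modifies `a` in place, so no explicit loop over rows is needed. If the fill value has to depend on the row, the explicit loop may still be the simpler choice.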
