将过采样数据集保存为pandas中的csv文件

2024-04-28 20:49:33 发布

您现在位置：Python中文网/ 问答频道 /正文

1261

网友

男 | 程序猿一只，喜欢编程写python代码。

我是Python新手，如果太简单，请提前道歉。找不到任何内容，并且this question没有帮助

我的代码是

# Split data
y = starbucks_smote.iloc[:, -1]
X = starbucks_smote.drop('label', axis = 1)

# Count labels by type
counter = Counter(y)
print(counter)
Counter({0: 9634, 1: 2895})

# Transform the dataset
oversample = SMOTE()
X, y = oversample.fit_resample(X, y)

# Print the oversampled dataset
counter = Counter(y)
print(counter)
Counter({0: 9634, 1: 9634})

如何保存过采样数据集以备将来使用

我试过了

data_res = np.concatenate((X, y), axis = 1)
data_res.to_csv('sample_smote.csv')

出错了

ValueError: all the input arrays must have same number of dimensions, 
but the array at index 0 has 2 dimension(s) and the array at index 1 has 1 dimension(s)

谢谢你的建议

Tags： csv the data index counter res array dataset

1条回答

网友

1楼 · 发布于 2024-04-28 20:49:33

您可以创建数据帧：

data_res = pd.DataFrame(X)
data_res['y'] = y

然后将data_res保存到CSV

基于连接od numpy.arrays的解决方案也是可能的，但需要^{}使维度兼容：

data_res = np.concatenate((X, np.vstack(y)), axis = 1)
data_res = pd.DataFrame(data_res)

将过采样数据集保存为pandas中的csv文件

相关问题更多 >

编程相关推荐

热门问题

热门文章

将过采样数据集保存为pandas中的csv文件

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >