It is unclear how the function "GridSearchCV" splits the data into training and test sets

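from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import make_scorer
from sklearn.model_selection import GridSearchCV, train_test_split

# T_scorer, parameter_grid, X and y are defined elsewhere (not shown in the question)
_scorer = make_scorer(T_scorer)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)
clf = RandomForestClassifier()
grid_searcher = GridSearchCV(clf, parameter_grid, verbose=20, scoring=_scorer)
grid_searcher.fit(X_test, y_test)  # as posted: the search is fitted on the test split
clf_best = grid_searcher.best_estimator_
print('Best params = ', clf_best.get_params())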

1 answer

User

#1 · Posted 2024-04-29 15:58:47

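By default, GridSearchCV runs k-fold cross-validation on the data passed to fit. Older scikit-learn releases used 3 folds by default (since version 0.22 the default is 5). With 3 folds, the data is split into 3 equal parts (1, 2, 3) and the search runs as follows: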

Train on 1, 2 > test on 3
Train on 2, 3 > test on 1
Train on 1, 3 > test on 2
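The rotation above can be reproduced with a small sketch. It uses plain KFold on six toy samples purely for illustration; X_demo is a made-up array, and for a classifier GridSearchCV's default splitter is a stratified variant rather than plain KFold. The order in which the folds are visited may differ from the list above, but each part is used as the test fold exactly once.

```python
import numpy as np
from sklearn.model_selection import KFold

# Six toy samples with two features each (hypothetical data, for illustration only)
X_demo = np.arange(12).reshape(6, 2)

# Three folds, no shuffling: each fold is a contiguous block of two samples
kfold = KFold(n_splits=3)

splits = [
    (train_idx.tolist(), test_idx.tolist())
    for train_idx, test_idx in kfold.split(X_demo)
]

for train_idx, test_idx in splits:
    print("train on", train_idx, "> test on", test_idx)
```

Every sample index appears in exactly one test fold, which is what lets the grid search score each parameter combination on data it was not trained on.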

You do not need train_test_split for the search itself: just pass X_train, y_train to GridSearchCV and let it do the splitting.
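A minimal sketch of that workflow, under stated assumptions: the data comes from make_classification, the parameter_grid values are invented for the example, and the default accuracy score stands in for the question's custom T_scorer, which is not shown. The search is fitted on the training split only, and the held-out split is used once at the end.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data standing in for the question's X and y (assumption)
X, y = make_classification(n_samples=200, n_features=8, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Hypothetical grid; the question's parameter_grid is not shown
parameter_grid = {"n_estimators": [10, 20], "max_depth": [2, 4]}

clf = RandomForestClassifier(random_state=42)
grid_searcher = GridSearchCV(clf, parameter_grid, cv=3)

# Cross-validation happens inside fit, on the training split only
grid_searcher.fit(X_train, y_train)
print("Best params =", grid_searcher.best_params_)

# The held-out split is touched once, to score the refitted best estimator
test_accuracy = grid_searcher.score(X_test, y_test)
print("Held-out accuracy =", test_accuracy)
```

Keeping X_test out of fit means the final score is not influenced by the parameter selection.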

You can also look at the "cv" section of the docs: http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.GridSearchCV.html

Edit: here is the final code from the comments:

from sklearn.model_selection import GridSearchCV, StratifiedKFold

grid_searcher = GridSearchCV(clf, param_grid=parameter_grid, cv=StratifiedKFold(shuffle=True, random_state=42))
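To see what passing a StratifiedKFold splitter changes, here is a small sketch on made-up, imbalanced labels (y_demo and X_demo are hypothetical, and n_splits=4 is chosen only so the toy data divides evenly). It counts the classes in each test fold; stratification keeps the class ratio of the full data in every fold, and shuffle=True with a fixed random_state makes the assignment random but reproducible.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# Imbalanced toy labels: eight samples of class 0, four of class 1 (hypothetical)
y_demo = np.array([0] * 8 + [1] * 4)
X_demo = np.zeros((len(y_demo), 1))  # feature values do not affect the split

cv = StratifiedKFold(n_splits=4, shuffle=True, random_state=42)

# Per-fold class counts in the test part: [count of class 0, count of class 1]
fold_counts = [
    np.bincount(y_demo[test_idx], minlength=2).tolist()
    for _, test_idx in cv.split(X_demo, y_demo)
]
print(fold_counts)
```

The same splitter object can be handed to GridSearchCV through its cv argument, as in the line above.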
