使用Orange python库进行交叉验证

import Orange, orange, orngDisc, orngTest, orngStat, orngFSS data = Orange.data.Table("test.tab") # has numeric, discrete features nb = Orange.classification.bayes.NaiveLearner() dBayes = orngDisc.DiscretizedLearner(nb, method=Orange.feature.discretization.Entropy(), name="disc nb") # feature selection (three important features based on information gain) fss = orngFSS.FilterBestN(n=3, measure=Orange.feature.scoring.InfoGain()) fBayes = orngFSS.FilteredLearner(dBayes, filter=fss, name="nb & fss") learners = [nb, dBayes, fBayes] results = orngTest.crossValidation(learners, data, folds=10, storeClassifiers=1, storeExamples=1) # print accuracy for the three models (no errors in this block!) print "\nLearner Accuracy #Atts" for i in range(len(learners)): print "%-15s %5.3f %5.2f" % (learners[i].name, orngStat.CA(results)[i], natt[i])

1条回答

网友

1楼 · 发布于 2024-05-14 07:26:55

恐怕您不能使用orngFSS.FilterBestN(n=3, measure=Orange.feature.scoring.InfoGain())，因为有些功能是连续的。方法”feature.scoring.InfoGain“将检查特征是否离散，引用here。在

我有两个建议：

使用分类树作为学习方法，选择树中的前三个特征。如果特征是连续的，分类树将使用“a>；0.1”这样的判别式使特征离散。在
手动使特征离散。例如，如果年龄是一个特征，那么将其标记为“D”，橙色将认为该特征是离散的。我想会有用的

相关问题更多 >

编程相关推荐

热门问题

热门文章