如何在朴素贝叶斯中获得特征重要性？

optimal_alpha = 1 NB_optimal = BernoulliNB(alpha=optimal_aplha) # fitting the model NB_optimal.fit(X_tr, y_tr) # predict the response pred = NB_optimal.predict(X_test) # evaluate accuracy acc = accuracy_score(y_test, pred) * 100 print('\nThe accuracy of the NB classifier for k = %d is %f%%' % (optimal_aplha, acc))

2条回答

网友

1楼 · 编辑于 2024-05-13 08:08:03

通过使用coefs_或feature_log_prob_属性，可以从fit模型中获取每个单词的重要信息。例如

neg_class_prob_sorted = NB_optimal.feature_log_prob_[0, :].argsort()
pos_class_prob_sorted = NB_optimal.feature_log_prob_[1, :].argsort()

print(np.take(count_vect.get_feature_names(), neg_class_prob_sorted[:10]))
print(np.take(count_vect.get_feature_names(), pos_class_prob_sorted[:10]))

为每门课打印出十个最具预测性的单词。

网友

2楼 · 编辑于 2024-05-13 08:08:03

试试这个：

pred_proba = NB_optimal.predict_proba(X_test)
words = np.take(count_vect.get_feature_names(), pred_proba.argmax(axis=1))

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在朴素贝叶斯中获得特征重要性？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >