How to compute precision, recall, accuracy and f1-score for the multiclass case with scikit learn?



I'm working on a sentiment analysis problem. The data looks like this:
<pre><code>label instances
    5      1190
    4       838
    3       239
    1       204
    2       127
</code></pre>
So my data is unbalanced, since 1190 <code>instances</code> are labeled with <code>5</code>. For the classification I'm using scikit's <a href="http://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html" rel="noreferrer">SVC</a>. The problem is that I do not know how to balance my data in the right way in order to accurately compute the precision, recall, accuracy and f1-score for the multiclass case. So I tried the following approaches:

First:
<pre><code>wclf = SVC(kernel='linear', C= 1, class_weight={1: 10})
wclf.fit(X, y)
weighted_prediction = wclf.predict(X_test)

print 'Accuracy:', accuracy_score(y_test, weighted_prediction)
print 'F1 score:', f1_score(y_test, weighted_prediction,average='weighted')
print 'Recall:', recall_score(y_test, weighted_prediction, average='weighted')
print 'Precision:', precision_score(y_test, weighted_prediction, average='weighted')
print '\n clasification report:\n', classification_report(y_test, weighted_prediction)
print '\n confussion matrix:\n',confusion_matrix(y_test, weighted_prediction)
</code></pre>
Second:
<pre><code>auto_wclf = SVC(kernel='linear', C= 1, class_weight='auto')
auto_wclf.fit(X, y)
auto_weighted_prediction = auto_wclf.predict(X_test)

print 'Accuracy:', accuracy_score(y_test, auto_weighted_prediction)
print 'F1 score:', f1_score(y_test, auto_weighted_prediction, average='weighted')
print 'Recall:', recall_score(y_test, auto_weighted_prediction, average='weighted')
print 'Precision:', precision_score(y_test, auto_weighted_prediction, average='weighted')
print '\n clasification report:\n', classification_report(y_test,auto_weighted_prediction)
print '\n confussion matrix:\n',confusion_matrix(y_test, auto_weighted_prediction)
</code></pre>
Third:
<pre><code>clf = SVC(kernel='linear', C= 1)
clf.fit(X, y)
prediction = clf.predict(X_test)

from sklearn.metrics import precision_score, \
    recall_score, confusion_matrix, classification_report, \
    accuracy_score, f1_score

print 'Accuracy:', accuracy_score(y_test, prediction)
print 'F1 score:', f1_score(y_test, prediction)
print 'Recall:', recall_score(y_test, prediction)
print 'Precision:', precision_score(y_test, prediction)
print '\n clasification report:\n', classification_report(y_test,prediction)
print '\n confussion matrix:\n',confusion_matrix(y_test, prediction)


F1 score:/usr/local/lib/python2.7/site-packages/sklearn/metrics/classification.py:676: DeprecationWarning: The default `weighted` averaging is deprecated, and from version 0.18, use of precision, recall or F-score with multiclass or multilabel data or pos_label=None will result in an exception. Please set an explicit value for `average`, one of (None, 'micro', 'macro', 'weighted', 'samples'). In cross validation use, for instance, scoring="f1_weighted" instead of scoring="f1".
  sample_weight=sample_weight)
/usr/local/lib/python2.7/site-packages/sklearn/metrics/classification.py:1172: DeprecationWarning: The default `weighted` averaging is deprecated, and from version 0.18, use of precision, recall or F-score with multiclass or multilabel data or pos_label=None will result in an exception. Please set an explicit value for `average`, one of (None, 'micro', 'macro', 'weighted', 'samples'). In cross validation use, for instance, scoring="f1_weighted" instead of scoring="f1".
  sample_weight=sample_weight)
/usr/local/lib/python2.7/site-packages/sklearn/metrics/classification.py:1082: DeprecationWarning: The default `weighted` averaging is deprecated, and from version 0.18, use of precision, recall or F-score with multiclass or multilabel data or pos_label=None will result in an exception. Please set an explicit value for `average`, one of (None, 'micro', 'macro', 'weighted', 'samples'). In cross validation use, for instance, scoring="f1_weighted" instead of scoring="f1".
  sample_weight=sample_weight)
 0.930416613529
</code></pre>
However, I'm getting warnings like this:
<pre><code>/usr/local/lib/python2.7/site-packages/sklearn/metrics/classification.py:1172: DeprecationWarning: The default `weighted` averaging is deprecated, and from version 0.18, use of precision, recall or F-score with multiclass or multilabel data or pos_label=None will result in an exception. Please set an explicit value for `average`, one of (None, 'micro', 'macro', 'weighted', 'samples'). In cross validation use, for instance, scoring="f1_weighted" instead of scoring="f1"
</code></pre>
How can I deal correctly with my unbalanced data in order to compute the classifier's metrics in the right way?



1 answer

Anonymous, 1 day ago


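I think there is a lot of confusion about which weights are used for what. I am not sure I know precisely what bothers you, so I am going to cover several topics, bear with me ;).

<h2>Class weights</h2>
The weights from the <code>class_weight</code> parameter are used to train the classifier. They are not used in the calculation of any of the metrics you are using: with different class weights, the numbers will be different simply because the classifier is different.

Basically, in every scikit-learn classifier, the class weights are used to tell your model how important a class is. That means that during training, the classifier will make extra efforts to properly classify the classes with high weights.

How they do that is algorithm-specific. If you want details about how it works for SVC and the doc does not make sense to you, feel free to mention it.

<h2>The metrics</h2>
Once you have a classifier, you want to know how well it is performing. Here you can use the metrics you mentioned: <code>accuracy</code>, <code>recall_score</code>, <code>f1_score</code>...

Usually when the class distribution is unbalanced, accuracy is considered a poor choice, as it gives high scores to models that just predict the most frequent class.

I will not detail all these metrics, but note that, with the exception of <code>accuracy</code>, they are naturally applied at the class level: as you can see in this <code>print</code> of a classification report, they are defined for each class. They rely on concepts such as <code>true positives</code> or <code>false negative</code> that require defining which class is the positive one.
<pre><code>             precision    recall  f1-score   support

          0       0.65      1.00      0.79        17
          1       0.57      0.75      0.65        16
          2       0.33      0.06      0.10        17

avg / total       0.52      0.60      0.51        50
</code></pre>
<h2>The warning</h2>
<pre><code>F1 score:/usr/local/lib/python2.7/site-packages/sklearn/metrics/classification.py:676: DeprecationWarning: The default `weighted` averaging is deprecated, and from version 0.18, use of precision, recall or F-score with multiclass or multilabel data or pos_label=None will result in an exception. Please set an explicit value for `average`, one of (None, 'micro', 'macro', 'weighted', 'samples'). In cross validation use, for instance, scoring="f1_weighted" instead of scoring="f1".
</code></pre>
You get this warning because you are using the f1-score, recall and precision without defining how they should be computed! The question could be rephrased: from the above classification report, how do you output one global number for the f1-score? You could:
<ol>
<li>Take the average of the f1-score for each class. This is called macro averaging. The <code>avg / total</code> row above is close to it only because the classes in that report are nearly balanced; scikit-learn computes that row as a support-weighted average (option 3).</li>
<li>Compute the f1-score using the global counts of true positives, false negatives, etc. (you sum the number of true positives and false negatives over all classes). Aka micro averaging.</li>
<li>Compute a weighted average of the f1-score. Using <code>'weighted'</code> in scikit-learn will weigh the f1-score by the support of the class: the more elements a class has, the more important the f1-score for this class is in the computation.</li>
</ol>
These are 3 of the options in scikit-learn; the warning is there to say you have to pick one. So you have to specify an <code>average</code> argument for the score method.

Which one you choose is up to how you want to measure the performance of the classifier: for instance, macro averaging does not take class imbalance into account, so the f1-score of class 1 will be just as important as the f1-score of class 5. If you use weighted averaging, however, class 5 will carry more importance.

The whole argument specification in these metrics is not super clear in scikit-learn right now; it will get better in version 0.18 according to the docs. They are removing some non-obvious standard behavior, and they are issuing warnings so that developers notice it.

<h2>Computing scores</h2>
The last thing I want to mention (feel free to skip it if you're aware of it) is that scores are only meaningful if they are computed on data that the classifier has never seen. This is extremely important, as any score you get on data that was used in fitting the classifier is completely irrelevant.

Here's a way to do it using <code>StratifiedShuffleSplit</code>, which gives you random splits of your data (after shuffling) that preserve the label distribution.
<pre><code>from sklearn.datasets import make_classification
# StratifiedShuffleSplit lived in sklearn.cross_validation before version 0.18.
from sklearn.model_selection import StratifiedShuffleSplit
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score, classification_report, confusion_matrix
from sklearn.svm import SVC

# We use a utility to generate artificial classification data.
X, y = make_classification(n_samples=100, n_informative=10, n_classes=3)

# The classifier was not defined in the original snippet; a linear SVC
# like the one in the question is assumed here.
svc = SVC(kernel='linear', C=1)

sss = StratifiedShuffleSplit(n_splits=1, test_size=0.5, random_state=0)
for train_idx, test_idx in sss.split(X, y):
    X_train, X_test = X[train_idx], X[test_idx]
    y_train, y_test = y[train_idx], y[test_idx]
    svc.fit(X_train, y_train)      # fit on the training half only
    y_pred = svc.predict(X_test)   # score on data the model has not seen
    print(f1_score(y_test, y_pred, average="macro"))
    print(precision_score(y_test, y_pred, average="macro"))
    print(recall_score(y_test, y_pred, average="macro"))
</code></pre>
Hope this helps.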
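To make the three averaging options from the answer concrete, here is a minimal sketch that computes them by hand in plain Python, without scikit-learn. The label lists and the helper names (`per_class_counts`, `f1_from_counts`, `f1_averages`) are made up for illustration and are not part of any library; the toy data is only loosely modeled on the imbalance in the question.

```python
from collections import Counter


def per_class_counts(y_true, y_pred):
    """Return {label: (tp, fp, fn)}, counting each label one-vs-rest."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    counts = {}
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        counts[label] = (tp, fp, fn)
    return counts


def f1_from_counts(tp, fp, fn):
    """F1 = 2*tp / (2*tp + fp + fn); taken as 0.0 if the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2.0 * tp / denom if denom else 0.0


def f1_averages(y_true, y_pred):
    counts = per_class_counts(y_true, y_pred)
    support = Counter(y_true)  # number of true samples per class
    per_class = {label: f1_from_counts(*c) for label, c in counts.items()}

    # Macro: plain mean of the per-class scores, every class counts equally.
    macro = sum(per_class.values()) / len(per_class)

    # Micro: pool tp/fp/fn over all classes, then compute a single F1.
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    micro = f1_from_counts(tp, fp, fn)

    # Weighted: mean of the per-class scores, weighted by class support.
    weighted = sum(per_class[l] * support[l] for l in per_class) / len(y_true)
    return per_class, macro, micro, weighted


# Hypothetical toy labels: class 5 dominates, class 1 is rare and never predicted.
y_true = [5, 5, 5, 5, 5, 5, 4, 4, 4, 1]
y_pred = [5, 5, 5, 5, 5, 4, 4, 4, 5, 5]

per_class, macro, micro, weighted = f1_averages(y_true, y_pred)
print("per class:", per_class)
print("macro: %.3f  micro: %.3f  weighted: %.3f" % (macro, micro, weighted))
```

On this toy data the macro average comes out lowest, because the rare class 1 (f1 of 0) counts as much as the frequent class 5, while the micro average equals the plain accuracy (7 correct out of 10). With scikit-learn installed, `f1_score(y_true, y_pred, average='macro')`, `average='micro'` and `average='weighted'` should give the same three numbers.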


