如何解释python中fitted scikitsurvival模型中.predict（）的输出？

import pandas as pd from sksurv.datasets import load_veterans_lung_cancer from sksurv.linear_model import CoxnetSurvivalAnalysis # load data data_X, data_y = load_veterans_lung_cancer() # one-hot-encode categorical columns in X categorical_cols = ['Celltype', 'Prior_therapy', 'Treatment'] X = data_X.copy() for c in categorical_cols: dummy_matrix = pd.get_dummies(X[c], prefix=c, drop_first=False) X = pd.concat([X, dummy_matrix], axis=1).drop(c, axis=1) # display final X to fit Cox Elastic Net model on del data_X print(X.head(3))

2条回答

网友

1楼 · 编辑于 2024-06-17 08:40:04

我发布了这个问题on github，尽管作者重命名了这个问题。在

我得到了关于predict输出是什么的一些有用的解释，但是仍然不确定如何获得一组预测的生存时间，这正是我真正想要的。以下是github线程的一些有用的解释：

predictions are risk scores on an arbitrary scale, which means you can 
usually only determine the sequence of events, but not their exact time.

-sebp（图书馆作者）

^{pr2}$

-帕沃帕克斯。在

在github线程中有更多的解释，尽管我并不是真的能够理解所有的内容。我需要和predict_survival_function和predict_cumulative_hazard_function一起玩，看看我是否能得到一组关于X中最有可能存活时间的预测，这正是我真正想要的。在

我不接受这个答案，以防其他人有更好的答案。在

网友

2楼 · 编辑于 2024-06-17 08:40:04

使用X输入，可以得到输入数组的求值：

def predict(self, X, alpha=None):
    """The linear predictor of the model.
    Parameters
         
    X : array-like, shape = (n_samples, n_features)
        Test data of which to calculate log-likelihood from
    alpha : float, optional
        Constant that multiplies the penalty terms. If the same alpha was used during training, exact
        coefficients are used, otherwise coefficients are interpolated from the closest alpha values that
        were used during training. If set to ``None``, the last alpha in the solution path is used.
    Returns
       -
    T : array, shape = (n_samples,)
        The predicted decision function
    """
    X = check_array(X)
    coef = self._get_coef(alpha)
    return numpy.dot(X, coef)

定义检查数组来自另一个library。您可以查看coxnet的代码。在

相关问题更多 >

编程相关推荐

热门问题

热门文章