如何解释 predict_proba？

Question

I'm learning Employee Turnover Forecast and I got the result as below from predict_proba If I see the first row, I may interpret as this employee would left the company by 83%.我正在学习员工流动预测，我从 predict_proba 得到如下结果。如果我看到第一行，我可能会解释为该员工将离开公司 83%。 Do I understand this correctly?我理解正确吗？

    Output exceeds the size limit. Open the full output data in a text editor
array([[0.17, 0.83],
       [0.43, 0.57],
       [0.29, 0.71],
       [0.94, 0.06],
       [0.98, 0.02],
       [0.84, 0.16],
       [0.64, 0.36],
       [1.  , 0.  ],
       [0.85, 0.15],
       [0.99, 0.01],
       [0.09, 0.91],
       [0.89, 0.11],
       [0.21, 0.79],
       [0.15, 0.85],
       [0.78, 0.22],
       [0.18, 0.82],
       [0.84, 0.16],
       [0.45, 0.55],
       [0.96, 0.04],
       [0.95, 0.05],
       [0.91, 0.09],
       [0.9 , 0.1 ],
       [1.  , 0.  ],
       [0.91, 0.09],
       [0.74, 0.26],
...
       [0.94, 0.06],
       [0.99, 0.01],
       [0.22, 0.78],
       [0.89, 0.11],
       [0.98, 0.02]])

Answer 1

Model score is a measure of the model certainty of the outcome.模型分数是结果的模型确定性的度量。 However, it's not necessarily the same as probability: it does not mean 83% people with 0.83 score leaving yet.但是，它不一定与概率相同：这并不意味着 83% 的人以 0.83 分离开。 Logistic regression scores are probabilities by design, but for random forest behaviour is implementation defined.逻辑回归分数是设计概率，但对于随机森林行为是实现定义的。 If you seek to integrate your scores into business metrics directly, you'll need to calibrate your model first (using eg sklearn.calibration.CalibratedClassifierCV or isotonic regression).如果您寻求将您的分数直接集成到业务指标中，您需要首先校准您的模型（使用例如sklearn.calibration.CalibratedClassifierCV或等渗回归）。

如何解释 predict_proba？

问题描述

1 个解决方案

解决方案1
1 已采纳 2022-06-23 08:00:21

如何解释 predict_proba？

问题描述

1 个解决方案

解决方案1 1 已采纳 2022-06-23 08:00:21

解决方案1
1 已采纳 2022-06-23 08:00:21