简体   繁体   English

如何解释 predict_proba?

[英]How to interpret predict_proba?

I'm learning Employee Turnover Forecast and I got the result as below from predict_proba If I see the first row, I may interpret as this employee would left the company by 83%.我正在学习员工流动预测,我从 predict_proba 得到如下结果。如果我看到第一行,我可能会解释为该员工将离开公司 83%。 Do I understand this correctly?我理解正确吗?

    Output exceeds the size limit. Open the full output data in a text editor
array([[0.17, 0.83],
       [0.43, 0.57],
       [0.29, 0.71],
       [0.94, 0.06],
       [0.98, 0.02],
       [0.84, 0.16],
       [0.64, 0.36],
       [1.  , 0.  ],
       [0.85, 0.15],
       [0.99, 0.01],
       [0.09, 0.91],
       [0.89, 0.11],
       [0.21, 0.79],
       [0.15, 0.85],
       [0.78, 0.22],
       [0.18, 0.82],
       [0.84, 0.16],
       [0.45, 0.55],
       [0.96, 0.04],
       [0.95, 0.05],
       [0.91, 0.09],
       [0.9 , 0.1 ],
       [1.  , 0.  ],
       [0.91, 0.09],
       [0.74, 0.26],
...
       [0.94, 0.06],
       [0.99, 0.01],
       [0.22, 0.78],
       [0.89, 0.11],
       [0.98, 0.02]])

Model score is a measure of the model certainty of the outcome.模型分数是结果的模型确定性的度量。 However, it's not necessarily the same as probability: it does not mean 83% people with 0.83 score leaving yet.但是,它不一定与概率相同:这并不意味着 83% 的人以 0.83 分离开。 Logistic regression scores are probabilities by design, but for random forest behaviour is implementation defined.逻辑回归分数是设计概率,但对于随机森林行为是实现定义的。 If you seek to integrate your scores into business metrics directly, you'll need to calibrate your model first (using eg sklearn.calibration.CalibratedClassifierCV or isotonic regression).如果您寻求将您的分数直接集成到业务指标中,您需要首先校准您的模型(使用例如sklearn.calibration.CalibratedClassifierCV或等渗回归)。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM