簡體 English 中英

sklearn learning_curve和StandardScaler

[英]sklearn learning_curve and StandardScaler

原文 2018-09-12 19:22:47 5 1 python/ scikit-learn

我想知道sklearn.model_selection learning_curve是否可以使用或確實使用sklearn.preprocessing StandardScaler。 我已經研究了實現，但是我的技能水平還不足以得出結論。 所有使用learning_curve的教程都將整個數據集傳遞給learning_curve，learning_curve會將數據分為訓練集和測試集。

適用於所有估算器的所有教程都將數據分為訓練和測試，然后僅縮放訓練數據，並使用訓練數據標度轉換測試數據。 哪個完全明白。

我應該先縮放整個數據集，然后再將其傳遞給learning_curve。 我確實知道learning_curve將使用k折或其他交叉驗證方法，所以它是否重要，因為交叉驗證會平均所有結果？

謝謝，

1 個解決方案

learning_curve不會自行實現StandardScaler 。 您可以創建一個Pipeline作為您的估算器，第一步是StandardScaler然后使用您下一步要使用的任何估算器。 這樣，當您在每次cv迭代期間調用learning_curve時，您都在訓練倍數上同時對定標器和估計量進行訓練，並且在每次迭代中針對測試倍數來驗證性能。

您不希望在調用learning_curve之前縮放整個數據集。 原因是在訓練模型之前縮放整個集合會引入偏差，因為您使用的數據將用於驗證訓練模型，這可能會導致過度擬合。

sklearn中learning_curve函數中estimator參數的值應該是什么？

[英]what should be the value of the estimator parameter in learning_curve function in sklearn?

學習曲線錯誤

[英]Learning_curve error

learning_curve 中的自定義評分

[英]Custom scoring in learning_curve

使用 sklearn 的 learning_curve() 而不是加權的 f1 分數為特定類繪制 f1

[英]Plot f1 for a specific class with sklearn's learning_curve() rather than the weighted f1 scores

在sklearn.svm.SVC（kernel ='rbf'）分類器上使用learning_curve的虛假ValueError

[英]Spurious ValueError using learning_curve on sklearn.svm.SVC(kernel='rbf') classifier

python sklearn：accuracy_score和learning_curve得分有什么區別？

[英]python sklearn: what is the difference between accuracy_score and learning_curve score?

ImportError：沒有名為grid_search的模塊，learning_curve

[英]ImportError: No module named grid_search, learning_curve

學習曲線Sklearn

[英]learning curve Sklearn

learning_curve沒有繪制超過200萬條記錄

[英]learning_curve not plotting more than 2 million records

scikit-learn learning_curve函數在喂入SVM分類器時會引發ValueError

[英]scikit-learn learning_curve function throws a ValueError when fed a SVM Classifier

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 sklearn中learning_curve函數中estimator參數的值應該是什么？學習曲線錯誤 learning_curve 中的自定義評分使用 sklearn 的 learning_curve() 而不是加權的 f1 分數為特定類繪制 f1 在sklearn.svm.SVC（kernel ='rbf'）分類器上使用learning_curve的虛假ValueError python sklearn：accuracy_score和learning_curve得分有什么區別？ ImportError：沒有名為grid_search的模塊，learning_curve 學習曲線Sklearn learning_curve沒有繪制超過200萬條記錄 scikit-learn learning_curve函數在喂入SVM分類器時會引發ValueError

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM