[英]Random Forest Feature Importance using Python
我正在尝试下面的随机森林分类器代码。 即使我已经定义但得到 NameError。 请帮忙
def RFC_model(randomState, X_train, X_test, y_train, y_test):
rand_forest = RandomForestClassifier()
rand_forest.fit(X_train, y_train)
forest_test_predictions = rand_forest.predict(X_test)
print(accuracy_score(y_test, forest_test_predictions))
X_train, X_test, y_train, y_test = train_test_split(df_encoded.drop(['success'],axis='columns').values,
df_encoded.success,
test_size=0.2)
RFC_model(42, X_train, X_test, y_train, y_test)
0.994045375744328
rand_forest.feature_importances_.round(3)
NameError Traceback (most recent call last)
<ipython-input-40-974786899b7f> in <module>
1 #importance of features rounded to nearest 3 decimals
----> 2 rand_forest.feature_importances_.round(3)
NameError: name 'rand_forest' is not defined
您正在RFC_model
函数的范围内本地定义变量rand_forest
。 一旦函数完成执行,对象就会被销毁,因此您无法访问它。 您可以通过返回rand_forest
对象来解决此问题:
def RFC_model(randomState, X_train, X_test, y_train, y_test):
rand_forest = RandomForestClassifier()
rand_forest.fit(X_train, y_train)
forest_test_predictions = rand_forest.predict(X_test)
print(accuracy_score(y_test, forest_test_predictions))
return rand_forest
X_train, X_test, y_train, y_test = train_test_split(df_encoded.drop(['success'],axis='columns').values,
df_encoded.success,
test_size=0.2)
rand_forest = RFC_model(42, X_train, X_test, y_train, y_test)
rand_forest.feature_importances_.round(3)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.