简体   繁体   English

如何用较少的变量在SAS中对模型评分

[英]How to score a model in SAS with less variables

I have the next problem. 我有下一个问题。

I created a random forest in sas with 200 variables. 我在sas中创建了一个具有200个变量的随机森林。

Now I want to score new observations with the same model but I have only 50 variables (the important ones!). 现在,我想用相同的模型给新的观察结果评分,但是我只有50个变量(重要的变量!)。

When I run the score, it crashes and appeared the next message: ERROR: Score input variable var_1 not found in the data set. 当我运行得分时,它崩溃并显示下一条消息:错误:在数据集中找不到得分输入变量var_1。 (And thats ok because I will not going to generate those variables again, they are useless). (没关系,因为我不会再生成这些变量,它们是无用的)。

The question is, how can I run the model with only the 50 variables? 问题是,如何仅使用50个变量运行模型?

Thanks, 谢谢,

You may want to rerun the RF model with the selected 50 variables only. 您可能只想使用选定的50个变量重新运行RF模型。 You are on the correct path. 您走在正确的道路上。 You just did not conduct the second using the 50 selected variables only. 您只是没有仅使用选择的50个变量进行第二次操作。 The error means the saved binary score file still holds the first model columns. 该错误意味着已保存的二进制分数文件仍包含第一模型列。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM