简体   繁体   English

方差膨胀因子输出统计模型

[英]Variance inflation factor output statsmodels

I am trying to find the multicollinearity using statsmodels, but the output of my code does not reveal the variance inflation factors but as dataframe of generator objects:我正在尝试使用 statsmodels 找到多重共线性,但是我的代码的输出没有显示方差膨胀因子,而是作为生成器对象的数据框:

from statsmodels.stats.outliers_influence import variance_inflation_factor
variables = df[['Mileage','Year','EngineV']]
vif = pd.DataFrame()
vif['VIF'] = (variance_inflation_factor(variables.values,i) for i in range(variables.shape[1]))
vif['features'] = variables.columns

results in the output结果在输出

                                                 VIF  | features
  ---------------------------------------------------------------
0 | <generator object <genexpr> at 0x0000023A9F204... | Mileage
1 | <generator object <genexpr> at 0x0000023A9F204... |    Year
2 | <generator object <genexpr> at 0x0000023A9F204... | EngineV

rather than giving the actual values.而不是给出实际值。 I am sure this is an easy fix but I am very new to Python and coding.我确信这是一个简单的解决方法,但我对 Python 和编码非常陌生。 Thanks谢谢

You should convert your generator to list.您应该将生成器转换为列表。 You can use list comprehansion or list() function.您可以使用列表理解或 list() 函数。

vif['VIF'] = [variance_inflation_factor(variables.values,i) for i in range(variables.shape[1])]

or或者

vif['VIF'] = list((variance_inflation_factor(variables.values,i) for i in range(variables.shape[1])))

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM