I need to remove multicollinearity between features
I have categorical variables such as Gender, Anxiety, and Alcoholic. When I convert them to numerical values with an encoder, the encoded columns end up with nearly identical values, and multicollinearity appears. How can I convert these variables to numbers so that multicollinearity does not arise? All three variables are important for predicting the target variable.
You don't need to transform the data. Instead, you can change the way you calculate the correlation between the variables. Because these are categorical features, use the chi-squared test of independence rather than a correlation coefficient computed on the encoded numbers. Then you won't face this issue.
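A minimal sketch of the chi-squared test of independence on pairs of categorical columns, assuming pandas and SciPy are available. The column names come from the question; the data values and the helper name `chi2_pvalue` are invented for illustration only.

```python
import pandas as pd
from scipy.stats import chi2_contingency

# Hypothetical toy data: column names follow the question, values are made up.
df = pd.DataFrame({
    "Gender":    ["M", "F", "M", "F", "M", "F", "M", "F"],
    "Anxiety":   ["yes", "no", "yes", "no", "no", "yes", "yes", "no"],
    "Alcoholic": ["yes", "no", "no", "no", "yes", "yes", "yes", "no"],
})

def chi2_pvalue(data, col_a, col_b):
    """Return the p-value of the chi-squared test of independence for two columns."""
    # Build the contingency table of observed counts for the two categories.
    table = pd.crosstab(data[col_a], data[col_b])
    chi2, p_value, dof, expected = chi2_contingency(table)
    return p_value

pairs = [("Gender", "Anxiety"), ("Gender", "Alcoholic"), ("Anxiety", "Alcoholic")]
p_values = {pair: chi2_pvalue(df, *pair) for pair in pairs}

for pair, p in p_values.items():
    print(pair, round(p, 3))
```

A small p-value suggests the two features are associated; a large one gives no evidence against independence. The toy table here is far too small for a reliable test (expected cell counts should generally be at least 5), so treat it only as a template for your own data.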