
Multiple linear regression model

Can anyone please help me understand the difference between the asterisk and the plus sign in a multiple linear regression model formula? If I want to fit this model

model <- lm(continuous_variable ~ SEX + RACE + gene * SNP,
            data = mydata)

is it correct to use the plus sign and the asterisk at the same time? I'm interested in the effect of the factors gene and SNP on the continuous variable, and I'm controlling for sex and race as covariates in this study.

I also have another question: if I'm fitting more than 100 models, should I check the model assumptions for each model that I fit?

The * operator denotes crossing: y ~ a*b is equivalent to y ~ a + b + a:b, that is, both main effects plus their interaction (a:b is the interaction term on its own).
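The expansion described above can be checked without any data by inspecting the terms of each formula. This is only a small sketch; y, a and b are placeholder names, not variables from the question.

```r
# y, a and b are placeholders; terms() can expand a formula without data.
f_star  <- y ~ a * b
f_colon <- y ~ a + b + a:b

# Term labels produced by each formula
labels_star  <- attr(terms(f_star),  "term.labels")
labels_colon <- attr(terms(f_colon), "term.labels")

print(labels_star)
print(labels_colon)

# Both formulas expand to the same set of terms
identical(labels_star, labels_colon)
```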

continuous_variable ~ SEX + RACE + gene * SNP

is therefore equivalent to

continuous_variable ~ SEX + RACE + gene + SNP + gene:SNP

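So what you're doing is totally fine: sex and race enter as additive covariates, and gene and SNP enter with both their main effects and their interaction.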
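To see the equivalence on an actual fit, here is a minimal sketch using simulated data. The data frame, its column names (y, SEX, RACE, gene, SNP) and the factor levels are all made up for illustration and are not the asker's data. It also fits the additive-only model, since comparing it with the crossed model is one common way to test the interaction.

```r
# Simulated data: every column name and value here is hypothetical.
set.seed(1)
n <- 200
mydata <- data.frame(
  SEX  = factor(sample(c("F", "M"), n, replace = TRUE)),
  RACE = factor(sample(c("A", "B", "C"), n, replace = TRUE)),
  gene = factor(sample(c("g1", "g2"), n, replace = TRUE)),
  SNP  = factor(sample(c("AA", "AG", "GG"), n, replace = TRUE))
)
mydata$y <- rnorm(n)

# Crossed form and its explicit expansion
fit_star  <- lm(y ~ SEX + RACE + gene * SNP, data = mydata)
fit_colon <- lm(y ~ SEX + RACE + gene + SNP + gene:SNP, data = mydata)

# The two fits should have identical coefficient estimates
same_coefs <- isTRUE(all.equal(coef(fit_star), coef(fit_colon)))
print(same_coefs)

# Additive model without the gene:SNP interaction, for comparison
fit_add <- lm(y ~ SEX + RACE + gene + SNP, data = mydata)

# F-test of whether the interaction improves on the additive model
print(anova(fit_add, fit_star))
```

Using + alone between gene and SNP would drop the interaction, so the choice between + and * depends on whether the effect of the SNP is expected to differ by gene.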
