简体   繁体   English

生存分析:船尾模型,R中的simexaft包

[英]Survival analysis: aft model, simexaft package in R

We are trying to reproduce the results of an accelarated failure time (aft) model in R, which has been coded in SAS. 我们正在尝试重现R中加速故障时间(aft)模型的结果,该模型已在SAS中进行了编码。

The data set we use is here 我们使用的数据集在这里

There you can find the SAS code as well. 在那里您也可以找到SAS代码。

    formula <- survreg(Surv(Duration, Censor) ~ Acq_Expense + Acq_Expense_SQ + Ret_Expense + Ret_Expense_SQ + Crossbuy + Frequency + Frequency_SQ + Industry + Revenue + Employees, dist='weibull', data = daten [daten$Acquisition==1, ])
    out1 <- survreg(formula = formula, data = daten [daten$Acquisition==1, ], dist = "weibull")
    summary(out1)

    ind <- c("Duration", "Censor")

    err.mat <- ???

    out2 <- simexaft(formula = formula, data = daten [daten$Acquisition==1, ], SIMEXvariable = ind, repeated = FALSE, err.mat = err.mat, dist = "weibull")
    summary(out2)

Our question is how to define the err.mat term? 我们的问题是如何定义err.mat术语? err.mat specifies the variables with measurement errors. err.mat指定具有测量错误的变量。 Since our data set is right censored I thought the variables with measurement error are probably Duration and/or Censor. 由于我们的数据集是正确的,所以我认为带有测量误差的变量可能是持续时间和/或检查器。 But it is not as simple as that, err.mat must be a square symmetric numeric matrix. 但这不是那么简单,err.mat必须是一个方形对称数字矩阵。

If you read the Journal of Statistical Software,January 2012, Volume 46, article describing the simexaft package , it becomes clear that in the situation without repeated measurements to estimate the measurement errors from data, that you must supply these estimates yourself from domain knowledge. 如果您阅读了Journal of Statistics Software(统计软件杂志),2012年1月,第46卷,该文章描述了simexaft软件包 ,那么很显然,在没有重复测量的情况下根据数据估算测量误差的情况下,您必须自己从领域知识中提供这些估算值。 See the example in pages 6-8. 请参阅第6-8页中的示例。 Also see the cited "Statistics in Medicine" article available at Dr Yi's website . 另请参阅易博士网站上提供的“医学统计学”文章 The measurement errors are the first two predictor variables, systolic blood pressure (SBP) and serum cholesterol(CHOL) in that example. 在该示例中,测量误差是前两个预测变量,即收缩压(SBP)和血清胆固醇(CHOL)。 If you are using the text from which you are extracting that data, then you will need to read the chapter text (which does not appear to be available at that website) to determine what assumptions they make about the measurement errors. 如果您使用的是从中提取数据的文本,那么您将需要阅读章节文本(该网站上似乎没有该章节),以确定他们对测量误差的假设。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM