R 中多列的卡方檢驗

Question

在這里，我做了如下data ：

data<-data.frame(alzheimer=c(1,1,0,1,0,0,1,0,0,0),
                 asthma=c(1,1,0,0,1,1,1,1,0,0),
                 points=c(0,1,3,5,3,2,1,2,1,5),
                 sex=c(1,1,0,0,0,0,1,1,1,0))

我想知道sex是否會影響alzheimer或asthma或points 。 所以我正在考慮為獨立性做卡方檢驗。 alzheimer和asthma是二元變量，所以我想我可以分別將sex ==1 和sex ==0 中的所有數字相加，並制作列聯表來進行卡方檢驗。 對於變量points ，我不知道是否可以進行卡方檢驗，因為points是一個序數變量，范圍從 0 到 5，只有整數。

總結一下，我想做3個測試。

sex和alzheimer是獨立的嗎？
sex和asthma是獨立的嗎？
sex和points是獨立的嗎？

另外，在我的實際data中，有很多列，所以我需要知道如何一次完成許多測試並將其寫入 csv 文件。 csv 文件應包含測試統計數據和 p 值。

Answer 1

我們可以編寫一個 function stat_test ，它在二進制列上應用chisq.test ，在其他列上應用wilcox.test （假設它們都是序數列）。 我們可以把這個 function output 做成三件事。

測試名稱
統計數據（stats）的值
p值

然后我們可以使用dplyr::across()將此測試應用於所有列（期望在我們的函數中用作y輸入的alzheimer列）。 之后我們只需將標簽添加為第一行。

data <- data.frame(alzheimer=c(1,1,0,1,0,0,1,0,0,0),
                   asthma=c(1,1,0,0,1,1,1,1,0,0),
                   points=c(0,1,3,5,3,2,1,2,1,5),
                   sex=c(1,1,0,0,0,0,1,1,1,0))

library(dplyr)

stat_test <- function(x, y) {
  if (length(unique(na.omit(x))) > 2) {
    res <- chisq.test(x = x,
               y = y)
    label <- "chi_square"
  } else {
    res <- wilcox.test(x, y = y)
    label <- "wilcox"
  }
  
  c(
    test = label,
    stats = res$statistic,
    p_val = res$p.value
  )
}

data %>% 
  as_tibble %>% 
  summarise(across(-alzheimer,
                   ~ stat_test(.x, alzheimer))) %>% 
  mutate(label = c("test", "stats", "pvalue"), .before = 1L)
#> Warning in wilcox.test.default(x, y = y): cannot compute exact p-value with ties
#> Warning in chisq.test(x = x, y = y): Chi-squared approximation may be incorrect
#> Warning in wilcox.test.default(x, y = y): cannot compute exact p-value with ties
#> # A tibble: 3 x 4
#>   label  asthma            points            sex              
#>   <chr>  <chr>             <chr>             <chr>            
#> 1 test   wilcox            chi_square        wilcox           
#> 2 stats  60                5.13888888888889  55               
#> 3 pvalue 0.407562453620744 0.273341191458911 0.693376361757653

^{由代表 package (v2.0.1) 於 2022 年 9 月 27 日創建}

R 中多列的卡方檢驗

問題描述

1 個解決方案

解決方案1
1 已采納 2022-09-27 06:59:55

R 中多列的卡方檢驗

問題描述

1 個解決方案

解決方案1 1 已采納 2022-09-27 06:59:55

解決方案1
1 已采納 2022-09-27 06:59:55