如何根據r中的另一列填寫缺失的一列

Question

我有一個數據框的子集，如下所示。 我想在“患病年齡”列中填寫 NA，以便一個患有疾病的人的年齡與沒有疾病的兄弟姐妹（從 familyID 識別）相同。

structure(list(id = c(1, 2, 3, 4, 5, 6), 
           familyId = c(1, 1, 2, 2, 3, 3), 
           disease = c(1, 0, 0, 1, 1, 0), 
           `age at disease` = c("40","NA", "NA", "43", "52", "NA")), 
      class = c("tbl_df", "tbl", "data.frame"), row.names = c(NA, -6L))

這意味着最后一列“患病年齡”應該是：c(40,40,43,43,52,52)。

Answer 1

您可以使用以下代碼：

library(dplyr)
library(tidyr)
df %>%
  na_if("NA") %>%
  group_by(familyId) %>%
  fill(`age at disease`) %>%
  fill(`age at disease`, .direction = "up")

輸出：

# A tibble: 6 × 4
# Groups:   familyId [3]
     id familyId disease `age at disease`
  <dbl>    <dbl>   <dbl> <chr>           
1     1        1       1 40              
2     2        1       0 40              
3     3        2       0 43              
4     4        2       1 43              
5     5        3       1 52              
6     6        3       0 52

Answer 2

如果每組只有一個非 NA 元素，我們也可以這樣做

library(dplyr)
df1 %>%
   type.convert(as.is = TRUE) %>%
   group_by(familyId) %>%
   mutate(`age at disease` = `age at disease`[complete.cases(`age at disease`)][1]) %>% 
   ungroup

-輸出

# A tibble: 6 × 4
     id familyId disease `age at disease`
  <dbl>    <dbl>   <dbl> <chr>           
1     1        1       1 40              
2     2        1       0 40              
3     3        2       0 43              
4     4        2       1 43              
5     5        3       1 52              
6     6        3       0 52

Answer 3

這是另一種dplyr方法：

df %>%
  group_by(familyId) %>% 
  arrange(`age at disease`,.by_group = TRUE) %>% 
  mutate(`age at disease` = first(`age at disease`))

     id familyId disease `age at disease`
  <dbl>    <dbl>   <dbl> <chr>           
1     1        1       1 40              
2     2        1       0 40              
3     4        2       1 43              
4     3        2       0 43              
5     5        3       1 52              
6     6        3       0 52

如何根據r中的另一列填寫缺失的一列

問題描述

3 個解決方案

解決方案1
2 2022-05-14 15:40:49

解決方案2
2 2022-05-14 15:57:16

解決方案3
2 2022-05-14 16:24:57

如何根據r中的另一列填寫缺失的一列

問題描述

3 個解決方案

解決方案1 2 2022-05-14 15:40:49

解決方案2 2 2022-05-14 15:57:16

解決方案3 2 2022-05-14 16:24:57

解決方案1
2 2022-05-14 15:40:49

解決方案2
2 2022-05-14 15:57:16

解決方案3
2 2022-05-14 16:24:57