使用基於其他列的變量“名稱”的值在 R 中添加列

Question

如果我們有一個 data.frame ，請說類似

 ///// !col1!col2!col3 --------------- id123 1 0 0 --------------- !id435 0 1 0 --------------- !id777 0 0 1

我想創建一個新列，newcol 的變量名稱的值具有 '1'

數據要

 ///// !col1!col2!col3!newcol --------------------- id123 1 0 0 !col1 --------------------- !id435 0 1 0 !col2 --------------------- !id777 0 0 1 !col3

1）有沒有辦法在 base 或 plyr 中做？ 2）（可選）如果 id123 在 col1 和 col2 中都有值 1 ，如何調整它？ 如何“添加”這些值，在 newcol 中用逗號分隔

 temp$col1 <- c(1,0,0) temp$col2 <- c(0,1,0) temp$col3 <- c(0,0,1) temp<-data.frame(temp$col1, temp$col2, temp$col3)

感謝您的支持:)

Answer 1

我們可以在base R使用max.col

temp$newcol <- names(temp)[max.col(temp, 'first')]

如果我們在同一行有多個 1，並且所有列的名稱都是一個字符串

i1 <- which(temp2 ==1, arr.ind = TRUE)
temp2$newcol <- NA_character_
temp2$newcol[unique(i1[,1])] <-  tapply(names(temp2)[i1[,2]],
         i1[,1], FUN = toString)
temp2$newcol
#[1] "col1"       "col1, col2" "col3"

這也將確保只分配給有 1 個的行

數據

temp <- data.frame(col1  = c(1, 0, 0), col2 = c(0, 1, 0), col3 = c(0, 0, 1))
temp2 <- data.frame(col1 = c(1, 1, 0), col2 = c(0, 1, 0), col3 = c(0, 0, 1))

Answer 2

附加選項

library(tidyverse)
temp2 <- data.frame(col1 = c(1, 1, 0), col2 = c(0, 1, 0), col3 = c(0, 0, 1)) 

temp2 <- temp2 %>% 
  mutate(id = row_number())

temp2 %>% 
  pivot_longer(-id) %>% 
  filter(value == 1) %>% 
  group_by(id) %>% 
  summarise(col = str_c(name, collapse = ", ")) %>% 
  left_join(temp2) %>% 
  select(-id)

使用基於其他列的變量“名稱”的值在 R 中添加列

問題描述

2 個解決方案

解決方案1
1 2020-03-23 17:15:04

數據

解決方案2
1 2020-03-23 17:50:39

使用基於其他列的變量“名稱”的值在 R 中添加列

問題描述

2 個解決方案

解決方案1 1 2020-03-23 17:15:04

數據

解決方案2 1 2020-03-23 17:50:39

解決方案1
1 2020-03-23 17:15:04

解決方案2
1 2020-03-23 17:50:39