基于 r 中的行值的条件 if 语句

Question

I am new to R and I would really appreciate your assistance in this.我是 R 的新手，我非常感谢您在这方面的帮助。

I have a dataframe,with 2 levels being 'Y' AND 'N' indicators on 11 variables.我有一个数据框，有 2 个级别是 11 个变量的“Y”和“N”指标。

I would like to have a new column, which concatenates column names when row value equals to 'Y'我想要一个新列，当行值等于“Y”时连接列名

ie IE

Answer 1

In base R, we can create a row/column index matrix where value is "Y" using which .在基数 R 中，我们可以使用which创建一个行/列索引矩阵，其中值是"Y" 。 Using tapply , we can paste the column names for each row.使用tapply ，我们可以为每一行paste列名。

cols <- paste0('col', 1:9)
mat <- which(df[cols] == 'Y', arr.ind = TRUE)
df$new_col <- as.character(tapply(names(df)[mat[, 2]], mat[, 1], 
               paste, collapse = "_"))
df
#  col1 col2 col3 col4 col5 col6 col7 col8 col9 col10 col11                       new_col
#1    N    Y    N    Y    Y    Y    N    Y    Y     1   624 col2_col4_col5_col6_col8_col9
#2    N    Y    N    Y    Y    Y    N    Y    N     7   548      col2_col4_col5_col6_col8

Using tidyverse we can get the data in long format, filter rows where value is "Y" and for each row paste column values.使用tidyverse我们可以获得长格式的数据， filter value "Y"行，并为每一行粘贴列值。

library(dplyr)

df %>%
  mutate(row = row_number()) %>%
  tidyr::pivot_longer(cols = -c(col10, col11, row)) %>%
  filter(value == 'Y') %>%
  group_by(row, col10, col11) %>%
  summarise(newcol = toString(name)) %>%
  ungroup() %>%
  select(-row)

data数据

df <- structure(list(col1 = structure(c(1L, 1L), .Label = "N", class = "factor"), 
col2 = structure(c(1L, 1L), .Label = "Y", class = "factor"), 
col3 = structure(c(1L, 1L), .Label = "N", class = "factor"), 
col4 = structure(c(1L, 1L), .Label = "Y", class = "factor"), 
col5 = structure(c(1L, 1L), .Label = "Y", class = "factor"), 
col6 = structure(c(1L, 1L), .Label = "Y", class = "factor"), 
col7 = structure(c(1L, 1L), .Label = "N", class = "factor"), 
col8 = structure(c(1L, 1L), .Label = "Y", class = "factor"), 
col9 = structure(2:1, .Label = c("N", "Y"), class = "factor"), 
col10 = c(1, 7), col11 = c(623.53, 548.028)), row.names = c(NA, -2L),
class = "data.frame")

Answer 2

A simple, base R, way of doing it is一个简单的基本 R 方法是

df1$newcol <- apply(df1, 1, function(x){
  paste(names(df1)[x == "Y"], collapse = "_")
})

Test data creation code.测试数据创建代码。

set.seed(1234)
df1 <- t(replicate(2, sample(c("N", "Y"), 10, TRUE)))
df1 <- as.data.frame(df1)
df1 <- cbind(df1, matrix(1:4, 2))
names(df1) <- paste0("col", 1:ncol(df1))

基于 r 中的行值的条件 if 语句

问题描述

2 个解决方案

解决方案1
4 已采纳 2020-02-03 12:56:46

解决方案2
4 2020-02-03 13:01:58

基于 r 中的行值的条件 if 语句

问题描述

2 个解决方案

解决方案1 4 已采纳 2020-02-03 12:56:46

解决方案2 4 2020-02-03 13:01:58

解决方案1
4 已采纳 2020-02-03 12:56:46

解决方案2
4 2020-02-03 13:01:58