如何根据使用 R 与第三列的匹配，将数据框中多列的值替换为第二列中的值？

Question

I am working with a single dataframe in R containing the following char columns and values.我正在使用 R 中的单个 dataframe 包含以下字符列和值。

C1<-c("1","2","3","4","5")
C2<-c("x", "t", "u", "r", "j")
C3<-c("2","5","3","1","4")
C4<-c("3","1","NA", "2","5")
df<-data.frame(C1,C2,C3,C4)

I am trying to write code that will replace values in C3 and C4 as follows:我正在尝试编写将替换 C3 和 C4 中的值的代码，如下所示：

For each value in C3, find the same value in C1.对于 C3 中的每个值，在 C1 中找到相同的值。
Replace the value in C3 with the value in C2 that occurs in the row with the C3/C1 match.将 C3 中的值替换为 C2 中与 C3/C1 匹配的行中出现的值。 In C3, For example, "2" (the first value) would be replaced with "t", "5" would be replaced with "j", "3" would be replaced with "3" and so forth.例如，在 C3 中，“2”（第一个值）将替换为“t”，“5”将替换为“j”，“3”将替换为“3”等等。
Repeat the same procedure for values in C4.对 C4 中的值重复相同的过程。
Skip any cells with an NA in C3 or C4.跳过 C3 或 C4 中具有 NA 的任何单元格。

The initial dataframe looks like this:最初的 dataframe 如下所示：

初始数据框

The final dataframe should look like this:最终的 dataframe 应如下所示：

更新的数据框

I've yet to come up with code (base R or Dplyr) that will accomplish this task.我还没有想出可以完成这项任务的代码（基础 R 或 Dplyr）。 If anyone can lend assistance, I would really appreciate it.如果有人可以提供帮助，我将不胜感激。

Thanks!谢谢！

This is a new df that I've tried to manipulate with the code provided by respondents (eg, df[c("C3", "C4")] <- lapply(df[c("C3", "C4")], function(x) df$C2[match(x, df$C1)])).这是我尝试使用受访者提供的代码来操作的新 df（例如 df[c("C3", "C4")] <- lapply(df[c("C3", "C4") ]，函数（x）df$C2[匹配（x，df$C1）]））。

I am returning all NA's for C3 C4 and cannot understand why.我要退回 C3 C4 的所有 NA，但不明白为什么。 There are matches between C3 and C1. C3 和 C1 之间存在匹配。

Answer 1

We can use match我们可以使用match

df[c("C3", "C4")] <- lapply(df[c("C3", "C4")], function(x) df$C2[match(x, df$C1)])

Answer 2

I also used match , but split it up into two different statements to make it more clear what was going on:我也使用了match ，但将其拆分为两个不同的语句，以便更清楚地了解发生了什么：

# Create sample data
C1<-c("1","2","3","4","5")
C2<-c("x", "t", "u", "r", "j")
C3<-c("2","5","3","1","4")
C4<-c("3","1","NA", "2","5")
df<-data.frame(C1,C2,C3,C4)

# Make replacements
df$C3_mod <- ifelse(is.na(df$C3), df$C3, df$C2[match(df$C3, df$C1)])
df$C4_mod <- ifelse(is.na(df$C4), df$C4, df$C2[match(df$C4, df$C1)])

# View results
df
#   C1 C2 C3 C4 C3_mod C4_mod
# 1  1  x  2  3      t      u
# 2  2  t  5  1      j      x
# 3  3  u  3 NA      u   <NA>
# 4  4  r  1  2      x      t
# 5  5  j  4  5      r      j

Answer 3

Using match with matrix.使用与矩阵match 。

cols <- c('C3', 'C4')
df[cols] <- df$C2[match(as.matrix(df[cols]), df$C1)]
df

#  C1 C2 C3   C4
#1  1  x  t    u
#2  2  t  j    x
#3  3  u  u <NA>
#4  4  r  x    t
#5  5  j  r    j

Answer 4

I solved the issue of my NA values.我解决了我的 NA 值的问题。 It turns out that I had whitespaces in the column values that I hadn't accounted for.事实证明，我没有考虑到列值中有空格。 Again, thanks to everyone for their responses.再次感谢大家的回复。 I learned a lot in the process.在这个过程中我学到了很多。

如何根据使用 R 与第三列的匹配，将数据框中多列的值替换为第二列中的值？

问题描述

4 个解决方案

解决方案1
0 2021-02-21 20:22:41

解决方案2
0 2021-02-21 21:55:22

解决方案3
0 2021-02-22 03:23:24

解决方案4
0 2021-02-22 16:55:24

如何根据使用 R 与第三列的匹配，将数据框中多列的值替换为第二列中的值？

问题描述

4 个解决方案

解决方案1 0 2021-02-21 20:22:41

解决方案2 0 2021-02-21 21:55:22

解决方案3 0 2021-02-22 03:23:24

解决方案4 0 2021-02-22 16:55:24

解决方案1
0 2021-02-21 20:22:41

解决方案2
0 2021-02-21 21:55:22

解决方案3
0 2021-02-22 03:23:24

解决方案4
0 2021-02-22 16:55:24