如何根据多个现有列中的数据对新列使用 mutate

Question

Good Morning,早上好，

this is my data set containing data of different client's races.这是我的数据集，包含不同客户种族的数据。

White  Asian  Black  Native  Islander  Other
1       0       0      0       0         0
0       1       0      0       0         0
0       0       0      1       0         0
0       0       1      0       0         0
1       0       0      0       1         0
0       0       0      0       0         1

The data is stored with a Boolean where 0 = No and 1 = Yes数据以布尔值存储，其中 0 = 否，1 = 是

So if a client has 1 for the column white, then they are white.因此，如果客户的列白色为 1，则它们是白色的。

But if a client has a 1 for white and islander then they are multi racial.但是，如果客户对白人和岛民的评分为 1，那么他们就是多种族。

So this would be my desired output所以这将是我想要的输出

White  Asian  Black  Native  Islander  Other   Race
1       0       0      0       0         0     White
0       1       0      0       0         0     Asian
0       0       0      1       0         0     Native
0       0       1      0       0         0     Black
1       0       0      0       1         0     Multi-Racial 
0       0       0      0       0         1     Other

I'm familiar with mutate() but I've only used mutate based off one column.我熟悉 mutate() 但我只使用了基于一列的 mutate。

Can anyone provide a code that can help with my desired output?任何人都可以提供可以帮助我获得所需输出的代码吗？

Answer 1

Using ifelse() with max.col() should get you what you want.使用ifelse()和max.col()应该可以得到你想要的。 For rows that only have one value you index the name the value was in, otherwise it is "Multi-Racial"对于只有一个值的行，您索引该值所在的名称，否则为"Multi-Racial"

df1$Race <- ifelse(rowSums(df1) == 1, names(df1)[max.col(df1)], "Multi-Racial")
df1
  White Asian Black Native Islander Other         Race
1     1     0     0      0        0     0        White
2     0     1     0      0        0     0        Asian
3     0     0     0      1        0     0       Native
4     0     0     1      0        0     0        Black
5     1     0     0      0        1     0 Multi-Racial
6     0     0     0      0        0     1        Other

Or, using mutate() :或者，使用mutate() ：

df1 %>%
  mutate(Race = ifelse(rowSums(.) == 1, names(.)[max.col(.)], "Multi-Racial"))

Data :数据：

df1 <- read.table(header = T, text = "White  Asian  Black  Native  Islander  Other
1       0       0      0       0         0
0       1       0      0       0         0
0       0       0      1       0         0
0       0       1      0       0         0
1       0       0      0       1         0
0       0       0      0       0         1")

如何根据多个现有列中的数据对新列使用 mutate

问题描述

1 个解决方案

解决方案1
2 已采纳 2020-02-27 16:29:20

如何根据多个现有列中的数据对新列使用 mutate

问题描述

1 个解决方案

解决方案1 2 已采纳 2020-02-27 16:29:20

解决方案1
2 已采纳 2020-02-27 16:29:20