
Assign value to variable based on values on multiple other columns (alternative to ifelse)

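I have a data frame describing a large number of people. I want to assign each person to a group based on several variables. For example, suppose I have a variable "state" with 5 states, a variable "age group" with 4 groups, and a variable "income" with 5 groups. That gives 5 x 4 x 5 = 100 groups, which I would like to label with the numbers 1 to 100. In the past I have done this with a combination of ifelse statements, but now that there are 100 possible outcomes, I am wondering whether there is a faster way than specifying each combination by hand.

Here is an MWE with the expected result: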

mydata <- as.data.frame(cbind(c("FR","UK","UK","IT","DE","ES","FR","DE","IT","UK"),
 c("20","80","20","40","60","20","60","80","40","60"),c(1,4,2,3,1,5,5,3,4,2)))
colnames(mydata) <- c("Country","Age","Income")

group_grid <- transform(expand.grid(state = c("IT","FR","UK","ES","DE"), 
       age = c("20","40","60","80"), income = 1:5), val = 1:100)

desired_result <- as.data.frame(cbind(c("FR","UK","UK","IT","DE","ES","FR","DE","IT","UK"),
                                      c("20","80","20","40","60","20","60","80","40","60"),
                                      c(1,4,2,3,1,5,5,3,4,2),
                                      c(2,78,23,46,15,84,92,60,66,33)))

colnames(desired_result) <- c("Country","Age","Income","Group_code")

This should do it, with the codes numbered by the level order of each column:

mydata$Group_code <- with(mydata, as.integer(interaction(Country, Age, Income)))
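One caveat with the interaction() one-liner: factor levels default to alphabetical order (DE, ES, FR, IT, UK), so the codes it produces are unique but will not match the numbering in group_grid, where the states are listed as IT, FR, UK, ES, DE. A minimal sketch that reproduces the expected codes, assuming the level orders are simply copied from the group_grid definition in the question (the data is rebuilt here with data.frame() only to keep the example self-contained):

```r
# Rebuild the example data (same values as in the question).
mydata <- data.frame(
  Country = c("FR","UK","UK","IT","DE","ES","FR","DE","IT","UK"),
  Age     = c("20","80","20","40","60","20","60","80","40","60"),
  Income  = c(1,4,2,3,1,5,5,3,4,2),
  stringsAsFactors = FALSE
)

# Assumption: the level orders below mirror the vectors passed to
# expand.grid() when group_grid was built.
country_f <- factor(mydata$Country, levels = c("IT","FR","UK","ES","DE"))
age_f     <- factor(mydata$Age,     levels = c("20","40","60","80"))
income_f  <- factor(as.character(mydata$Income), levels = as.character(1:5))

# interaction() lets the first factor vary fastest, the same ordering
# that expand.grid() uses, so the integer codes line up with val.
mydata$Group_code <- as.integer(interaction(country_f, age_f, income_f))
mydata
```

With these levels the code for each row works out to country index + 5 * (age index - 1) + 20 * (income index - 1), for example 3 + 15 + 60 = 78 for UK, 80, 4.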

Here is an option using left_join from dplyr:

library(dplyr)

# Convert the lookup grid's columns to character, because the join
# requires key columns of the same class on both sides.
grpD <- group_grid %>%
  mutate_if(is.factor, as.character) %>%
  mutate(income = as.character(income))

# Convert any factor columns in mydata too, then join on the three keys.
mydata %>%
  mutate_if(is.factor, as.character) %>%
  left_join(grpD, by = c("Country" = "state", "Age" = "age", "Income" = "income"))
#    Country Age Income val
#1       FR  20      1   2
#2       UK  80      4  78
#3       UK  20      2  23
#4       IT  40      3  46
#5       DE  60      1  15
#6       ES  20      5  84
#7       FR  60      5  92
#8       DE  80      3  60
#9       IT  40      4  66
#10      UK  60      2  33
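The same lookup against group_grid can also be done without dplyr. The following is only a sketch in base R: it pastes the three columns into a single key on each side and uses match() to pull the corresponding val. The names grid_key and data_key and the "_" separator are illustrative choices, not part of the original question.

```r
# Rebuild the example data and the lookup grid from the question.
mydata <- data.frame(
  Country = c("FR","UK","UK","IT","DE","ES","FR","DE","IT","UK"),
  Age     = c("20","80","20","40","60","20","60","80","40","60"),
  Income  = c(1,4,2,3,1,5,5,3,4,2),
  stringsAsFactors = FALSE
)
group_grid <- transform(
  expand.grid(state = c("IT","FR","UK","ES","DE"),
              age = c("20","40","60","80"),
              income = 1:5,
              stringsAsFactors = FALSE),
  val = 1:100
)

# Build one text key per row on each side (hypothetical helper names).
grid_key <- paste(group_grid$state, group_grid$age, group_grid$income, sep = "_")
data_key <- paste(mydata$Country, mydata$Age, mydata$Income, sep = "_")

# match() returns, for each person, the row of group_grid with the same key.
mydata$Group_code <- group_grid$val[match(data_key, grid_key)]
mydata
```

Unlike the join, this keeps mydata in its original row order by construction, and any combination missing from group_grid would show up as NA in Group_code.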
