![](/img/trans.png)
[英]How to use dplyr to convert variables from numeric to factor with unique levels
[英]How to use mutate and ifelse to convert numerical variables into factor variable with multiple levels
我很难从现有的数字变量创建新的因子变量。 我有一个与参与者的年龄相关的数字变量Age,但想创建一个将参与者的年龄分类为不同类别的因子变量。 每当我运行代码时,都会出现错误:
“错误:缺少参数“ no”,没有默认值。”
我尝试了以下代码的不同变体,例如不带引号的新因子水平,使用:表示范围等。我的代码如下。
data.frame%>%
mutate(Age = ifelse(Age < 20, "0"),
ifelse(Age >= 20 & Age <= 29, "1"),
ifelse(Age >=30 & Age <= 39, "2"),
ifelse(Age >= 40 & Age <=49, "3"),
ifelse(Age >= 50 & Age <= 59, "4"),
ifelse(Age >= 60 & Age <= 69, "5"),
ifelse(Age >= 70, "6", NA))
cut()
是执行此操作的最简单方法。
在基数R中:
Age <- seq(10,80,by=10)
cut(Age,breaks=c(-Inf,seq(20,70,by=10),Inf),
right=FALSE,
labels=as.character(0:6))
我将让您随意将其嵌入mutate()
中。
您的代码的问题是您没有正确嵌套的选择:将这段代码与您的代码进行仔细比较...
Age = ifelse(Age < 20, "0",
ifelse(Age >= 20 & Age <= 29, "1",
ifelse(...,[yes],[no])))
尾括号")"
应该放在所有ifelse
:
df1 <- data.frame(Age=c(1:80,NA))
df1%>%
mutate(Age_cat = factor(ifelse(Age < 20, "0",
ifelse(Age >= 20 & Age <= 29, "1",
ifelse(Age >=30 & Age <= 39, "2",
ifelse(Age >= 40 & Age <=49, "3",
ifelse(Age >= 50 & Age <= 59, "4",
ifelse(Age >= 60 & Age <= 69, "5",
ifelse(Age >= 70, "6", NA)))))))))
不过,你也应该知道,在dplyr
,这是一个完美的机会case_when
:
df1 %>%
mutate(Age_cat= factor(case_when(
.$Age < 20 ~ "0",
.$Age >= 20 & .$Age <= 29 ~ "1",
.$Age >= 30 & .$Age <= 39 ~"2",
.$Age >= 40 & .$Age <=49 ~ "3",
.$Age >= 50 & .$Age <= 59 ~ "4",
.$Age >= 60 & .$Age <= 69 ~ "5",
TRUE ~"6"))
)
Age Age_cat
1 1 0
2 2 0
3 3 0
4 4 0
5 5 0
...
13 13 0
14 14 0
15 15 0
16 16 0
17 17 0
18 18 0
19 19 0
20 20 1
21 21 1
22 22 1
23 23 1
24 24 1
...
79 79 6
80 80 6
81 NA <NA>
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.