[英]Add column that is the sum of other columns
輸入數據:
Director= c("Director A", "Director B", "Director C")
Salary = c(40000, 35000, 50000)
Listed boards = c(1, 0, 3)
Unlisted boards = c(4, 2, 6)
Other boards = c(2, 3, 3)
Number of qualifications = c(1, 2, 1)
df_directors = data.frame(Director, Salary, Listed boards, Unlisted boards, Other boards, Number of qualifications)
現在我想創建一個額外的列來總結和替換關於板的三列。 所以我的輸出數據應該是這樣的:
Director Salary Boards Number of qualifications
Director A 40000 7 1
Director B 35000 5 2
Director C 50000 12 1
這段代碼不知何故不起作用。
df_directors <- df_directors %>%
rowwise() %>%
mutate(rBoards = sum(c_across(all_of(Listed Boards:Other Boards))))
我們可以將rowSums
與select
一起使用:
library(dplyr)
df_directors %>%
transmute(Director, Salary, Boards = rowSums(select(., contains("boards"))),Number_of_qualifications)
第一個解決方案:
library(dplyr)
df_directors %>%
mutate(Boards = rowSums(select(., contains("boards")))) %>%
select(1,2,Boards,6)
Director Salary Boards Number_of_qualifications
1 Director A 40000 7 1
2 Director B 35000 5 2
3 Director C 50000 12 1
我們可以使用base R
中的rowSums
library(dplyr)
df_directors %>%
transmute(Director, Salary,
Boards = rowSums(across(ends_with('boards')), na.rm = TRUE),
`Number of qualifications`)
-輸出
Director Salary Boards Number of qualifications
1 Director A 40000 7 1
2 Director B 35000 5 2
3 Director C 50000 12 1
在OP的代碼中,當列名中有空格時需要反引號
df_directors %>%
rowwise() %>%
mutate(rBoards = sum(c_across(`Listed boards`:`Other boards`))) %>%
ungroup
-輸出
# A tibble: 3 × 7
Director Salary `Listed boards` `Unlisted boards` `Other boards` `Number of qualifications` rBoards
<chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 Director A 40000 1 4 2 1 7
2 Director B 35000 0 2 3 2 5
3 Director C 50000 3 6 3 1 12
df_directors <- structure(list(Director = c("Director A", "Director B", "Director C"
), Salary = c(40000, 35000, 50000), `Listed boards` = c(1, 0,
3), `Unlisted boards` = c(4, 2, 6), `Other boards` = c(2, 3,
3), `Number of qualifications` = c(1, 2, 1)),
class = "data.frame", row.names = c(NA,
-3L))
rowSums
和grep
用於識別。
cbind(df_directors[1:2], Boards=rowSums(df_directors[grep('boards$', names(df_directors))]), df_directors[ncol(df_directors)])
# Director Salary Boards Number.of.qualifications
# 1 Director A 40000 7 1
# 2 Director B 35000 5 2
# 3 Director C 50000 12 1
數據:
df_directors <- structure(list(Director = c("Director A", "Director B", "Director C"
), Salary = c(40000, 35000, 50000), Listed.boards = c(1, 0, 3
), Unlisted.boards = c(4, 2, 6), Other.boards = c(2, 3, 3), Number.of.qualifications = c(1,
2, 1)), class = "data.frame", row.names = c(NA, -3L))
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.