[英]Rowsums in R by unique values in multiple columns
假设我有一张有棋手对的桌子(只是一个假想的例子)。 该表显示了谁玩了白棋、黑棋以及玩家之间的比赛次数。
| White| Black| Games|
|:---- |:------:| -----:|
| Anand| Caruana| 13 |
| Carlsen| Naka| 12 |
| Caruana| Giri| 14 |
| Giri| Anand| 10 |
| Grischuk| Carlsen| 7|
我想要的是每个玩家的游戏总数(黑+白),即他与任何其他大师的所有游戏。
| Player | Games_total|
|:---- |:------:|
| Anand| 33|
| Caruana| 27|
| Carlsen| 21|
| Naka| 12|
| Giri| 34|
| Grischuk| 9|
您可以获取长格式的数据,然后对每个Player
的Games
sum
。
library(dplyr)
library(tidyr)
df %>%
pivot_longer(cols = c(White, Black), values_to = 'Player') %>%
group_by(Player) %>%
summarise(Games_total = sum(Games))
# Player Games_total
# <chr> <int>
#1 Anand 23
#2 Carlsen 19
#3 Caruana 27
#4 Giri 24
#5 Grischuk 7
#6 Naka 12
数据
df <- structure(list(White = c("Anand", "Carlsen", "Caruana", "Giri",
"Grischuk"), Black = c("Caruana", "Naka", "Giri", "Anand", "Carlsen"
), Games = c(13L, 12L, 14L, 10L, 7L)), row.names = c(NA, -5L), class = "data.frame")
使用aggregate
尝试此解决方案
setNames( aggregate( Games ~ values,
cbind( stack(df1, c("White","Black")), Games=df1$Games), sum ),
c("Players","Games_total") )
Players Games_total
1 Anand 23
2 Carlsen 19
3 Caruana 27
4 Giri 24
5 Grischuk 7
6 Naka 12
df1 <- structure(list(White = c("Anand", "Carlsen", "Caruana", "Giri",
"Grischuk"), Black = c("Caruana", "Naka", "Giri", "Anand", "Carlsen"
), Games = c(13L, 12L, 14L, 10L, 7L)), class = "data.frame", row.names = c(NA,
-5L))
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.