How do I get the sum of frequency counts based on two columns?

Question

Assume that the data frame is stored as someData and has the following format:

ID                Team                Games                Medal
1                 Australia           1992 Summer          NA
2                 Australia           1994 Summer          Gold
3                 Australia           1992 Summer          Silver
4                 United States       1991 Winter          Gold
5                 United States       1992 Summer          Bronze
6                 Singapore           1991 Summer          NA

How would I count the frequencies of medals by Team, while excluding NA as a value? At the same time, each country's counts should be summed into a single total, rather than displayed separately for Gold, Silver and Bronze.

In other words, I am trying to display the total number of medals per country, not counting NA.

I have tried something like this:

library(plyr)
counts <- ddply(someData, .(Team, Medal), nrow)
names(counts) <- c("Country", "Medal", "Freq")
counts

But this just gives me a massive table with every medal type listed separately for every country, including NA.

What I would like to get is the following:

Australia            2
United States        2

Any help would be greatly appreciated.

Thank you!

Answer 1

We can use count from dplyr after filtering out the rows where Medal is NA:

library(dplyr)
someData %>%
  filter(!is.na(Medal)) %>%
  count(Team)
# A tibble: 2 x 2
#  Team              n
#  <fct>         <int>
#1 Australia         2
#2 United States     2

Answer 2

You can do that in base R with table and colSums. table drops NA by default, so a team with no medals (here, Singapore) appears with a count of 0:

colSums(table(someData$Medal, someData$Team))
    Australia     Singapore United States 
            2             0             2
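
The output above still lists Singapore with 0, whereas the question asks only for teams that have medals. A minimal base R sketch of two ways to get that, assuming Team is stored as a character column (with a factor column, the first variant would keep unused levels as zero counts). The name medalRows is introduced here for illustration only, and the data frame is rebuilt inline so the block runs on its own:

```r
# Rebuild the example data with the same values as in the question.
someData <- data.frame(
  ID    = 1:6,
  Team  = c("Australia", "Australia", "Australia",
            "United States", "United States", "Singapore"),
  Games = c("1992 Summer", "1994 Summer", "1992 Summer",
            "1991 Winter", "1992 Summer", "1991 Summer"),
  Medal = c(NA, "Gold", "Silver", "Gold", "Bronze", NA),
  stringsAsFactors = FALSE
)

# Variant 1: keep only rows that have a medal, then count rows per team.
medalRows <- someData[!is.na(someData$Medal), ]
medalCounts <- table(medalRows$Team)
print(medalCounts)

# Variant 2: start from the colSums() result and drop the zero entries.
totals <- colSums(table(someData$Medal, someData$Team))
print(totals[totals > 0])
```

Both variants leave out teams whose only entries are NA, which matches the desired output in the question.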

Data

someData = read.table(text="ID        Team        Games         Medal
1                 Australia           '1992 Summer'          NA
2                 Australia           '1994 Summer'          Gold
3                 Australia           '1992 Summer'          Silver
4                 'United States'     '1991 Winter'          Gold
5                 'United States'     '1992 Summer'          Bronze
6                 Singapore           '1991 Summer'          NA",
header=TRUE)

Answer 1
1 Accepted 2018-09-16 01:18:46

Answer 2
0 2018-09-16 01:14:17