根据数据表中的组和子组的计数创建变量

Question

I have many student records.我有很多学生记录。 I need to create two new variable.我需要创建两个新变量。 One should display the count of Unitcode (ie enrolments) for each student_ID for each Year .应该显示每个Year每个student_ID的Unitcode （即注册）计数。

One should display the count of Fail (ie Grade=='Fail') for each student_ID for each Year .应该显示每个Year每个student_ID的Fail计数（即 Grade=='Fail'）。 See the example of records for three students below:请参阅以下三个学生的记录示例：

   student_ID=c(rep("1001",8),rep("1002",3),rep("1005",11))
   Year=c(rep(2011,4),rep(2012,4),2011,2012,2013,rep(2011,4),rep(2012,3),rep(2013,4))
   Grade=c(rep("Fail",2),rep("Pass",3),rep("Fail",3),rep("Pass",7),rep("Fail",2),rep("Pass",5))
   Unitcode<-c(1201:1222)
   record<-data.table(student_ID, Year, Grade, Unitcode)

If someone could assist with counting new variables that would be greatly appreciated.如果有人可以协助计算新变量，将不胜感激。

Answer 1

A similar option using dplyr would be使用dplyr的类似选项是

library(dplyr)
record %>%
     group_by(student_ID, Year) %>%
     summarise(unitcodes=n(), fails=sum(Grade=='Fail'))

根据数据表中的组和子组的计数创建变量

问题描述

1 个解决方案

解决方案1
1 已采纳 2015-11-06 04:25:56

根据数据表中的组和子组的计数创建变量

问题描述

1 个解决方案

解决方案1 1 已采纳 2015-11-06 04:25:56

解决方案1
1 已采纳 2015-11-06 04:25:56