How to perform cumsum with reset at 0 in R?
I have a table and I want to get the cumulative sum within each group (by ID), but the running total should reset whenever Counter is 0 within a group and then start accumulating again from 1.
ID Counter Cumulative
A 1 1
A 0 0
A 1 1
A 1 2
B 1 1
B 0 0
B 1 1
Create a temporary group column that starts a new group every time you encounter a 0.
library(dplyr)

# grp increases by 1 at every 0, so each 0 starts a new sub-group within ID
result <- df %>%
  group_by(ID, grp = cumsum(Counter == 0)) %>%
  mutate(Cumulative = cumsum(Counter)) %>%
  ungroup() %>%
  select(-grp)
result
# ID Counter Cumulative
# <chr> <int> <int>
#1 A 1 1
#2 A 0 0
#3 A 1 1
#4 A 1 2
#5 B 1 1
#6 B 0 0
#7 B 1 1
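To see why the temporary column works, it can help to look at the grouping key on its own. The following is only a small sketch using the sample data from this answer; the variable name grp_key is made up for illustration.

```r
# Sample data from the answer (7 rows)
df <- data.frame(
  ID = c("A", "A", "A", "A", "B", "B", "B"),
  Counter = c(1L, 0L, 1L, 1L, 1L, 0L, 1L)
)

# Counter == 0 is TRUE at every reset point; cumsum() turns that into a
# counter that steps up by 1 at each 0 (grp_key is a hypothetical name)
grp_key <- cumsum(df$Counter == 0)
cbind(df, grp_key)
```

Note that the key is computed over the whole column, so the first row of B ends up with the same key as the last rows of A. That is why ID has to be part of the grouping as well.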
The same logic can be implemented in base R and data.table as:
df$Cumulative <- with(df, ave(Counter, ID, cumsum(Counter == 0), FUN = cumsum))
library(data.table)
setDT(df)[, Cumulative := cumsum(Counter), .(ID, cumsum(Counter == 0))]
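If this is needed in several places, the base R version could be wrapped in a small helper. This is just a sketch; the function name cumsum_reset and its arguments are assumptions for illustration, not part of any package.

```r
# Hypothetical helper: cumulative sum of x within group, restarting at each 0
cumsum_reset <- function(x, group) {
  # ave() applies cumsum within every combination of group and reset key
  ave(x, group, cumsum(x == 0), FUN = cumsum)
}

df <- data.frame(
  ID = c("A", "A", "A", "A", "B", "B", "B"),
  Counter = c(1L, 0L, 1L, 1L, 1L, 0L, 1L)
)
df$Cumulative <- cumsum_reset(df$Counter, df$ID)
df
```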
data
df <- structure(list(ID = c("A", "A", "A", "A", "B", "B", "B"), Counter = c(1L,
0L, 1L, 1L, 1L, 0L, 1L)), class = "data.frame", row.names = c(NA, -7L))
An alternative approach could be to use accumulate() from purrr:
library(dplyr)
library(purrr)  # provides accumulate()
df %>% group_by(ID) %>%
  mutate(cs = accumulate(Counter, ~ifelse(.y == 0, .y, .x + .y)))
Checking it on the data provided by @Ronak in his comments:
df <- structure(list(ID = c("A", "A", "A", "A", "A", "B", "B", "B"), Counter = c(1L, 0L, 1L, 1L, 1L, 1L, 0L, 1L)), class = "data.frame", row.names = c(NA, -8L))
df %>% group_by(ID) %>%
mutate(cs = accumulate(Counter, ~ifelse(.y == 0, .y, .x + .y)))
# A tibble: 8 x 3
# Groups: ID [2]
ID Counter cs
<chr> <int> <int>
1 A 1 1
2 A 0 0
3 A 1 1
4 A 1 2
5 A 1 3
6 B 1 1
7 B 0 0
8 B 1 1