[英]Summarize Dates
I want to summarize dates from Monday to Monday but on Saturdays and Sundays accumulate on Friday我想总结周一到周一的日期,但周六和周日累积到周五
## A tibble: 10 x 2
#date value
#<date> <dbl>
# 1 2020-01-02 1 --- Thurs
#2 2020-01-03 1 --- Fri
#3 2020-01-04 2 --- Sat
#4 2020-01-06 1 --- Mon
#5 2020-01-07 1 --- Tues
#6 2020-01-08 1 --- Wed
#7 2020-01-09 1 --- Thurs
#8 2020-01-10 1 --- Fri
#9 2020-01-11 2 --- Sat
#10 2020-01-13 1 --- Mon
This is what i want这就是我要的
#date value
#<date> <dbl>
# 1 2020-01-02 1 --- Thurs
#2 2020-01-03 3 --- Fri
#4 2020-01-06 1 --- Mon
#5 2020-01-07 1 --- Tues
#6 2020-01-08 1 --- Wed
#7 2020-01-09 1 --- Thurs
#8 2020-01-10 3 --- Fri
#10 2020-01-13 1 --- Mon
dd = structure(list(date = structure(c(18263, 18264, 18265, 18267, 18268, 18269, 18270, 18271, 18272, 18274), class = "Date"), value = c(1, 1, 2, 1, 1, 1, 1, 1, 2, 1)), row.names = c(NA, -10L), class = c("tbl_df", "tbl", "data.frame"))
We can use the weekdays
to replace
the 'date' that are on weekends to NA
, then fill
the NA
s with previous non-NA element (assuming the 'date' is already arrange
d, and then do a group_by
sum
我们可以使用
weekdays
将周末的 'date' replace
为NA
,然后用之前的非 NA 元素fill
NA
s(假设 'date' 已经arrange
d,然后做一个group_by
sum
library(dplyr)
library(tidyr)
dd %>%
mutate(Weekday = weekdays(date),
date = replace(date, Weekday %in% c("Saturday", "Sunday"), NA)) %>%
fill(date) %>%
group_by(date) %>%
summarise(value = sum(value), .groups = 'drop')
-output -输出
# A tibble: 8 x 2
# date value
# <date> <dbl>
#1 2020-01-02 1
#2 2020-01-03 3
#3 2020-01-06 1
#4 2020-01-07 1
#5 2020-01-08 1
#6 2020-01-09 1
#7 2020-01-10 3
#8 2020-01-13 1
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.