R 如何在初始 group_by 后总结两个不同的组

Question

I have the following I would like to do in one go instead of making two different results and doing a union:我想一次性完成以下操作，而不是产生两个不同的结果并进行联合：

delivery_stats= data.frame(service=c("UberEats", "Seamless","UberEats", "Seamless"),
                            status = c("OnTime", "OnTime", "Late", "Late"),
                            totals = c(235, 488, 32, 58))   

ds1 = filter(delivery_stats, service =="UberEats") %>%
         group_by(service, status) %>% 
         summarise(count_status = sum(totals)) %>%
         mutate(avg_of_status = count_status/sum(count_status))

#now do the same for Seamless, then union...

Answer 1

Provided I have understood you correctly, do you mean this?如果我理解正确，你是这个意思吗？

delivery_stats %>%
    group_by(service) %>%
    mutate(n = sum(totals)) %>%
    transmute(
        status,
        count_status = totals,
        avg_of_status = count_status/n)
## A tibble: 4 x 4
## Groups:   service, status [4]
#  service  status count_status avg_of_status
#  <fct>    <fct>         <dbl>         <dbl>
#1 UberEats OnTime          235         0.880
#2 Seamless OnTime          488         0.894
#3 UberEats Late             32         0.120
#4 Seamless Late             58         0.106

Explanation: First group by service to calculate the sum of totals by service ;说明：由第一组service来计算的总和totals由service ; then group by service and status to calculate the mean (across service ) of count_status = totals .然后按service和status分组以计算count_status = totals的平均值（跨service ）。

Answer 2

You also try base R using ave with the help of within .您也尝试使用基础R ave的帮助下within 。

res <- within(delivery_stats, {
  count_status <- ave(totals, service, status, FUN=mean)
  avg_of_status <- count_status / ave(totals, service, FUN=sum)
})
res
#    service status totals avg_of_status count_status
# 1 UberEats OnTime    235     0.8801498          235
# 2 Seamless OnTime    488     0.8937729          488
# 3 UberEats   Late     32     0.1198502           32
# 4 Seamless   Late     58     0.1062271           58

Answer 3

As said above, I didn't have to filter and it would have worked fine for both groups:如上所述，我不必过滤，它对两个组都可以正常工作：

delivery_stats= data.frame(service=c("UberEats", "Seamless","UberEats", "Seamless"),
                            status = c("OnTime", "OnTime", "Late", "Late"),
                            totals = c(235, 488, 32, 58))



ds1 =    group_by(delivery_stats, service, status) %>% 
         summarise(count_status = sum(totals)) %>%
         mutate(avg_of_status = count_status/sum(count_status))

# A tibble: 4 x 4
# Groups:   service [2]
  service  status count_status avg_of_status
  <fct>    <fct>         <dbl>         <dbl>
1 Seamless Late             58         0.106
2 Seamless OnTime          488         0.894
3 UberEats Late             32         0.120
4 UberEats OnTime          235         0.880

R 如何在初始 group_by 后总结两个不同的组

问题描述

3 个解决方案

解决方案1
1 2020-02-28 04:22:40

解决方案2
0 2020-02-28 06:13:01

解决方案3
-1 2020-02-28 04:31:24

R 如何在初始 group_by 后总结两个不同的组

问题描述

3 个解决方案

解决方案1 1 2020-02-28 04:22:40

解决方案2 0 2020-02-28 06:13:01

解决方案3 -1 2020-02-28 04:31:24

解决方案1
1 2020-02-28 04:22:40

解决方案2
0 2020-02-28 06:13:01

解决方案3
-1 2020-02-28 04:31:24