按區域然后按自定義組匯總事件

Question

我有以下數據集，它采用 2 列數據集並根據規定的 CustomerAge 創建年齡組類別。

    library(tidyverse)
    
    df <- 
      read.table(textConnection("Area   CustomerAge
    A 28 
    A 40
    A 70
    A 19
    B 13
    B 12
    B 72
    B 90"), header=TRUE)
    

df2 <- df %>% 
  mutate(
    # Create categories
    Customer_Age_Group = dplyr::case_when(
      CustomerAge <= 18            ~ "0-18",
      CustomerAge > 18 & CustomerAge <= 60 ~ "19-60",
      CustomerAge > 60             ~ ">60"
    ))

我希望實現的是 output 摘要，如下所示：

區域	客戶_年齡_組	出現次數
一種	0-18歲	0
一種	19-59	3個
一種	>60	1個
乙	0-18歲	2個
乙	19-59	0
乙	>60	2個

Answer 1

要包括 0 次出現，您需要count() 、 ungroup()和complete() ：

df2 %>% group_by(Area, Customer_Age_Group,.drop = FALSE) %>% 
count() %>% 
ungroup() %>% 
complete(Area, Customer_Age_Group, fill=list(n=0))

這也將顯示 0 次出現。

要按區域和年齡組排序：

df2 %>% group_by(Area, Customer_Age_Group,.drop = FALSE) %>% 
count() %>% 
ungroup() %>% 
complete(Area, Customer_Age_Group, fill=list(n=0)) %>% 
arrange(Area, parse_number(Customer_Age_Group))

Answer 2

group_by和summarise是你要找的。

df2 %>% group_by(Area, Customer_Age_Group) %>% summarise(Occurences = n())

但是請注意，這不會顯示數據集中出現次數為零的類別。

按區域然后按自定義組匯總事件

問題描述

2 個解決方案

解決方案1
1 已采納 2022-10-06 10:46:28

解決方案2
0 2022-10-06 10:15:15

按區域然后按自定義組匯總事件

問題描述

2 個解決方案

解決方案1 1 已采納 2022-10-06 10:46:28

解決方案2 0 2022-10-06 10:15:15

解決方案1
1 已采納 2022-10-06 10:46:28

解決方案2
0 2022-10-06 10:15:15