如何在R中满足特定条件时获得连续出现的平均值

Question

I have a data that has payment schedule for customers that is ordered by transaction date.I want to calculate the average number of consecutive failed payments and average number of consecutive success payments.The table looks like below:我有一个data ，其中包含按交易日期订购的客户付款时间表。我想计算连续失败付款的平均次数和连续成功付款的平均次数。表格如下所示：

customer_id |transaction_id.|failed_or_success  | transaction_date 
  1         |1              |success            |2021-01-01
  1         |2              |success            |2021-01-15
  1         |3              |failed             |2021-01-30
  1         |4              |success            |2021-02-15

For example, the average number of consecutive success payment would be (2+1)/2=1.5 , the first 2 comes from transaction_id 1 & 2.the second 1 comes from transaction_id 4. And the average number of consecutive failed payment would just be 1 in this example.例如，平均连续支付成功次数为(2+1)/2=1.5 ，前2来自 transaction_id 1 & 2，第二个1来自 transaction_id 4。而连续支付失败的平均次数为在本例中为 1。 Eventually the table would look like this:最终表格将如下所示：

cus_id |tran_id.|f_or_s |tran_date  |avg_consec_fail|avg_consec_success
  1    |1       |success|2021-01-01 |1              |1.5
  1    |2       |success|2021-01-15 |1              |1.5
  1    |3       |failed |2021-01-30 |1              |1.5
  1    |4       |success|2021-02-15 |1              |1.5

How do I make this happen with R/dplyr ?我如何使用R/dplyr实现这一点？

Answer 1

You may try using rle您可以尝试使用rle

Data数据

df <- read.table(text = "customer_id transaction_id. failed_or_success   transaction_date 
  1         1              success            2021-01-01
  1         2              success            2021-01-15
  1         3              failed             2021-01-30
  1         4              success            2021-02-15", header = TRUE)

Code代码

df %>%
  mutate(avg_consec_success = mean(rle(failed_or_success)$length[rle(failed_or_success)$values != "failed"]))

  customer_id transaction_id. failed_or_success transaction_date avg_consec_success
1           1               1           success       2021-01-01                1.5
2           1               2           success       2021-01-15                1.5
3           1               3            failed       2021-01-30                1.5
4           1               4           success       2021-02-15                1.5

如何在R中满足特定条件时获得连续出现的平均值

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-11-10 01:29:13

Data数据

Code代码

如何在R中满足特定条件时获得连续出现的平均值

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-11-10 01:29:13

Data数据

Code代码

解决方案1
1 已采纳 2021-11-10 01:29:13