PANDAS 按 2 列分组然后计数并取平均值

Question

I have a data frame of users and each time they entered a website, it looks like this:我有一个用户数据框，每次他们进入一个网站时，它看起来像这样：

(if there are x row with same week and date it means the user entered the site x time that date). （如果有 x 行具有相同的星期和日期，则表示用户在该日期的 x 时间进入了站点）。

ID ID	week星期	date日期
1 1个	2 2个	20/07/21 20/07/21
2 2个	3 3个	23/07/21 23/07/21
2 2个	3 3个	23/07/21 23/07/21
2 2个	3 3个	26/07/21 26/07/21
2 2个	4 4个	30/07/21 30/07/21
2 2个	4 4个	30/07/21 30/07/21
2 2个	4 4个	30/07/21 30/07/21
2 2个	4 4个	31/07/21 21 年 7 月 31 日

so far I've managed to do this:到目前为止，我已经设法做到了这一点：

ID ID	week星期	date日期	days number天数
1 1个	2 2个	20/07/21 20/07/21	1 1个
2 2个	3 3个	23/07/21 23/07/21	2 2个
2 2个	3 3个	26/07/21 26/07/21	1 1个
2 2个	4 4个	30/07/21 30/07/21	3 3个
2 2个	4 4个	31/07/21 21 年 7 月 31 日	1 1个

using this code:使用此代码：

df.groupby(['ID','week','date']).agg({'date':['count']})

but I need to calculate the mean times each user used the site by week, so each user has a row for each week.但我需要计算每个用户每周使用该网站的平均时间，因此每个用户每周都有一行。 Therefor the output I need looks like this:因此，我需要的 output 如下所示：

ID ID	week星期	mean days number平均天数
1 1个	2 2个	1 1个
2 2个	3 3个	1.5 1.5
2 2个	4 4个	2 2个

Any ideas how to continue?任何想法如何继续？

Thanks!!谢谢！！

Answer 1

Use:使用：

(df.groupby(['ID', 'week', 'date'], as_index=False)['date']
 .agg('count')
 .groupby(['ID', 'week'], as_index=False)
 .agg(**{'mean days number': ('date', 'mean')})
)

Output: Output：

   ID  week  mean days number
0   1     2               1.0
1   2     3               1.5
2   2     4               2.0

PANDAS 按 2 列分组然后计数并取平均值

问题描述

1 个解决方案

解决方案1
0 已采纳 2023-01-30 17:58:02

PANDAS 按 2 列分组然后计数并取平均值

问题描述

1 个解决方案

解决方案1 0 已采纳 2023-01-30 17:58:02

解决方案1
0 已采纳 2023-01-30 17:58:02