pandas - groupby 多列并获得其他列的唯一计数

Question

我有大约 20 列的数据框。 但是，在这种情况下，我关注 3 个特定的列。

（日期，小时，cust_id）

现在我想绘制如下图：-

1) 'x-axis' 将显示从 00 到 23 的小时数。

2) 'y-axis' 将代表该小时（唯一客户 ID）客户的计数。

3) 图表中的每一行将代表绘制它的特定日期。

我尝试了以下：-

fig, ax = plt.subplots()
for label, df1 in df.groupby('date'):
    df1.groupby('hour').agg(customers = ('customer_id', 'nunique')).plot(ax=ax, label=label)

但是，上述问题很少：-

1) 每个折线图的标签都显示为“customer_id”，这应该是日期。

2）这张图是否正确？

Answer 1

尝试使用以下方法：

fig, ax = plt.subplots()
for label, df1 in df.groupby('date'):
    ax.plot(df1.groupby('hour').agg(customers = ('customer_id', 'nunique')), label=label)
ax.legend()

pandas - groupby 多列并获得其他列的唯一计数

问题描述

1 个解决方案

解决方案1
0 2019-12-23 05:07:40

pandas - groupby 多列并获得其他列的唯一计数

问题描述

1 个解决方案

解决方案1 0 2019-12-23 05:07:40

解决方案1
0 2019-12-23 05:07:40