Build a df from the sum of column values

Question

I need to group the data by cust_id and get the sum of purchases for each month. My data looks like this:

cust_id        months
1               1
1               1
1               2
1               4
2               1
2               1

So I need to see the sum of purchases for each month and each customer. The desired output is:

cust_id     mo1     mo2     mo3     mo4
1           2       1       0       1
2           2       0       0       0

Answer 1

Use crosstab with DataFrame.reindex to add the missing categories:

import pandas as pd

# df is the frame from the question (columns cust_id and months)
r = range(df['months'].min(), df['months'].max() + 1)
df = (pd.crosstab(df['cust_id'], df['months'])
        .reindex(r, axis=1, fill_value=0)
        .add_prefix('mo'))
print(df)
months   mo1  mo2  mo3  mo4
cust_id                    
1          2    1    0    1
2          2    0    0    0
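
To check the approach end to end, it can be run against the sample data from the question. This is a minimal sketch: the names `sample` and `counts` are illustrative and not from the original post, and it assumes each row represents one purchase.

```python
import pandas as pd

# Sample data copied from the question; one row per purchase (assumption).
sample = pd.DataFrame({
    "cust_id": [1, 1, 1, 1, 2, 2],
    "months": [1, 1, 2, 4, 1, 1],
})

# Every month between the smallest and largest observed month.
r = range(sample["months"].min(), sample["months"].max() + 1)

counts = (
    pd.crosstab(sample["cust_id"], sample["months"])  # count rows per (customer, month)
    .reindex(r, axis=1, fill_value=0)                 # add month 3, which never occurs
    .add_prefix("mo")                                 # rename 1 to "mo1", 2 to "mo2", ...
)
print(counts)
```

Because crosstab only creates columns for values that occur in the data, the reindex step is what fills in month 3 with zeros.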

If you need all twelve months, one option is an ordered categorical (starting again from the original df):

df['months'] = pd.Categorical(df['months'], ordered=True, categories=range(1, 13))

df = df.groupby(['cust_id','months']).size().unstack(fill_value=0).add_prefix('mo')
print (df)
months   mo1  mo2  mo3  mo4  mo5  mo6  mo7  mo8  mo9  mo10  mo11  mo12
cust_id                                                               
1          2    1    0    1    0    0    0    0    0     0     0     0
2          2    0    0    0    0    0    0    0    0     0     0     0
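
The categorical approach depends on groupby keeping the categories that have no rows. The default for the `observed` parameter has been changing across pandas versions, so it may be safer to pass `observed=False` explicitly. A minimal sketch under that assumption, again with illustrative names (`sample`, `by_month`) that are not from the original post:

```python
import pandas as pd

# Sample data copied from the question.
sample = pd.DataFrame({
    "cust_id": [1, 1, 1, 1, 2, 2],
    "months": [1, 1, 2, 4, 1, 1],
})

# Declare all twelve months as categories, including those with no rows.
sample["months"] = pd.Categorical(
    sample["months"], categories=range(1, 13), ordered=True
)

by_month = (
    sample.groupby(["cust_id", "months"], observed=False)  # keep unobserved months
    .size()                                                # rows per (customer, month)
    .unstack(fill_value=0)                                 # months become columns
    .add_prefix("mo")
)
print(by_month)
```

With `observed=True`, months that never occur would be dropped from the result, and the output would no longer have twelve columns.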

Or, starting again from the original df, reindex by a range covering all months:

r = range(1, 13)
df = (pd.crosstab(df['cust_id'],df['months'])
        .reindex(r, axis=1, fill_value=0)
        .add_prefix('mo'))
print (df)
months   mo1  mo2  mo3  mo4  mo5  mo6  mo7  mo8  mo9  mo10  mo11  mo12
cust_id                                                               
1          2    1    0    1    0    0    0    0    0     0     0     0
2          2    0    0    0    0    0    0    0    0     0     0     0
