Python：按組計數 dataframe 中的特定事件

Question

假設我有一個df：

df = pd.DataFrame({'id': [12, 35, 37, 67, 99, 78],
                  'product': ['banana', 'apple', 'banana', 'pear', 'banana', 'apple'],
                  'reordered': [1, 0, 0, 1, 1, 1]})


    id     product   reordered
0   12     banana    1
1   35     apple     0
2   37     banana    0
3   67     pear      1
4   99     banana    1
5   78     apple     1

我想計算“產品”列中產品的出現次數，以及按產品分組的“重新排序”列中的值。 期望的結果：

       product   count   reordered_0   reordered_1
   0   banana    3       1             2
   1   apple     2       1             1
   2   pear      1       1             0

請指教

Answer 1

使用帶有DataFrame.insert的crosstab作為第一個 position 的列：

df = pd.crosstab(df['product'], df.reordered).add_prefix('reordered_')
df.insert(0, 'count', df.sum(axis=1))
df = df.reset_index().rename_axis(None, axis=1)
print(df)
  product  count  reordered_0  reordered_1
0   apple      2            1            1
1  banana      3            1            2
2    pear      1            0            1

Answer 2

讓我們嘗試使用crosstab ：

(pd.crosstab(df['product'], df['reordered'])
   .add_prefix('reordered_')
   .assign(count=lambda x: x.sum(1))
   .reset_index()
)

Output：

reordered product  reordered_0  reordered_1  count
0           apple            1            1      2
1          banana            1            2      3
2            pear            0            1      1

Python：按組計數 dataframe 中的特定事件

問題描述

2 個解決方案

解決方案1
6 已采納 2021-05-19 04:55:17

解決方案2
5 2021-05-19 04:55:07

Python：按組計數 dataframe 中的特定事件

問題描述

2 個解決方案

解決方案1 6 已采納 2021-05-19 04:55:17

解決方案2 5 2021-05-19 04:55:07

解決方案1
6 已采納 2021-05-19 04:55:17

解決方案2
5 2021-05-19 04:55:07