基於另一個 dataframe 中的兩列的 dataframe 中的行條件計數

Question

我想向df2添加一列，其中包括df1中具有匹配Herd和Ddat值的行數。

import pandas as pd 
    
df1 = [[52, '1', '1/1/2020'], [54, '1', '1/1/2020'],
       [55, '2', '1/1/2020'], [56, '3', '1/1/1999']]
    
df = pd.DataFrame(df1, columns =['Cow','Herd', 'Ddat'])

df2 = [['1', '1/1/2020'], ['1', '1/5/2020'],
       ['2', '1/1/2020'], ['3', '1/1/1999']]
    
df2 = pd.DataFrame(df2, columns =['Herd', 'Ddat'])

我正在尋找的 output 是

Herd    Ddat       Count
1     1/1/2020        2
1     1/5/2020        0
2     1/1/2020        1
3     1/1/1999        1

Answer 1

您可以利用索引的優點：

cols = ['Herd', 'Ddat']
new_df = df2.set_index(cols).assign(Count=df.groupby(cols).count()).fillna(0).astype({'Count': int}).reset_index()

Output：

>>> new_df
  Herd      Ddat  Count
0    1  1/1/2020      2
1    1  1/5/2020      0
2    2  1/1/2020      1
3    3  1/1/1999      1

基於另一個 dataframe 中的兩列的 dataframe 中的行條件計數

問題描述

1 個解決方案

解決方案1
0 2021-12-12 02:06:50

基於另一個 dataframe 中的兩列的 dataframe 中的行條件計數

問題描述

1 個解決方案

解決方案1 0 2021-12-12 02:06:50

解決方案1
0 2021-12-12 02:06:50