根據條件添加行 - Dataframe

Question

我有一個數據框，如下所示：

我想根據以下邏輯添加一個新行：

添加一個“位置”作為“舞台區域”的新行
此行是“location”為“回復區-新商業區”的條目和“location”為“文化中心”的條目的總和。
將'location'的行刪除為“回復區-新商業區”和“文化中心”

因此，對於 2020 年 11 月 11 日，我應該有以下條目：

Answer 1

使用Series.isin過濾多個值，聚合總和添加列location並最后添加到原始 DataFrame 沒有匹配行的掩碼：

mask = df['location'].isin(["Reply's Area - New Commercial Area", 'Cultural Hub'])

df1 = (df[mask].groupby(['day','locationTypes'],as_index=False)[['dwell', 'football']]
              .sum()
              .assign(location = 'Stage Area')
              .reindex(df.columns, axis=1))

df = pd.concat([df[~mask], df1], ignore_index=True)

Answer 2

Jezrael 看起來他很接近答案，但也許足球的聚合不正確......僅僅從他的代碼來看，所以我可能是錯的。

正確的版本看起來像這樣，這與您在示例中建議的數字相匹配。 我制作了一個較小版本的示例表用於測試。 這里的“數據”是您的數據框。

mask = data["location"].isin(["Repley's Area - New Commercial Area", "Cultural Hub"])
data[mask].groupby(["day","locationTypes"], as_index=False)['dwell', 'football'].sum().assign(location="Stage Area")

輸出：

          day locationTypes  dwell  football    location
0  2020-11-11          Zone    145      2307  Stage Area
1  2020-11-12          Zone     95      2905  Stage Area

Answer 3

感謝您的回復！ 以下工作：

mask=df[df['location'].isin(["Repley's Area - New Commercial Area",'Cultural Hub'])]

df1=mask.groupby(['day','locationTypes'],as_index=False)['footfall','dwell (minutes)'].sum().assign(location='Stage Area')

#reordering the columns for pd.concat
df1= df1[df.columns]

df_final=pd.concat([df[~df['location'].isin(["Repley's Area - New Commercial Area",'Cultural Hub'])],df1]) 

#checking the result
df_final[(df_final['day']=='2020-11-11') & (df_final['location']=='Stage Area')]

＃這使

根據條件添加行 - Dataframe

問題描述

3 個解決方案

解決方案1
1 2020-11-24 11:26:21

解決方案2
1 2020-11-24 11:48:20

解決方案3
0 2020-11-24 12:00:52

根據條件添加行 - Dataframe

問題描述

3 個解決方案

解決方案1 1 2020-11-24 11:26:21

解決方案2 1 2020-11-24 11:48:20

解決方案3 0 2020-11-24 12:00:52

解決方案1
1 2020-11-24 11:26:21

解決方案2
1 2020-11-24 11:48:20

解決方案3
0 2020-11-24 12:00:52