How to fill in DataFrame values based on multiple conditions

Question

ID	Create Date	Last Modified Date
1	March 31, 2021 8:56	March 31, 2021 09:46
1	March 31, 2021 5:56	March 31, 2021 09:48
2	March 31, 2021 0:23	March 31, 2021 09:47
2	March 31, 2021 6:56	March 31, 2021 09:46
3	March 31, 2021 7:32	March 31, 2021 09:46
3	March 31, 2021 8:45	March 31, 2021 09:46

Hello,

For the table above, I need to mark the row with the earliest Create Date for each ID as "Minimal".

import os
from tkinter import filedialog

import pandas as pd

inputFolder = os.getcwd()
filename = filedialog.askopenfilename(title="Select file:", filetypes=(("xlsx files", ".xlsx"), ("all files", "*.*")), initialdir = inputFolder)
df = pd.read_excel(filename, index_col=None, header=0)

df.loc[(df.groupby(['BB Global ID']).agg({'Create Date': min})), 'Comment'] = 'Minimal'

print(df)

I tried to do this with pandas df.loc, but I got the following error.

KeyError: "None of [Index([('C', 'r', 'e', 'a', 't', 'e', ' ', 'D', 'a', 't', 'e')], dtype='object')] are in the [index]"

This is the final result I want to achieve:

ID	Create Date	Last Modified Date	Comment
1	March 31, 2021 8:56	March 31, 2021 09:46
1	March 31, 2021 5:56	March 31, 2021 09:48	Minimal
2	March 31, 2021 0:23	March 31, 2021 09:47	Minimal
2	March 31, 2021 6:56	March 31, 2021 09:46
3	March 31, 2021 7:32	March 31, 2021 09:46	Minimal
3	March 31, 2021 8:45	March 31, 2021 09:46

Answer 1

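Use GroupBy.transform, which repeats each group's aggregated value on every row of that group, so the result can be compared against the original column: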

# True on rows whose 'Create Date' equals the minimum for their 'BB Global ID'
mask = df.groupby(['BB Global ID'])['Create Date'].transform('min').eq(df['Create Date'])
df.loc[mask, 'Comment'] = 'Minimal'
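
To see why transform works here where agg does not, here is a minimal sketch. It assumes the six rows from the question are typed in by hand rather than read from Excel, and that 'Create Date' is a datetime column. The column names 'BB Global ID' and 'Create Date' come from the question's code. The names grouped, per_group and per_row are only illustrative.

```python
import pandas as pd

# Sample rows from the question, typed in by hand (assumption: the real data
# comes from Excel and 'Create Date' is already parsed as datetime).
df = pd.DataFrame({
    'BB Global ID': [1, 1, 2, 2, 3, 3],
    'Create Date': pd.to_datetime([
        '2021-03-31 08:56', '2021-03-31 05:56',
        '2021-03-31 00:23', '2021-03-31 06:56',
        '2021-03-31 07:32', '2021-03-31 08:45',
    ]),
})

grouped = df.groupby('BB Global ID')['Create Date']

# agg collapses each group to a single row, so the result is indexed by ID
# and cannot be used directly as a row mask for df.loc.
per_group = grouped.agg('min')

# transform repeats each group's minimum on every row of that group,
# so the result shares df's index and can be compared row by row.
per_row = grouped.transform('min')

mask = per_row.eq(df['Create Date'])
df.loc[mask, 'Comment'] = 'Minimal'
print(df)
```

If several rows in a group share the same earliest date, this mask marks all of them.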

Or:

import numpy as np

df['Comment'] = np.where(mask, 'Minimal', '')
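
The two variants differ in what they leave on the rows that are not the minimum. A small sketch of the difference, assuming a hypothetical four-row miniature of the question's data (via_loc and via_where are illustrative names):

```python
import numpy as np
import pandas as pd

# Hypothetical miniature of the question's data; column names are taken
# from the question's code.
df = pd.DataFrame({
    'BB Global ID': [1, 1, 2, 2],
    'Create Date': pd.to_datetime([
        '2021-03-31 08:56', '2021-03-31 05:56',
        '2021-03-31 00:23', '2021-03-31 06:56',
    ]),
})
mask = df.groupby('BB Global ID')['Create Date'].transform('min').eq(df['Create Date'])

# Variant 1: assign only where the mask is True; the other rows stay NaN.
via_loc = df.copy()
via_loc.loc[mask, 'Comment'] = 'Minimal'

# Variant 2: np.where fills every row, using '' where the mask is False.
via_where = df.copy()
via_where['Comment'] = np.where(mask, 'Minimal', '')

print(via_loc['Comment'].isna().sum())      # number of rows left as NaN
print((via_where['Comment'] == '').sum())   # number of rows filled with ''
```

The df.loc form is probably the better fit when the column should read as missing elsewhere. np.where is more convenient if an empty string is wanted, for example before writing back to Excel.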

Solution 1
3 Accepted 2021-04-16 07:09:34
