自定义 groupby 函数 pandas python

Question

I have a following dataframe:我有以下数据框：

I would like to group by id and add a flag column which contains Y if anytime Y has occurred against id, resultant DF would like following:我想按 id 分组并添加一个包含 Y 的标志列，如果任何时候 Y 发生在 id 上，结果 DF 会如下所示：

Here is my approach which is too time consuming and not sure of correctness:这是我的方法太耗时且不确定正确性：

temp=pd.DataFrame()
j='flag'
for i in df['id'].unique():
  test=df[df['id']==i]
  test[j]=np.where(np.any((test[j]=='Y')),'Y',test[j])
temp=temp.append(test)

Answer 1

Compare flag to Y , group by id , and use any :将flag与Y进行比较，按id分组，并使用any ：

new_df = (df['flag'] == 'Y').groupby(df['id']).any().map({True:'Y', False:'N'}).reset_index()

Output:输出：

>>> new_df
   id flag
0   1    Y
1   2    Y
2   3    N
3   4    N
4   5    Y

Answer 2

You can do groupby + max since Y > N :你可以做groupby + max因为Y > N ：

df.groupby('id', as_index=False)['flag'].max()

自定义 groupby 函数 pandas python

问题描述

2 个解决方案

解决方案1
2 已采纳

解决方案2
2 2022-05-04 16:38:30

自定义 groupby 函数 pandas python

问题描述

2 个解决方案

解决方案1 2 已采纳

解决方案2 2 2022-05-04 16:38:30

解决方案1
2 已采纳

解决方案2
2 2022-05-04 16:38:30