Pandas: assign a value to a column for the last row of each group

Question

How can I assign a value of my choosing to the last row of each group (assuming I have already sorted the DataFrame)?

import pandas as pd

# data
df = pd.DataFrame([['a', 1], ['a', 2], ['b', 1], ['b', 2]],
                  columns=['colA', 'colB'])

# create a new col
df['colC'] = 'Not Current'

# my attempt -- groupby col of interest, get last row, apply value to 'colC' column
df.loc[df.reset_index().groupby('colA').tail(1), 'colC'] = 'Current'

Answer 1

You can fix your attempt by calling .index. df.loc expects row labels, not the DataFrame that tail(1) returns:

df.loc[df.groupby('colA').tail(1).index, 'colC'] = 'Current'
df
Out[105]: 
  colA  colB         colC
0    a     1  Not Current
1    a     2      Current
2    b     1  Not Current
3    b     2      Current
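To see why the fix works, it may help to look at what tail(1) returns before indexing with it. The following is a minimal sketch that reuses the question's data; the names last_rows and last_labels are introduced here for illustration only.

```python
import pandas as pd

df = pd.DataFrame([['a', 1], ['a', 2], ['b', 1], ['b', 2]],
                  columns=['colA', 'colB'])
df['colC'] = 'Not Current'

# tail(1) on a groupby returns a DataFrame containing the last row of
# each group, and those rows keep their original row labels.
last_rows = df.groupby('colA').tail(1)

# The row labels are what df.loc accepts as a row indexer.
last_labels = last_rows.index.tolist()

# Assign only to the rows whose labels were collected above.
df.loc[last_rows.index, 'colC'] = 'Current'
```

Passing the DataFrame itself to df.loc, as in the question, does not work because it is not a valid row indexer; its index is.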

Answer 2

Use loc with duplicated:

df['colC'] = 'Not Current'
not_last_rows = df['colA'].duplicated(keep='last')
df.loc[~not_last_rows, 'colC'] = 'Current'

Or, in your case, np.where:

import numpy as np

not_last_rows = df['colA'].duplicated(keep='last')
df['colC'] = np.where(not_last_rows, 'Not Current', 'Current')

Output:

  colA  colB         colC
0    a     1  Not Current
1    a     2      Current
2    b     1  Not Current
3    b     2      Current
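Both answers pick the "last" row by position, so the result depends on the DataFrame being sorted, as the question assumes. The sketch below starts from a shuffled copy of the question's data, sorts it first, and then checks that the duplicated approach and the groupby/tail approach select the same rows. The shuffled order and the choice of colB as the sort key are assumptions made for illustration.

```python
import numpy as np
import pandas as pd

# Same rows as in the question, but in shuffled order (illustrative assumption).
df = pd.DataFrame([['a', 2], ['b', 1], ['a', 1], ['b', 2]],
                  columns=['colA', 'colB'])

# Sort so the last row of each colA group is the one with the largest colB.
df = df.sort_values(['colA', 'colB']).reset_index(drop=True)

# duplicated(keep='last') is True for every row except the last
# occurrence of each colA value.
not_last_rows = df['colA'].duplicated(keep='last')
df['colC'] = np.where(not_last_rows, 'Not Current', 'Current')

# Cross-check: the rows marked 'Current' should be the rows tail(1) selects.
labels_from_duplicated = df[~not_last_rows].index.tolist()
labels_from_tail = df.groupby('colA').tail(1).index.tolist()
```

If the sort step were skipped, both approaches would still agree with each other, but they would mark whichever row happens to appear last for each group rather than the one with the largest colB.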

Answer 1
3, accepted, 2021-11-10 18:53:11

Answer 2
2, 2021-11-10 18:52:16