如何向 pandas dataframe 添加一列，该列在一个范围内具有最高值但将其应用于每一行？

Question

I have the following code:我有以下代码：

import pandas as pd
import numpy as np

df = pd.DataFrame([['red', 1], ['red', 13], ['red', 1], ['blue', 1], ['red', 112], ['blue', 10]])

df.columns = ["colour","rank"]

# df['highest_rank'] = ...

print(df)

"""
  colour  rank  highest_rank
0    red     1     122
1    red    13     122
2    red     1     122
3   blue     1     10
4    red   112     122
5   blue    10     10
"""

Hopefully, the example can show you what I'm trying to do as I'm struggling to describe what I'm wanting - The highest ranking of each colour.希望该示例可以向您展示我正在努力做的事情，因为我正在努力描述我想要的东西 - 每种颜色的最高排名。

Answer 1

groupby colour and broadcast the highest rank in each group using transform. groupby 颜色并使用变换广播每组中的最高排名。 Code below下面的代码

df['highest_rank']=df.groupby('colour')['rank'].transform('max')




colour  rank  highest_rank
0    red     1           112
1    red    13           112
2    red     1           112
3   blue     1            10
4    red   112           112
5   blue    10            10

如何向 pandas dataframe 添加一列，该列在一个范围内具有最高值但将其应用于每一行？

问题描述

1 个解决方案

解决方案1
2 已采纳 2022-01-10 20:39:20

如何向 pandas dataframe 添加一列，该列在一个范围内具有最高值但将其应用于每一行？

问题描述

1 个解决方案

解决方案1 2 已采纳 2022-01-10 20:39:20

解决方案1
2 已采纳 2022-01-10 20:39:20