如何将多列中的最大值返回到 pandas df 中的新列

Question

Apologies for the opaque question name (not sure how to word it).为不透明的问题名称道歉（不知道如何措辞）。 I have the following dataframe:我有以下 dataframe：

import pandas as pd
import numpy as np

data = [['tom', 1,1,6,4],
        ['tom', 1,2,2,3],
        ['tom', 1,2,3,1],
        ['tom', 2,3,2,7],
        ['jim', 1,4,3,6],
        ['jim', 2,6,5,3]]

df = pd.DataFrame(data, columns = ['Name', 'Day','A','B','C']) 
df = df.groupby(by=['Name','Day']).agg('sum').reset_index()
df

I would like to add another column that returns text according to which column of A,B,C is the highest:我想添加另一列，根据A,B,C的哪一列最高返回文本：

For example I would like Apple if A is highest, Banana if B is highest, and Carrot if C is highest.例如，如果A最高，我想要Apple ，如果B最高，我想要Banana ，如果C最高，我想要Carrot 。 So in the example above the values for the 4 columns should be:因此，在上面的示例中，4 列的值应该是：

New Col
Carrot
Apple
Banana
Carrot

Any help would be much appreciated!任何帮助将非常感激！ Thanks谢谢

Answer 1

Use DataFrame.idxmax along axis=1 with Series.map :使用DataFrame.idxmax沿axis=1和Series.map ：

dct = {'A': 'Apple', 'B': 'Banana', 'C': 'Carrot'}
df['New col'] = df[['A', 'B', 'C']].idxmax(axis=1).map(dct)

Result:结果：

  Name  Day  A   B  C New col
0  jim    1  4   3  6  Carrot
1  jim    2  6   5  3   Apple
2  tom    1  5  11  8  Banana
3  tom    2  3   2  7  Carrot

Answer 2

@ShubhamSharma's answer is better than this, but here is another option: @ShubhamSharma 的答案比这更好，但这是另一种选择：

df['New col'] = np.where((df['A'] > df['B']) & (df['A'] > df['C']), 'Apple', 'Carrot')
df['New col'] = np.where((df['B'] > df['A']) & (df['B'] > df['C']), 'Banana', df['New col'])

output: output：

    Name    Day A   B   C   New col
0   jim 1   4   3   6   Carrot
1   jim 2   6   5   3   Apple
2   tom 1   5   11  8   Banana
3   tom 2   3   2   7   Carrot

如何将多列中的最大值返回到 pandas df 中的新列

问题描述

2 个解决方案

解决方案1
4 已采纳 2020-07-13 11:02:48

解决方案2
1 2020-07-13 11:13:55

如何将多列中的最大值返回到 pandas df 中的新列

问题描述

2 个解决方案

解决方案1 4 已采纳 2020-07-13 11:02:48

解决方案2 1 2020-07-13 11:13:55

解决方案1
4 已采纳 2020-07-13 11:02:48

解决方案2
1 2020-07-13 11:13:55