Select Pandas dataframe 行，其中两列或多列一起具有最大值

Question

Suppose you have a pandas.DataFrame like so:假设你有一个pandas.DataFrame像这样：

Institution机构	Feat1专长1	Feat2壮举2	Feat3壮举3	... ...
ID1 ID1	14.5 14.5	0 0	0.32 0.32	... ...
ID2 ID2	322.12 322.12	1 1	0.94 0.94	... ...
ID3 ID3	27.08 27.08	0 0	1.47 1.47	... ...

My question is simple: how would one select rows from this dataframe based on the maximum combined values from two or more columns.我的问题很简单：如何根据两列或多列的最大组合值从 dataframe 中获得一个 select 行。 For example:例如：

I want to select rows where the columns Feat1 and Feat3 have their maximum value together , returning:我想 select 列Feat1和Feat3一起具有最大值的行，返回：

Institution机构	Feat1专长1	Feat2壮举2	Feat3壮举3	... ...
ID2 ID2	322.12 322.12	1 1	0.94 0.94	... ...

I am certain a good old for loop can take care of the problem given a little time, but I believe there must be a Pandas function for that, hope someone point me in the right direction.我确信一个好的旧 for 循环可以解决这个问题，但我相信必须有一个 Pandas function ，希望有人指出我正确的方向。

Answer 1

You can play arround with:你可以玩arround：

df.sum(axis=1)

df['row_sum'] = df.sum(axis=1)

or或者

df['sum'] = df['col1' ] + df['col3']

And then:接着：

df.sort(['sum' ],ascending=[False or True])

df.sort_index()

Answer 2

You can do it with slicing:你可以用切片来做到这一点：

output = df.loc[(df['Feat1'] + df['Feat3']).to_frame().idxmax(),:]

This outputs:这输出：

  Institution   Feat1  Feat2  Feat3
1         ID2  322.12      1   0.94

Alternatively you can always create a column and slice through it, but this would require a bit of an extra effort.或者，您始终可以创建一个列并对其进行切片，但这需要一些额外的努力。

df['filter'] = df['Feat1'] + df['Feat3']
output = df[df['filter'] == df['filter'].max()]

Select Pandas dataframe 行，其中两列或多列一起具有最大值

问题描述

2 个解决方案

解决方案1
0 2021-11-18 22:26:11

解决方案2
0 2021-11-18 22:46:05

Select Pandas dataframe 行，其中两列或多列一起具有最大值

问题描述

2 个解决方案

解决方案1 0 2021-11-18 22:26:11

解决方案2 0 2021-11-18 22:46:05

解决方案1
0 2021-11-18 22:26:11

解决方案2
0 2021-11-18 22:46:05