Find last occurrence of a max value in a row and get Column name in Pandas Python

Question

I have the below dataframe called df:

Id	Stage1	Stage2	Stage3
1	2022-02-01	2020-04-03	2022-06-07
2	2023-06-07	2020-03-01	2020-09-03
3	2023-02-04	2023-06-07	2022-06-07
4	2022-05-08	2023-09-01	2023-09-01

I need to find the max date for each Id and the Stage it belongs to. So for Orders 1, 2, 3, and 4, the stages I need are Stage 3, Stage 1, Stage 2, and Stage 3 respectively. I understand that using

df.filter(like="Stage").idxmax(axis=1)

finds the first occurrence of the max date in a row and gives me its column name. However, for Order 4, Stage 2 and Stage 3 have the same date. I need Stage 3 as my answer because it is the latest stage of the order. How can I do this?

Answer 1

Reverse the column order so that idxmax, which returns the first maximum it encounters, picks the last occurrence of the maximum value:

df.filter(like="Stage").iloc[:, ::-1].idxmax(axis=1)
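
To check this against the sample data, here is a minimal sketch. It rebuilds the dataframe from the question by hand and assumes the stage columns should be parsed as datetimes with pd.to_datetime; the names stage_cols, first_max, and last_max are only illustrative. Note that filter(like=...) is case-sensitive, so the pattern has to be "Stage" to match these column names.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame(
    {
        "Id": [1, 2, 3, 4],
        "Stage1": ["2022-02-01", "2023-06-07", "2023-02-04", "2022-05-08"],
        "Stage2": ["2020-04-03", "2020-03-01", "2023-06-07", "2023-09-01"],
        "Stage3": ["2022-06-07", "2020-09-03", "2022-06-07", "2023-09-01"],
    }
)

# Assumption: the dates arrive as strings, so convert them to datetimes first.
stage_cols = ["Stage1", "Stage2", "Stage3"]
df[stage_cols] = df[stage_cols].apply(pd.to_datetime)

stages = df.filter(like="Stage")

# idxmax returns the first column holding the row maximum.
first_max = stages.idxmax(axis=1)

# Reversing the column order makes the last such column come first.
last_max = stages.iloc[:, ::-1].idxmax(axis=1)

print(last_max.tolist())
```

For Id 4, where Stage2 and Stage3 share the same date, first_max gives Stage2 while last_max gives Stage3.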

solution1
1 2022-09-20 10:36:42