Return rows in pandas based on values in multiple columns

Question

Needed some help with pandas...I'm working on this data and I'm trying to calculate some changes over time per region. Basically, I'm trying to find the oldest quantity and the newest quantity for each area in question. I have code that can give me the year of the most recent and oldest data recordes, however I need to gather the whole row so I can work on the 'quantity' column. Any inputs? here is what i have :

df.groupby(['Country or Area'])['Year'].max()

Thanks in advance!

Answer 1

df = df.sort_values(by=['Country or Area','Year'])
df.groupby('Country or Area').agg(['first','last']).stack()

Answer 2

Use idxmin() and idxmax(). Something like:

grp = df.groupby(['Country or Area'])

for name,group in grp:
    print(name)

    minidx = group['Year'].idxmin()
    maxidx = group['Year'].idxmax()

    print(f"min: {group['Year'][minidx]} {group['Quantity'][minidx]}")
    print(f"max: {group['Year'][maxidx]} {group['Quantity'][maxidx]}")
    print()

Answer 3

您可以使用idxmin和idxmax获取最旧和最新idxmax

df.loc[df.groupby(['Country or Area'])['Year'].idxmin()]

Answer 4

You need to use agg functions of groupby()

You can pass the functions or a dict of functions to the columns you need to aggregate

In your case the code should be like Crish solution is the better way to do it.

Sort the dataframe by the value to check and then group and get by .agg() the result that you need

The stack() method works to deflate the df level

Return rows in pandas based on values in multiple columns

Question

4 answers

solution1
1 2020-02-15 01:46:51

solution2
1 2020-02-15 02:03:14

solution3
0 ACCPTED 2020-02-15 01:51:54

solution4
0 2020-02-15 02:20:25

Return rows in pandas based on values in multiple columns

Question

4 answers

solution1 1 2020-02-15 01:46:51

solution2 1 2020-02-15 02:03:14

solution3 0 ACCPTED 2020-02-15 01:51:54

solution4 0 2020-02-15 02:20:25

solution1
1 2020-02-15 01:46:51

solution2
1 2020-02-15 02:03:14

solution3
0 ACCPTED 2020-02-15 01:51:54

solution4
0 2020-02-15 02:20:25