Pandas Dataframe resample using groupby

Question

I have a dataframe in pandas of the following form:

                       timestap  price    bid    ask  volume
0       2014-06-04 12:11:03.058  21.11  41.12   0.00       0
1       2014-06-04 12:11:03.386  21.17  41.18   0.00       0
2       2014-06-04 12:11:03.435  21.20  41.21   0.00       0
3       2014-06-04 12:11:04.125  21.17  41.19   0.00       0
4       2014-06-04 12:11:04.245  21.16  41.17   0.00       0

What I should do:

Put timestap instead of index
Resample timestamp using groupby (timestap should be grouped by second)
Show the first and the last digit of each column on the equal date and time

The final dataframe should be look like this:

                            price           bid         ask    volume
           timestap    min    max    min    max   min   max  min  max
2014-06-04 12:11:03  21.11  21.20  41.12  41.21  0.00  0.00    0    0
2014-06-04 12:11:04  21.16  21.17  41.17  41.19  0.00  0.00    0    0

What I have now:

import pandas as pd
data = pd.read_csv('table.csv')
data.columns = ['timestap', 'bid', 'ask', 'price', 'volume']
data = data.set_index(data.time)
bydate = data.groupby(pd.TimeGrouper(freq='s'))

Something going wrong on my code and I don't have an idea, how to do the last task. Can you help me?

Answer 1

Use agg function and pass a list of aggregation functions to it with either resample or pd.TimeGrouper :

# make sure the timestamp column is of date time type
df['timestap'] = pd.to_datetime(df['timestap'])

df.set_index('timestap').resample("s").agg(["min", "max"])

Or use TimeGrouper :

df.set_index('timestap').groupby(pd.TimeGrouper(freq='s')).agg(['min', 'max'])

Pandas Dataframe resample using groupby

Question

1 answers

solution1
4 ACCPTED 2017-03-02 16:11:01

Pandas Dataframe resample using groupby

Question

1 answers

solution1 4 ACCPTED 2017-03-02 16:11:01

solution1
4 ACCPTED 2017-03-02 16:11:01