
How to find mean of n previous rows in a column in pandas based on date criteria?

I have a dataset that looks like this:

value1  value2  value3  date
17      21      22      2005-04-01 12:05:00
19      20      24      2005-04-01 12:06:00
16      26      23      2005-04-01 12:07:00

I need to transform it so that each row whose date ends in :05:00 (the 5th minute of each hour) holds the average of the previous 60 rows.

I tried to use groupby based on the datetime; it does give average values for each hour (minutes 00-59), but I need to adjust it for my case.
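
Something along these lines (a rough sketch, not the exact code; column names as in the sample above):

import pandas as pd

# Group by the calendar hour: this yields one mean per hour (minutes 00-59),
# not a trailing 60-row average ending at the 5th minute
df['date'] = pd.to_datetime(df['date'])
hourly = df.groupby(df['date'].dt.floor('H'))[['value1', 'value2', 'value3']].mean()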

In the end I would like to have something like this:

value1  value2  value3  date
17      21      22      2005-04-01 12:05:00
19      20      24      2005-04-01 13:05:00
16      26      23      2005-04-01 14:05:00

where 17, for instance, is the average of the previous 60 values in the value1 column.

This will create a rolling mean over 60-minute windows (make sure the date column has datetime64[ns] dtype; if not, convert it beforehand), then you can select the necessary rows with .loc[]:

df.rolling('H', on='date').mean().loc[lambda x: x['date'].dt.minute == 5]

See the docs for further details on .rolling() and .loc[].
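
For reference, a minimal end-to-end sketch under those assumptions (the value columns and timestamps are illustrative, made-up data):

import pandas as pd
import numpy as np

# Illustrative per-minute readings covering three hours (made-up values)
rng = pd.date_range('2005-04-01 12:00:00', periods=180, freq='min')
df = pd.DataFrame({
    'value1': np.random.randint(10, 30, size=180),
    'value2': np.random.randint(10, 30, size=180),
    'value3': np.random.randint(10, 30, size=180),
    'date': rng.astype(str),  # raw data often arrives as strings
})

# Ensure the date column is datetime64[ns] before rolling on it
df['date'] = pd.to_datetime(df['date'])

# Rolling mean over a one-hour window keyed on the date column,
# then keep only the rows whose timestamp falls on the 5th minute
result = (
    df.rolling('H', on='date')
      .mean()
      .loc[lambda x: x['date'].dt.minute == 5]
)
print(result)

With the sample timestamps above, result keeps the rows at 12:05, 13:05 and 14:05, each holding the mean of the preceding hour of readings (the very first one averages fewer rows, since offset-based rolling defaults to min_periods=1).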
