Python Pandas subtract value of row from value of previous row

Question

I have the following,

import pandas as pd

data = [['AAA','2019-01-01', 10], ['AAA','2019-01-02', 21],
        ['AAA','2019-02-01', 30], ['AAA','2019-02-02', 45],
        ['BBB','2019-01-01', 50], ['BBB','2019-01-02', 60],
        ['BBB','2019-02-01', 70],['BBB','2019-02-02', 80]]

dfx = pd.DataFrame(data, columns = ['NAME', 'TIMESTAMP','VALUE'])

  NAME   TIMESTAMP  VALUE
0  AAA  2019-01-01     10
1  AAA  2019-01-02     21
2  AAA  2019-02-01     30
3  AAA  2019-02-02     45
4  BBB  2019-01-01     50
5  BBB  2019-01-02     60
6  BBB  2019-02-01     70
7  BBB  2019-02-02     80

I want to generate a new column which lists the difference of the VALUE column for the current row from the previous row.

So the output would look somewhat like this,

  NAME   TIMESTAMP  VALUE  DIFF
0  AAA  2019-01-01     10  
1  AAA  2019-01-02     21  11
2  AAA  2019-02-01     30   9
3  AAA  2019-02-02     45  15
4  BBB  2019-01-01     50
5  BBB  2019-01-02     60  10
6  BBB  2019-02-01     70  10
7  BBB  2019-02-02     80  10

Regards.

Answer 1

You could do:

dfx['DIFF'] = dfx.groupby('NAME')['VALUE'].apply(lambda x: x - x.shift()).fillna(0)
print(dfx)

Output

  NAME   TIMESTAMP  VALUE  diff
0  AAA  2019-01-01     10   0.0
1  AAA  2019-01-02     21  11.0
2  AAA  2019-02-01     30   9.0
3  AAA  2019-02-02     45  15.0
4  BBB  2019-01-01     50   0.0
5  BBB  2019-01-02     60  10.0
6  BBB  2019-02-01     70  10.0
7  BBB  2019-02-02     80  10.0

Answer 2

A simpler solution:

dfx.groupby('NAME').diff()

Python Pandas subtract value of row from value of previous row

Question

2 answers

solution1
1 ACCPTED 2019-10-30 15:33:48

solution2
1 2019-10-30 15:35:01

Python Pandas subtract value of row from value of previous row

Question

2 answers

solution1 1 ACCPTED 2019-10-30 15:33:48

solution2 1 2019-10-30 15:35:01

solution1
1 ACCPTED 2019-10-30 15:33:48

solution2
1 2019-10-30 15:35:01