Python pandas .map saves only last edit

Question

I am trying to use pandas .map to edit a dataset as in the following code:

df['Region'] = df['Region'].astype('category')
reg = df['Region']
cats = reg.cat.categories
ncats = len(cats)
n = len(os)

north = (...)
south = (...)
center = (...)
islands = (...)

d1 = {cats[i]:'South' for i in range(ncats) if cats[i] in south}
d2 = {cats[i]:'North' for i in range(ncats) if cats[i] in north}
d3 = {cats[i]:'Center' for i in range(ncats) if cats[i] in center}
d4 = {cats[i]:'Islands' for i in range(ncats) if cats[i] in islands}

df['Reg_cat'] = df['Region'].map(d1)
df['Reg_cat'] = df['Region'].map(d2)
df['Reg_cat'] = df['Region'].map(d3)
df['Reg_cat'] = df['Region'].map(d4)
df['Reg_cat'] = df['Reg_cat'].astype('category')
df['Reg_cat'].cat.categories
df['Reg_cat']

The code runs, but only the last .map call takes effect. In this case that is d4; if I move d1 to the end, d1 is applied instead. What am I doing wrong?

Answer 1

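Series.map returns NaN for every value that is not a key in the mapper, and each assignment overwrites df['Reg_cat'] with that new result. So after the fourth call, only the regions covered by d4 have a value and everything else is NaN.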
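To see the behaviour in isolation, here is a minimal sketch. The region names and the two partial mappers are made up for illustration; they are not taken from the question.

```python
import pandas as pd

# Hypothetical region names, chosen only for illustration.
regions = pd.Series(['Lazio', 'Sicilia', 'Veneto'])

# Two partial mappers, each covering only some of the regions.
d_center = {'Lazio': 'Center'}
d_islands = {'Sicilia': 'Islands'}

# Values that are not keys in the mapper come back as NaN.
first = regions.map(d_center)
second = regions.map(d_islands)

# Assigning `second` to the same column as `first` would discard `first`.
print(first.isna().tolist())
print(second.isna().tolist())
```

Each result is a complete new Series, so nothing from an earlier call carries over unless you merge the results yourself.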

Try building a single dictionary and passing that instead.

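# label -> collection of region names
m = {'North': north, 'South': south, 'Center': center, 'Islands': islands}
# invert it: region name -> label
d = {v2: k for k, v in m.items() for v2 in v}

# map the original Region column, not Reg_cat
df['Reg_cat'] = df['Region'].map(d)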

Note:

you don't need reg
you don't need cats
you don't need ncats
you also (not surprisingly) don't need n, whatever that is
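Putting the pieces together, a self-contained sketch of the single-dictionary approach might look like this. The contents of north, south, center and islands are elided in the question, so the tuples and the sample DataFrame below are placeholders I made up.

```python
import pandas as pd

# Hypothetical groups; the real ones are elided in the question.
north = ('Veneto', 'Piemonte')
south = ('Puglia', 'Calabria')
center = ('Lazio', 'Toscana')
islands = ('Sicilia', 'Sardegna')

# Hypothetical sample data.
df = pd.DataFrame({'Region': ['Lazio', 'Sicilia', 'Veneto', 'Puglia', 'Sardegna']})

# label -> group of region names
m = {'North': north, 'South': south, 'Center': center, 'Islands': islands}
# Invert to region name -> label so a single map call covers every region.
d = {region: label for label, group in m.items() for region in group}

df['Reg_cat'] = df['Region'].map(d).astype('category')
print(df)
```

Any region missing from all four groups would still come out as NaN, which is a handy way to spot names you forgot to classify.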

Answer 2

Every time you call df['Reg_cat'] = df['Region'].map(d#), you overwrite the previous contents of df['Reg_cat']. If you'd like to keep all the values, consider adding them as separate columns.
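A rough sketch of the separate-columns idea follows. The sample data, the mappers and the column names (Reg_north and so on) are hypothetical. The final fillna chain is one possible way to fold the columns back into one, not something this answer itself proposes.

```python
import pandas as pd

# Hypothetical sample data and partial mappers.
df = pd.DataFrame({'Region': ['Lazio', 'Sicilia', 'Veneto']})
d_north = {'Veneto': 'North'}
d_center = {'Lazio': 'Center'}
d_islands = {'Sicilia': 'Islands'}

# One column per mapper, so no result overwrites another.
df['Reg_north'] = df['Region'].map(d_north)
df['Reg_center'] = df['Region'].map(d_center)
df['Reg_islands'] = df['Region'].map(d_islands)

# Optional: fill the gaps of one column from the next to get a single column.
combined = df['Reg_north'].fillna(df['Reg_center']).fillna(df['Reg_islands'])
print(combined.tolist())
```

If the end goal is a single column, merging the dictionaries before mapping is usually simpler than merging columns afterwards.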

solution1
3 ACCEPTED 2018-05-03 18:51:27

solution2
0 2018-05-03 18:49:18
