简体   繁体   中英

pandas replace column with mean for values

I have a pandas dataframe and want replace each value with the mean for it.

    ID    X     Y
    1     a     1
    2     a     2
    3     a     3
    4     b     2
    5     b     4

How do I replace Y values with mean Y for every unique X?

    ID    X     Y
    1     a     2
    2     a     2
    3     a     2
    4     b     3
    5     b     3

Use transform :

df['Y'] = df.groupby('X')['Y'].transform('mean')
print (df)
   ID  X  Y
0   1  a  2
1   2  a  2
2   3  a  2
3   4  b  3
4   5  b  3

For new column in another DataFrame use map with drop_duplicates :

df1 = pd.DataFrame({'X':['a','a','b']})
print (df1)
   X
0  a
1  a
2  b

df1['Y'] = df1['X'].map(df.drop_duplicates('X').set_index('X')['Y'])
print (df1)
   X  Y
0  a  2
1  a  2
2  b  3

Another solution:

df1['Y'] = df1['X'].map(df.groupby('X')['Y'].mean())
print (df1)
   X  Y
0  a  2
1  a  2
2  b  3

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM