Pandas：用平均值填充系列中的空白

Question

Given df给定 df

df = pd.DataFrame({'distance': [0,1,2,np.nan,3,4,5,np.nan,np.nan,6]})

   distance
0       0.0
1       1.0
2       2.0
3       NaN
4       3.0
5       4.0
6       5.0
7       NaN
8       NaN
9       6.0

I want to replace the nans with the inbetween mean我想用中间平均值替换 nans

Expected output:预计 output：

   distance
0       0.0
1       1.0
2       2.0
3       2.5
4       3.0
5       4.0
6       5.0
7       5.5
8       5.5
9       6.0

I have seen this_answer but it's for a grouping which isn't my case and I couldn't find anything else.我已经看到了 this_answer但它是针对一个不是我的情况的分组，我找不到其他任何东西。

Answer 1

If you don't want df.interpolate you can compute the mean of the surrounding values manually with df.bfill and df.ffill如果你不想df.interpolate你可以用df.bfill和df.ffill手动计算周围值的平均值

(df.ffill() + df.bfill()) / 2

Out:出去：

   distance
0       0.0
1       1.0
2       2.0
3       2.5
4       3.0
5       4.0
6       5.0
7       5.5
8       5.5
9       6.0

Answer 2

How about using linear interpolation?使用线性插值怎么样？

print(df.distance.interpolate())

0    0.000000
1    1.000000
2    2.000000
3    2.500000
4    3.000000
5    4.000000
6    5.000000
7    5.333333
8    5.666667
9    6.000000
Name: distance, dtype: float64

Pandas：用平均值填充系列中的空白

问题描述

2 个解决方案

解决方案1
2 已采纳 2020-11-13 21:29:57

解决方案2
1 2020-11-13 21:20:45

Pandas：用平均值填充系列中的空白

问题描述

2 个解决方案

解决方案1 2 已采纳 2020-11-13 21:29:57

解决方案2 1 2020-11-13 21:20:45

解决方案1
2 已采纳 2020-11-13 21:29:57

解决方案2
1 2020-11-13 21:20:45