如何用模式/平均值估算 pandas dataframe 中的整个缺失值？

Question

I know codes forfilling seperately by taking each column as below我知道通过将每一列分别填充的代码如下

data['Native Country'].fillna(data['Native Country'].mode(), inplace=True)

But i am working on a dataset with 50 rows and there are 20 categorical values which need to be imputed.但我正在处理一个有 50 行的数据集，并且有 20 个分类值需要估算。 Is there a single line code for imputing the entire data set??是否有用于估算整个数据集的单行代码？

Answer 1

Use DataFrame.fillna with DataFrame.mode and select first row because if same maximum occurancies is returned all values:将DataFrame.fillna与DataFrame.mode和 select 第一行一起使用，因为如果返回相同的最大出现次数，则所有值：

data = pd.DataFrame({
        'A':list('abcdef'),
         'col1':[4,5,4,5,5,4],
         'col2':[np.nan,8,3,3,2,3],
         'col3':[3,3,5,5,np.nan,np.nan],
         'E':[5,3,6,9,2,4],
         'F':list('aaabbb')
})

cols = ['col1','col2','col3']

print (data[cols].mode())
   col1  col2  col3
0     4   3.0   3.0
1     5   NaN   5.0

data[cols] = data[cols].fillna(data[cols].mode().iloc[0])
    
print (data)
   A  col1  col2  col3  E  F
0  a     4   3.0   3.0  5  a
1  b     5   8.0   3.0  3  a
2  c     4   3.0   5.0  6  a
3  d     5   3.0   5.0  9  b
4  e     5   2.0   3.0  2  b
5  f     4   3.0   3.0  4  b

如何用模式/平均值估算 pandas dataframe 中的整个缺失值？

问题描述

1 个解决方案

解决方案1
2 已采纳 2021-03-10 13:45:28

如何用模式/平均值估算 pandas dataframe 中的整个缺失值？

问题描述

1 个解决方案

解决方案1 2 已采纳 2021-03-10 13:45:28

解决方案1
2 已采纳 2021-03-10 13:45:28