Using dplyr to calculate the percentage of NA in each column

Question

I have a data frame in which some columns contain missing values. Is there a way (using dplyr) to efficiently calculate the percentage of each column that is missing, i.e. NA? Something like a colSums equivalent, so that I don't have to calculate the percentage missing for each column individually?

Answer 1

First, I created some test data for you:

a <- c(1, NA, NA, 4)
b <- c(NA, 2, 3, 4)
x <- data.frame(a, b)
x
#    a  b
# 1  1 NA
# 2 NA  2
# 3 NA  3
# 4  4  4

Then you can use colMeans(is.na(x)), which returns the fraction of missing values in each column (multiply by 100 for a percentage):

colMeans(is.na(x))
#    a    b 
# 0.50 0.25
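Since the question asks for a percentage rather than a fraction, here is a minimal sketch of the same base R idea scaled by 100. The name pct_missing is only illustrative and not part of the original answer.

```r
# Same test data as above
x <- data.frame(a = c(1, NA, NA, 4), b = c(NA, 2, 3, 4))

# is.na(x) is a logical matrix; colMeans() gives the fraction of TRUE
# values per column, and multiplying by 100 turns it into a percentage
pct_missing <- 100 * colMeans(is.na(x))
pct_missing
#  a  b 
# 50 25
```

No dplyr is needed for this variant, which can be handy when the data frame is not already part of a pipeline.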

Answer 2

We can use summarise_each (since deprecated in newer dplyr releases):

 library(dplyr)
 x %>% 
   summarise_each(funs(100*mean(is.na(.))))
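For readers on a recent dplyr, a rough equivalent of the call above can be written with across(). This is a sketch that assumes dplyr 1.0.0 or later is installed; it is not the original answerer's code, and pct_missing is an illustrative name.

```r
library(dplyr)

# Same test data as in Answer 1
x <- data.frame(a = c(1, NA, NA, 4), b = c(NA, 2, 3, 4))

# across(everything(), ...) applies the lambda to every column;
# assumes dplyr >= 1.0.0, where across() is available
pct_missing <- x %>%
  summarise(across(everything(), ~ 100 * mean(is.na(.x))))
pct_missing
#    a  b
# 1 50 25
```

The result is a one-row data frame, so it can be piped on to further dplyr verbs.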

Answer 3

I like the brevity of the purrr::map approach:

library(purrr)  # provides map() and re-exports %>%
x %>% map(~ mean(is.na(.)))
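Note that map() returns a list with one element per column. If a plain named numeric vector is preferred, map_dbl() can be used instead; the sketch below also scales the result to a percentage. The name pct_missing is illustrative only.

```r
library(purrr)

# Same test data as in Answer 1
x <- data.frame(a = c(1, NA, NA, 4), b = c(NA, 2, 3, 4))

# map_dbl() iterates over the columns and returns a named numeric
# vector rather than the list that map() produces
pct_missing <- map_dbl(x, ~ 100 * mean(is.na(.x)))
pct_missing
#  a  b 
# 50 25
```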

3 answers

Answer 1
16, accepted, 2015-11-04 04:12:46

Answer 2
16, 2015-11-04 04:20:45

Answer 3
4, 2017-06-06 15:50:16