R: Calculate standard deviation in cols in a data.frame despite of NA-Values

Question

Good Morning, I got a lot of data and i have to calculate with it. There are 25 columns (variables) and each column contains thousands of values. But also missing values. I calculated the mean with

colMeans(df, na.rm = TRUE)

How can i calculate the sd of each column and ignore the NA-values?

Answer 1

You can try,

apply(df, 2, sd, na.rm = TRUE)

As the output of apply is a matrix, and you will most likely have to transpose it, a more direct and safer option is to use lapply or sapply as noted by @docendodiscimus,

sapply(df, sd, na.rm = TRUE)

Answer 2

If we convert to matrix , colSds from matrixStats can be used

library(matrixStats)
colSds(as.matrix(df), na.rm=TRUE)

Or we can use summarise_each from dplyr

library(dplyr)
df1 %>%
    summarise_each(funs(sd(., na.rm=TRUE)))

Answer 3

由于函数summarise_each summarise_each()已被弃用，这里是使用dplyr的最新示例：

df1 %>% summarise_all(funs(sd(., na.rm = FALSE)))

Answer 4

sd(variablenname,na.rm=TRUE)

This works for me. Replace "variablename" with the variable you use.

R: Calculate standard deviation in cols in a data.frame despite of NA-Values

Question

4 answers

solution1
6 ACCPTED 2016-06-14 08:11:35

solution2
3 2016-06-14 08:35:14

solution3
0 2018-08-13 16:22:19

solution4
-1 2018-09-27 11:28:23

R: Calculate standard deviation in cols in a data.frame despite of NA-Values

Question

4 answers

solution1 6 ACCPTED 2016-06-14 08:11:35

solution2 3 2016-06-14 08:35:14

solution3 0 2018-08-13 16:22:19

solution4 -1 2018-09-27 11:28:23

solution1
6 ACCPTED 2016-06-14 08:11:35

solution2
3 2016-06-14 08:35:14

solution3
0 2018-08-13 16:22:19

solution4
-1 2018-09-27 11:28:23