Apply Min or Max function to arrays when NA exist in R

Question

I have a question looks simple but really drives me crazy. I really need your help.

First lets generate some data.frame

a<-c(rep(1:2,2),NA,NA)
b<-c(rep(NA,3),3,4,NA)
df<-cbind(a,b)

This will give a table as:

      a  b
[1,]  1 NA
[2,]  2 NA
[3,]  1 NA
[4,]  2  3
[5,] NA  4
[6,] NA NA

Now I need a third column which will be:

When both a and b are not NA, return the max value in both.
When one of them are not NA, return the non-NA number
When both them are NA, return NA.

To sum up, I am looking for the result like this:

      a  b  c
[1,]  1 NA  1
[2,]  2 NA  2
[3,]  1 NA  1
[4,]  2  3  3
[5,] NA  4  4
[6,] NA NA NA

I tried df$c<-max(df$a,df$b) and obviously this doesn't work and give me:

Error in df$a : $ operator is invalid for atomic vectors

Could someone help me please? Thank you very much!!

Answer 1

You could try pmax after converting the dataset ('df' is 'matrix') to 'data.frame'

cbind(df, c=do.call(`pmax`, c(as.data.frame(df), list(na.rm=TRUE))))
#      a  b  c
#[1,]  1 NA  1
#[2,]  2 NA  2
#[3,]  1 NA  1
#[4,]  2  3  3
#[5,] NA  4  4
#[6,] NA NA NA

If you need the "min" value for each row, replace pmax with pmin . To create a 'data.frame', you could use

df <- data.frame(a, b)

cbind get the output as 'matrix'. $ operator won't work with 'matrix', so it is better to use [

Answer 2

You can also use the "regular" max function:

df <- cbind(df, c = apply(df, 1, function(x) ifelse(all(is.na(x)), NA, max(x, na.rm=T))))

df
#      a  b  c
#[1,]  1 NA  1
#[2,]  2 NA  2
#[3,]  1 NA  1
#[4,]  2  3  3
#[5,] NA  4  4
#[6,] NA NA NA

Apply Min or Max function to arrays when NA exist in R

Question

2 answers

solution1
1 2015-02-16 15:22:49

solution2
1 2015-02-16 15:33:53

Apply Min or Max function to arrays when NA exist in R

Question

2 answers

solution1 1 2015-02-16 15:22:49

solution2 1 2015-02-16 15:33:53

solution1
1 2015-02-16 15:22:49

solution2
1 2015-02-16 15:33:53