Finding the max of one column when the value in another column is X

Question

How do you find the max value of one column (column B) among the rows where column C = X, while keeping the label from column A? Let's say my data is called my.data, where column A = country name, column B = number of children born, and column C = year the child was born. How do I find the maximum number of children born in the year 2001, keeping the name of the country in line?

Thanks, I'm very sorry, I'm new to R

Answer 1

There are many options in R (and many questions on SO) for doing this kind of operation.

I will give a data.table solution because I like the easy syntax for this kind of query.

data.table

data.table is efficient for large data sets. It also gives a very easy syntax for subsetting (.SD references the subset created by i and by).

library(data.table)
DT <- data.table(my.data)
DT[year==2001, .SD[which.max(births)]]

Or, equivalently, without needing .SD:

DT[year==2001][which.max(births)]

example data

my.data <- expand.grid(
  Country = c('Swaziland', 'Australia', 'Tuvalu', 'Turkmenistan'),
  year = 1990:2012 )
my.data$births <- rpois(nrow(my.data), lambda = 500)
DT <- data.table(my.data)
DT[year==2001, .SD[which.max(births)]]

##      Country year births
## 1: Swaziland 2001    501

using base R

births_2001 <- subset(my.data, year == 2001)
births_2001[which.max(births_2001$births),]

##      Country year births
## 45 Swaziland 2001    501
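The base R approach above can be wrapped in a small function for the general "max of column B where column C = X" case. This is only a sketch: max_row_where and the toy data frame are hypothetical names made up for illustration, not part of base R or of the original answer.

```r
# Hypothetical helper (not part of base R): return the row of `df` with the
# largest `value_col` among the rows where `filter_col` equals `x`.
max_row_where <- function(df, value_col, filter_col, x) {
  # which() drops rows where the comparison is NA
  keep <- df[which(df[[filter_col]] == x), , drop = FALSE]
  if (nrow(keep) == 0) return(keep)  # no matching rows
  keep[which.max(keep[[value_col]]), , drop = FALSE]
}

# Small made-up data set for illustration
toy <- data.frame(
  Country = c("Australia", "Tuvalu", "Australia", "Tuvalu"),
  year    = c(2000, 2000, 2001, 2001),
  births  = c(510, 480, 495, 530)
)

res <- max_row_where(toy, "births", "year", 2001)
print(res)
```

Like which.max itself, this returns only the first row if several rows share the maximum.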

Answer 2

There are a number of ways to do this. I'll break it up so you can hopefully see what's going on better.

 my.data <- data.frame(
    country=c("Australia","France","Germany","Honduras","Nepal","Honduras"),
    children=c(120000,354000,380000,540000,370000,670000),
    year=c(2000,2001,2001,2002,2001,2003)
    )

 myd01 <- my.data[my.data$year==2001,]  # pulls out the 2001 data
 myd01[myd01$children==max(myd01$children),]  # finds the rows with the maximum
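The comparison with max() above returns every row that ties for the maximum, whereas which.max() returns only the first one. A minimal sketch of the difference, using made-up data with a deliberate tie (the names tied, d01, first_only and all_ties are hypothetical):

```r
# Made-up data with a tie for the 2001 maximum
tied <- data.frame(
  country  = c("France", "Germany", "Nepal", "Honduras"),
  children = c(380000, 380000, 370000, 540000),
  year     = c(2001, 2001, 2001, 2002)
)
d01 <- tied[tied$year == 2001, ]

first_only <- d01[which.max(d01$children), ]            # first maximal row only
all_ties   <- d01[d01$children == max(d01$children), ]  # every tied row

print(first_only)
print(all_ties)
```

Which of the two is appropriate depends on whether ties matter for the question being asked.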

Answer 3

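aggregate(. ~ year, data = my.data, FUN = max)

This will also solve the problem, with one caveat: max is applied to each column separately, so the country shown in a row is not necessarily the one that had the maximum.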
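To get the per-year maximum together with its country, one option is to aggregate only the births column and merge the result back onto the data. A minimal sketch on made-up data (toy, max_by_year and res_by_year are hypothetical names):

```r
# Made-up data for illustration
toy <- data.frame(
  Country = c("Australia", "Tuvalu", "Australia", "Tuvalu"),
  year    = c(2000, 2000, 2001, 2001),
  births  = c(510, 480, 495, 530)
)

# Maximum births per year (result has only the year and births columns)
max_by_year <- aggregate(births ~ year, data = toy, FUN = max)

# Join back on year and births to recover the matching country rows
res_by_year <- merge(max_by_year, toy, by = c("year", "births"))
print(res_by_year)
```

If several countries tie for the maximum in a year, the merge keeps all of them.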
