Removing rows of data frame not satisfying a criteria in a specific column of the data frame (in R)

Question

I sincerely apologize if this question have already been asked, but I've searched high and low and could not find any similar question.

I am working on a data frame in R, which is rather complex, so I shall short it down to the following data frame:

data<-data.frame(x=c("ENG","ITA","RUS","SLO"),y=c(4,3,6,4))
list.country<-list("ENG","RUS")

I have a function which creates a list, like "list.country" above, and whatever entries contained in this list, I want to extract the matching rows from the data.frame, into a new data frame. So, using the data and list above, I want to end up with a new data frame looking like this:

new.data<-data.frame(x=c("ENG","RUS",),y=c(4,6))

The only way I have found to do this, is extracting each row separately, and binding them together again afterwards, the following way:

vector.list<-unlist(list.country)
n=length(vector.list)
for(i in 1:n){
  data[[i]]<-data[data$x==vector.list[i],]

}
for(i in 2:n){data[[i]]<- rbind(data[[i-1]],data[[i]])} }

Does anyone have a simpler way to do this?

Thank you

Answer 1

Just do

data[data$x %in% list.country, ]

##     x y
## 1 ENG 4
## 3 RUS 6

Answer 2

The %in% function searches for all elements in first vector matching an element in the second one. In your case, you should do this

data[data$x %in% list.country, ]

Removing rows of data frame not satisfying a criteria in a specific column of the data frame (in R)

Question

2 answers

solution1
1 2014-02-26 12:08:22

solution2
1 ACCPTED 2014-02-26 12:10:30

Removing rows of data frame not satisfying a criteria in a specific column of the data frame (in R)

Question

2 answers

solution1 1 2014-02-26 12:08:22

solution2 1 ACCPTED 2014-02-26 12:10:30

solution1
1 2014-02-26 12:08:22

solution2
1 ACCPTED 2014-02-26 12:10:30