Use of ddply instead of a loop - subtracting for particular categories

Question

I have a dataset with 2 numeric columns.

Example dataset:

X = c(-1:-20)
Y=c(11:30)
df=as.data.frame(cbind(X,Y))

My dataset looks like:

I'm using a loop that allows me to subtract a 100 to value below -10.

for (i in 1:length(df[,1]))
{
  if ((df$X[i]< c(-10.0)) == T)
  {df$X[i] = df$X[i] - 100}else
  {}
}

My "real" dataset contains 300 000 lines and the loop is really time consuming. That's why I've been trying to find an apply function that does the job.

library(plyr)
TAB1=ddply(df,.(X),function (x) x[(df$x)< c(-10.0)]-100)

But it's not working at all.

Thank your for any help.

Answer 1

Don't use ddply for this task. You don't need it. The operations are vectorized

index <- df$X < -10
df$X[index] <- df$X[index] - 100

Use of ddply instead of a loop - subtracting for particular categories

Question

1 answers

solution1
2 ACCPTED 2013-04-30 08:56:15

Use of ddply instead of a loop - subtracting for particular categories

Question

1 answers

solution1 2 ACCPTED 2013-04-30 08:56:15

solution1
2 ACCPTED 2013-04-30 08:56:15