drop a digit for some data points in a column in R

Question

I have a column of numbers (characters) which has either 8 digits or 9 digits data. If the data point has 9 digits, I want to drop the first digit. I'm using the following command:

file$hscode2 <- if (nchar(file$hscode1 >= 9)) {

   file$hscode2 <- substr(file$hscode1,2,9)

}

where the data frame is "file" and the column with 8/9 digits data is hscode1 and the new column which drops the first digit when it is 9 digit character is hscode2

However, I'm not getting the desired result. Any suggestions?

Thanks

Answer 1

I think there is a bug. It should be:

file$hscode2 <- if (nchar(file$hscode1) >= 9) {

    file$hscode2 <- substr(file$hscode1,2,9)

}

As written, your function was running nchar on "file$hscode1 >= 9" which is a boolean, which if converted to a char would just be 1 character, hence the conditions would always have been true I think (leading to the unexpected results you were seeing).

Answer 2

As suggested by @missuse, ifelse() will work fine here.

file$hscode2 <- ifelse(nchar(file$hscode1 >= 9), # test
                       substr(file$hscode1, 2, 9), # value if true
                       file$hscode1) # value if false

drop a digit for some data points in a column in R

Question

2 answers

solution1
1 2017-10-13 11:20:05

solution2
0 2017-10-13 11:33:02

drop a digit for some data points in a column in R

Question

2 answers

solution1 1 2017-10-13 11:20:05

solution2 0 2017-10-13 11:33:02

solution1
1 2017-10-13 11:20:05

solution2
0 2017-10-13 11:33:02