data.table: Numeric column name

Question

Here's the data.table Im working with:

> head(dataTable)
     persnr      1993      1994
1: 60487416 0.5777598        NA
2: 60487511        NA  5.245855
3: 60488034 0.5777598 23.100167
4: 60488147 0.5777598        NA
5: 60488240 0.5777598 23.100167
6: 60488338 0.5777598 23.100167

Having the column years numeric is quite useful, as I can simply iterate through these. It however has a drawback:

dataTable[is.na(1993),]
Empty data.table (0 rows) of 3 cols: persnr,1993,1994

It mistakes the 1993 for an integer, instead of using it as the object name. Otherwise I can't explain how it would come up with zero rows that satisfy this condition. How can I check for NA values when the column name is numeric?

Answer 1

You may want to treat your data.frame like a matrix and use the apply function to find the NA values:

apply(dataTable,1,is.na)

That will be faster than iterating through columns.

If you want to find the rows with any NA values you could do:

apply(dataTable,1,function(x){any(is.na(x))})

data.table: Numeric column name

Question

1 answers

solution1
0 2014-10-31 23:43:07

data.table: Numeric column name

Question

1 answers

solution1 0 2014-10-31 23:43:07

solution1
0 2014-10-31 23:43:07