R: Disabling Scientific Notation

Question

I am using the R programming language. On some bigger data, I tried the following code (make a decision tree):

#load library
library(rpart)
    
    #generate data
    a = rnorm(100, 7000000, 10)
    
    b = rnorm(100, 5000000, 5)
    
    c = rnorm(100, 400000, 10)
    
    group <- sample( LETTERS[1:2], 100, replace=TRUE, prob=c(0.5,0.5) )
    
    group_1 <- sample( LETTERS[1:4], 100, replace=TRUE, prob=c(0.25, 0.25, 0.25, 0.25) )
    
    
    d = data.frame(a,b,c, group, group_1)
    d$group = as.factor(d$group)
    d$group_1 = as.factor(d$group_1)
    
#fit model
    tree <- rpart(group ~ ., d)
    
#visualize results
    plot(tree)
    
    text(tree, use.n=TRUE, minlength = 0, xpd=TRUE, cex=.8)

In the visual output, the numbers are displayed in scientific notation (eg 4.21e+06). Is there a way to disable this?

I consulted this previous answer on stackoverflow: How to disable scientific notation?

I then tried the following command: options(scipen=999)

But this did not seem to fix the problem.

Can someone please tell me what I am doing wrong?

Thanks

Answer 1

I think the labels.rpart function has scientific notation hard-coded in: it uses a private function called formatg to do the formatting using sprintf() with a %g format, and that function ignores options(scipen) . You can override this by replacing formatg with a better function. Here's a dangerous way to do that:

oldformatg <- rpart:::formatg
assignInNamespace("formatg", format, "rpart")

which replaces formatg with the standard format function. (This will definitely have dangerous side effects, so afterwards you should change it back using

assignInNamespace("formatg", oldformatg, "rpart")

A better solution would be to rescale your data. rpart switches to scientific notation only for big numbers, so you could divide the bad numbers by something like 1000 or 1000000, and describe them as being in different units. For your example, this works for me:

library(rpart)

#generate data
set.seed(123)
a = rnorm(100, 7000000, 10)/1000

b = rnorm(100, 5000000, 5)/1000

c = rnorm(100, 400000, 10)/1000

group <- sample( LETTERS[1:2], 100, replace=TRUE, prob=c(0.5,0.5) )

group_1 <- sample( LETTERS[1:4], 100, replace=TRUE, prob=c(0.25, 0.25, 0.25, 0.25) )


d = data.frame(a,b,c, group, group_1)
d$group = as.factor(d$group)
d$group_1 = as.factor(d$group_1)

#fit model
tree <- rpart(group ~ ., d)

#visualize results
plot(tree)

text(tree, use.n=TRUE, minlength = 0, xpd=TRUE, cex=.8)

^{Created on 2021-01-27 by the reprex package (v0.3.0)}

R: Disabling Scientific Notation

Question

1 answers

solution1
2 ACCPTED 2021-01-27 16:23:04

R: Disabling Scientific Notation

Question

1 answers

solution1 2 ACCPTED 2021-01-27 16:23:04

solution1
2 ACCPTED 2021-01-27 16:23:04