R中小外循环和大内循环的高效并行化

Question

I have the following R code我有以下R代码

LLL = list()
idx = 1
for(i in 1:3){
  for(j in 1:9){
     for(k in 1:13){
        for(iter in 1:1000000){
       
           if( runif(1,0,1)<0.5 ){
             LLL[[idx]] = rnorm(1,0,1)
             idx = idx + 1
          }
       }
     }
  }
}

Is there a way to parallelize efficiently this code?有没有办法有效地并行化这段代码？

What I was thinking is that I have 351 configurations of i,j,k , If I could distribute these configurations to cores and each core would run a for loop for 1000000 iterations, can something similar to that be implemented??我在想的是，我有351个i,j,k配置，如果我可以将这些配置分配给内核，并且每个内核都可以运行for循环1000000次迭代，是否可以实现类似的东西？

Answer 1

Instead of calling rnorm() one million times, it would be more efficient to call it once with the argument n = 1000000 .与调用rnorm()一百万次不同，使用参数n = 1000000调用一次会更有效。
To utilize R's functional programming features we should try to avoid writing for() -loops.为了利用 R 的函数式编程特性，我们应该尽量避免编写for()循环。 We can instead first create an object that represents your 351 configurations and then iterate on that object.我们可以先创建一个代表您的 351 个配置的对象，然后迭代该对象。 See below for an example of how to do that without.有关如何在没有的情况下执行此操作的示例，请参见下文。

Create configurations:创建配置：

cfgs <-
  expand_grid(i = 1:3,
              j = 1:9,
              k = 1:13)

Code without parallelization.没有并行化的代码。

cfgs |> 
  split(1:nrow(cfgs)) |> 
  lapply(\(x) rnorm(100000, 0, 1))

In order to parallelize the execution of the code we can use the furrr package.为了并行化代码的执行，我们可以使用furrr包。

library(furrr)
plan(multisession)
cfgs |> 
  split(1:nrow(cfgs)) |> 
  future_map(\(x) rnorm(100000, 0, 1), .options = furrr_options(seed=TRUE))

R中小外循环和大内循环的高效并行化

问题描述

1 个解决方案

解决方案1
1 2021-11-04 17:31:05

R中小外循环和大内循环的高效并行化

问题描述

1 个解决方案

解决方案1 1 2021-11-04 17:31:05

解决方案1
1 2021-11-04 17:31:05