优化Markov链转移矩阵计算？

Question

作为R的中级用户，我知道for循环通常可以通过使用诸如apply或其他功能来优化。 但是，我不知道可以优化当前代码以生成markov链矩阵的函数，该矩阵运行速度非常慢。 我是否已尽速而为，还是有我要忽略的事情？ 我正在尝试通过计算给定警报之前24小时内出现的次数来查找马尔可夫链的转换矩阵。 矢量ids包含所有可能的ID（大约1700）。

原始矩阵如下所示：

>matrix
      id      time
       1     1376084071
       1     1376084937
       1     1376023439
       2     1376084320
       2     1372983476
       3     1374789234
       3     1370234809

这是我的代码来尝试解决这个问题：

matrixtimesort <- matrix[order(-matrix$time),]
frequency = 86400 #number of seconds in 1 day

# Initialize matrix that will contain probabilities
transprobs <- matrix(data=0, nrow=length(ids), ncol=length(ids))

# Loop through each type of event
for (i in 1:length(ids)){
localmatrix <- matrix[matrix$id==ids[i],]

# Loop through each row of the event
for(j in 1:nrow(localmatrix)) {
    localtime <- localmatrix[j,]$time
    # Find top and bottom row number defining the 1-day window
    indices <- which(matrixtimesort$time < localtime & matrixtimesort$time >= (localtime - frequency))
    # Find IDs that occur within the 1-day window
    positiveids <- unique(matrixtimesort[c(min(indices):max(indices)),]$id)
    # Add one to each cell in the matrix that corresponds to the occurrence of an event

            for (l in 1:length(positiveids)){
            k <- which(ids==positiveids[l])
            transprobs[i,k] <- transprobs[i,k] + 1
            }
    }

# Divide each row by total number of occurrences to determine probabilities
transprobs[i,] <- transprobs[i,]/nrow(localmatrix)
    }
  # Normalize rows so that row sums are equal to 1
  normalized <- transprobs/rowSums(transprobs)

谁能提出建议以优化速度？

Answer 1

使用嵌套循环似乎不是一个好主意。 您的代码可以向量化以加快速度。

例如，为什么要查找行号的顶部和底部？ 您可以简单地将时间值与“ time_0 + frequency”进行比较：这是矢量化操作。

HTH。

优化Markov链转移矩阵计算？

问题描述

1 个解决方案

解决方案1
0 2013-08-14 23:07:58

优化Markov链转移矩阵计算？

问题描述

1 个解决方案

解决方案1 0 2013-08-14 23:07:58

解决方案1
0 2013-08-14 23:07:58