R: split a numeric vector at a position

Question

I am wondering about the simple task of splitting a vector in two at a given index:

splitAt <- function(x, pos){
  list(x[1:pos-1], x[pos:length(x)])
}

a <- c(1, 2, 2, 3)

> splitAt(a, 4)
[[1]]
[1] 1 2 2

[[2]]
[1] 3

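My question: there must be an existing function for this, but I can't find it. Is it perhaps possible with split? My naive implementation also fails if pos=0 or pos>length(a).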
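The edge cases mentioned in the question come from how R parses the two index expressions. The short sketch below only uses base R; the variable names (idx, pos0_first, and so on) are illustrative and not part of the original post.

```r
a <- c(1, 2, 2, 3)
pos <- 4

# `:` binds tighter than binary minus, so 1:pos-1 means (1:pos) - 1, i.e. 0, 1, ..., pos-1.
idx <- 1:pos - 1
# A zero index is silently dropped, which is why the first piece looks right here.
first_piece <- a[idx]

# pos = 0: 1:0 - 1 is c(0, -1), so the first piece drops element 1 instead of being empty,
# and 0:length(a) selects the whole vector as the second piece.
pos0_first  <- a[1:0 - 1]
pos0_second <- a[0:length(a)]

# pos = 5 > length(a): 5:length(a) counts down (5, 4), so an NA is picked up.
pos5_second <- a[5:length(a)]

print(list(idx, first_piece, pos0_first, pos0_second, pos5_second))
```

Using seq_len() or head()/tail() instead of `:` would avoid the counting-down behaviour, at the cost of a slightly longer function.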

Answer 1

An improvement would be:

splitAt <- function(x, pos) unname(split(x, cumsum(seq_along(x) %in% pos)))

which can now take a vector of positions:

splitAt(a, c(2, 4))
# [[1]]
# [1] 1
# 
# [[2]]
# [1] 2 2
# 
# [[3]]
# [1] 3

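It also behaves properly (subjectively) if pos <= 1 or pos > length(x), in the sense that it returns the whole original vector as a single list item. Note that pos = length(x) still splits off the last element, as in the question's example. If you would rather have it raise an error, use stopifnot at the top of the function.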
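A minimal sketch of the stopifnot variant follows. The name splitAtChecked is hypothetical, and the choice of which positions to reject (anything outside 2..length(x), plus NA) is an assumption; adjust the conditions to whatever you consider invalid.

```r
# Hypothetical checked variant of the split/cumsum approach.
splitAtChecked <- function(x, pos) {
  stopifnot(
    is.numeric(pos),
    !anyNA(pos),
    all(pos > 1),          # a split at 1 (or below) would leave nothing on the left
    all(pos <= length(x))  # positions past the end cannot start a new piece
  )
  unname(split(x, cumsum(seq_along(x) %in% pos)))
}

a <- c(1, 2, 2, 3)
res <- splitAtChecked(a, c(2, 4))
print(res)

# An out-of-range position now fails loudly instead of returning one piece.
err <- tryCatch(splitAtChecked(a, 0), error = function(e) "error")
print(err)
```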

Answer 2

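I tried to use flodel's answer, but in my case, with a very large x (and the function being called repeatedly), it was too slow. So I wrote the following function, which is much faster but also very ugly and does not behave properly. In particular, it checks nothing and returns wrong results at least for pos > length(x) or pos <= 1 (you can add those checks yourself if you are unsure about your inputs and not too concerned about speed), and perhaps in some other cases too, so be careful.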

splitAt2 <- function(x, pos) {
    out <- list()
    pos2 <- c(1, pos, length(x)+1)
    for (i in seq_along(pos2[-1])) {
        out[[i]] <- x[pos2[i]:(pos2[i+1]-1)]
    }
    return(out)
}

Still, splitAt2 runs about 20 times faster with an x of length 10^6:

library(microbenchmark)
W <- rnorm(1e6)
splits <- cumsum(rep(1e5, 9))
tm <- microbenchmark(
                     splitAt(W, splits),
                     splitAt2(W, splits),
                     times=10)
tm
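Before relying on a timing comparison like this, it may be worth confirming that both functions return the same pieces for the inputs being benchmarked. The sketch below repeats the two definitions from the answers so that it runs on its own, and uses a smaller vector (1e4 elements, an arbitrary choice) to keep it quick.

```r
# Definitions copied from the answers above so this snippet is self-contained.
splitAt <- function(x, pos) unname(split(x, cumsum(seq_along(x) %in% pos)))

splitAt2 <- function(x, pos) {
    out <- list()
    pos2 <- c(1, pos, length(x) + 1)
    for (i in seq_along(pos2[-1])) {
        out[[i]] <- x[pos2[i]:(pos2[i + 1] - 1)]
    }
    return(out)
}

set.seed(1)                      # arbitrary seed, only for reproducibility
W <- rnorm(1e4)
splits <- cumsum(rep(1e3, 9))    # 1000, 2000, ..., 9000

r1 <- splitAt(W, splits)
r2 <- splitAt2(W, splits)

# Compare piece by piece; all.equal tolerates attribute-free numeric vectors.
same <- isTRUE(all.equal(r1, r2))
print(same)
print(lengths(r1))
```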

Answer 3

Another alternative that might be faster and/or more readable/elegant than flodel's solution:

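splitAt <- function(x, pos) {
  # Group by element position, not by value: findInterval(x, pos) would bin the values of x.
  # pos must be sorted in increasing order.
  unname(split(x, findInterval(seq_along(x), pos)))
}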
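To illustrate what findInterval contributes here, the sketch below compares applying it to the values of the vector with applying it to the element positions. The variable names by_value and by_index and the function name splitAtIdx are illustrative only; the example vector is the one from the question.

```r
a <- c(1, 2, 2, 3)
pos <- c(2, 4)   # must be sorted in increasing order for findInterval

# Binning the values of a: only correct if the values happen to equal the positions.
by_value <- findInterval(a, pos)
# Binning the positions 1..length(a): each split position starts a new group.
by_index <- findInterval(seq_along(a), pos)

print(by_value)
print(by_index)

# Hypothetical helper built on the index-based grouping.
splitAtIdx <- function(x, pos) unname(split(x, findInterval(seq_along(x), pos)))
pieces <- splitAtIdx(a, pos)
print(pieces)
```

With the index-based grouping the result matches the split/cumsum version from Answer 1 for this input.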

3 solutions

Solution 1: score 27, accepted, 2013-05-03 11:41:24
Solution 2: score 5, 2013-10-09 14:08:38
Solution 3: score 2, 2016-06-30 14:31:26
