如何将一个dataframe在R中一一拆分成给定行数

Question

Suppose here the data.假设这里的数据。

data(iris)

Iris has 150 rows, how to split it on 5 datasets but with respect to the order of the rows.150/5=30. Iris 有 150 行，如何在 5 个数据集上拆分它，但相对于行的顺序。150/5=30。 So first dataset consists of rows from 1-30, the second dataset consists of rows 31-60 and so on in strict order.因此，第一个数据集由 1-30 行组成，第二个数据集由 31-60 行组成，依此类推。 Or if I want to split it into 4 parts, 150/4=37.5.或者如果我想把它分成 4 个部分，150/4=37.5。 This is a fractional number, so in such cases we round it up to the upper limit = 38. 38+38+38+36=150.这是一个小数，所以在这种情况下我们将它四舍五入到上限 = 38。 38+38+38+36=150。 In other words, the first split dataset will be from 1-38, the second from 39-76 and similarly.换句话说，第一个拆分数据集将从 1-38 开始，第二个拆分数据集将从 39-76 开始，类似地。

How can i perform it?我该如何执行？ Thank you advance.谢谢你提前。

Answer 1

You can use something like this:你可以使用这样的东西：

num_groups <- 4
split(iris, (1:nrow(iris) - 1) %/% ceiling(nrow(iris) / num_groups))

Confirming above works for different group sizes:确认以上适用于不同的组大小：

4 groups 4组

split(iris, (1:nrow(iris) - 1) %/% ceiling(nrow(iris) / 4)) |> sapply(nrow)
#  0  1  2  3 
# 38 38 38 36

5 groups 5组

split(iris, (1:nrow(iris) - 1) %/% ceiling(nrow(iris) / 5)) |> sapply(nrow)
#  0  1  2  3  4 
# 30 30 30 30 30

7 groups 7组

split(iris, (1:nrow(iris) - 1) %/% ceiling(nrow(iris) / 7)) |> sapply(nrow)
#  0  1  2  3  4  5  6 
# 22 22 22 22 22 22 18

Answer 2

Another possible solution:另一种可能的解决方案：

library(tidyverse)

data(iris)

ngroups <- 4
v <- 1:nrow(iris)

split(v, ceiling(seq_along(v) / round(nrow(iris)/ngroups))) %>% 
  map(~ slice(iris, .x))

如何将一个dataframe在R中一一拆分成给定行数

问题描述

2 个解决方案

解决方案1
2 2022-04-02 13:03:28

4 groups 4组

5 groups 5组

7 groups 7组

解决方案2
2 2022-04-02 14:27:39

如何将一个dataframe在R中一一拆分成给定行数

问题描述

2 个解决方案

解决方案1 2 2022-04-02 13:03:28

4 groups 4组

5 groups 5组

7 groups 7组

解决方案2 2 2022-04-02 14:27:39

解决方案1
2 2022-04-02 13:03:28

解决方案2
2 2022-04-02 14:27:39