如何使用 Python 将一个 excel 拆分为多个 excel，并在所有新 excel 中分配相同的行数？

Question

我有一个大数据的excel。 我想将这个 excel 拆分成多个 excel，行分布相等。

我当前的代码正在部分工作，因为它正在分配所需的行数并创建多个 excel。 但与此同时，它通过考虑行数不断创建更多的 excel。

在n_partitions 中，如果我输入数字 5，那么它会在两个 excel 中创建 5 行的 excel，然后它会继续创建另外三个空白 excel。 我希望我的代码在分发所有行后停止创建更多 excel。

下面是我的示例 excel，带有预期结果和示例代码。

我目前使用的代码是。

import pandas as pd

df = pd.read_excel("C:/Zen/TestZenAmp.xlsx")

n_partitions = 5

for i in range(n_partitions):
    sub_df = df.iloc[(i*n_partitions):((i+1)*n_partitions)]
    sub_df.to_excel(f"C:/Zen/-{i}.xlsx", sheet_name="a")

Answer 1

您可以使用下面的代码将 DataFrame 拆分为 5 个大小的块：

n = 5
list_df = [df[i:i+n] for i in range(0,df.shape[0],n)]

您可以像这样访问每个块：

>>> `list_df[0]`

>>> `list_df[2]`

然后您可以遍历块/子数据帧列表并创建单独的 Excel 文件：

i=1
for sub_df in list_df:
    sub_df.to_excel(f"C:/Zen/-{i}.xlsx", sheet_name="a", index=False)
    i+=1

Answer 2

另一种可能的解决方案：

g = df.groupby([df.index // k])
df['id'] = g.ngroup()
(g.apply(lambda x: x.drop('id', 1)
         .to_excel(f"/tmp/x-{pd.unique(x.id)[0]}.xlsx", sheet_name="a")))

Answer 3

这正是我想要的，但在 Java 版本中:(

如何使用 Python 将一个 excel 拆分为多个 excel，并在所有新 excel 中分配相同的行数？

问题描述

2 个解决方案

解决方案1
0 2022-08-27 09:04:19

>>> `list_df[0]`

>>> `list_df[2]`

解决方案2
0 2022-08-27 11:24:57

解决方案3
-2 2022-12-23 06:49:57

如何使用 Python 将一个 excel 拆分为多个 excel，并在所有新 excel 中分配相同的行数？

问题描述

2 个解决方案

解决方案1 0 2022-08-27 09:04:19

>>> list_df[0]

>>> list_df[2]

解决方案2 0 2022-08-27 11:24:57

解决方案3 -2 2022-12-23 06:49:57

解决方案1
0 2022-08-27 09:04:19

>>> `list_df[0]`

>>> `list_df[2]`

解决方案2
0 2022-08-27 11:24:57

解决方案3
-2 2022-12-23 06:49:57