How can I use Python to split one Excel file into multiple Excel files, with the same number of rows in each new file?

Question

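I have an Excel file with a large amount of data. I want to split it into multiple Excel files, with the rows distributed equally.

My current code partly works: it assigns the required number of rows and creates multiple Excel files. At the same time, though, it keeps creating more Excel files based on the row count.

If I set n_partitions to 5, it creates two Excel files with 5 rows each, and then goes on to create three more blank Excel files. I want my code to stop creating Excel files once all rows have been distributed.

Below is my sample Excel file, with the expected result and sample code.

The code I am currently using is: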

import pandas as pd

df = pd.read_excel("C:/Zen/TestZenAmp.xlsx")

n_partitions = 5

for i in range(n_partitions):
    sub_df = df.iloc[(i*n_partitions):((i+1)*n_partitions)]
    sub_df.to_excel(f"C:/Zen/-{i}.xlsx", sheet_name="a")

Answer 1

You can use the code below to split the DataFrame into chunks of 5 rows:

n = 5
list_df = [df[i:i+n] for i in range(0,df.shape[0],n)]
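
To see why this stops once the rows run out, the slice boundaries can be worked out without pandas. This is only a minimal sketch: `chunk_bounds` is a hypothetical helper (not part of pandas), and 12 rows is an assumed example size.

```python
def chunk_bounds(n_rows, chunk_size):
    """Return (start, stop) row positions for consecutive chunks."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    # range() walks the starting positions; min() clips the last chunk
    # so it never runs past the end of the data.
    return [(start, min(start + chunk_size, n_rows))
            for start in range(0, n_rows, chunk_size)]


bounds = chunk_bounds(12, 5)
print(bounds)  # [(0, 5), (5, 10), (10, 12)]
```

Each pair corresponds to `df.iloc[start:stop]`. Because the number of chunks is derived from the row count rather than from a fixed loop count, no empty chunk is ever produced.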

You can access each chunk like this:

>>> list_df[0]

>>> list_df[2]

Then you can iterate over the list of chunks/sub-DataFrames and create a separate Excel file for each one:

for i, sub_df in enumerate(list_df, start=1):
    sub_df.to_excel(f"C:/Zen/-{i}.xlsx", sheet_name="a", index=False)
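
If the goal is instead a fixed number of files with row counts as even as possible, the sizes could be computed first. This is a sketch under that assumption: `balanced_sizes` is a hypothetical helper, not part of pandas, and the 12-row, 5-file figures are made up for illustration.

```python
def balanced_sizes(n_rows, n_files):
    """Split n_rows into at most n_files sizes that differ by at most one."""
    if n_files <= 0:
        raise ValueError("n_files must be positive")
    base, extra = divmod(n_rows, n_files)
    # The first `extra` files take one additional row each.
    sizes = [base + 1 if i < extra else base for i in range(n_files)]
    # Drop zero-size entries so that no empty file would be written.
    return [s for s in sizes if s > 0]


sizes = balanced_sizes(12, 5)
print(sizes)  # [3, 3, 2, 2, 2]

# Turn the sizes into (start, stop) positions usable with df.iloc[start:stop].
slices = []
start = 0
for size in sizes:
    slices.append((start, start + size))
    start += size
print(slices)  # [(0, 3), (3, 6), (6, 8), (8, 10), (10, 12)]
```

With fewer rows than requested files, for example `balanced_sizes(3, 5)`, the helper returns only three sizes, so the loop that writes files would stop after three.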

Answer 2

Another possible solution:

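k = 5  # rows per output file (assumed value; k was not defined in the original snippet)
# Assumes df has the default 0..N-1 integer index, so index // k labels each block of k rows.
for group_id, sub_df in df.groupby(df.index // k):
    sub_df.to_excel(f"/tmp/x-{group_id}.xlsx", sheet_name="a")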
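
The grouping key here is just integer division of the row position by the chunk size. Below is a small pandas-free illustration, assuming a default 0-based index; the values 7 rows and k = 3 are made up for the example.

```python
k = 3        # rows per group (assumed value)
n_rows = 7   # assumed row count

# Same arithmetic as df.index // k on a default 0-based integer index.
labels = [i // k for i in range(n_rows)]
print(labels)  # [0, 0, 0, 1, 1, 1, 2]

# Rows sharing a label land in the same group, i.e. the same output file.
groups = {}
for position, label in enumerate(labels):
    groups.setdefault(label, []).append(position)
print(groups)  # {0: [0, 1, 2], 1: [3, 4, 5], 2: [6]}
```

The last group simply holds whatever rows remain, so the number of groups never exceeds what the data needs.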

Answer 3

This is exactly what I want, but for Java :(

Answer 1: score 0, posted 2022-08-27 09:04:19

Answer 2: score 0, posted 2022-08-27 11:24:57

Answer 3: score -2, posted 2022-12-23 06:49:57
