Split a large dataframe into multiple dataframes by row

I have a large dataframe of 1,150,000 rows and 6 columns.

How do I split the dataframe into dataframes of 200,000 rows each, with the last one holding the remaining 150,000 rows (six dataframes in total)?

Use a list comprehension to create a list of 6 dataframes, which can then be assigned to separate variables.

n = 200000
list_df = [df[i:i+n] for i in range(0,df.shape[0],n)]
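
The assignment to separate variables can be done with tuple unpacking. The following is only a minimal sketch: it uses a small stand-in frame (1,150 rows instead of 1,150,000) so it runs quickly, and the names `part1` to `part6` are hypothetical, chosen for illustration.

```python
import numpy as np
import pandas as pd

# Small stand-in for the real frame (assumption: 1,150 rows, 6 columns).
df = pd.DataFrame(np.random.rand(1150, 6))

n = 200  # rows per chunk, scaled down to match the stand-in frame
list_df = [df[i:i+n] for i in range(0, df.shape[0], n)]

# Unpacking only works when the number of names equals len(list_df).
part1, part2, part3, part4, part5, part6 = list_df
```

Unpacking raises a `ValueError` if the number of chunks changes, so keeping the list and indexing into it is usually the more robust choice.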

Outputs:

In [1]: import numpy as np

In [2]: import pandas as pd

In [3]: df = pd.DataFrame(index=np.arange(1150000),data=np.random.rand(1150000,6))

In [4]: n = 200000

In [5]: df1 = [df[i:i+n] for i in range(0,len(df),n)]

In [6]: df1[0].shape
Out[6]: (200000, 6)

In [7]: df1[1].shape
Out[7]: (200000, 6)

In [8]: df1[2].shape
Out[8]: (200000, 6)

In [9]: df1[3].shape
Out[9]: (200000, 6)

In [10]: df1[4].shape
Out[10]: (200000, 6)

In [11]: df1[5].shape
Out[11]: (150000, 6)
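
If the split is needed in more than one place, the same slicing can be wrapped in a small helper. This is a sketch rather than a definitive implementation: the function name `split_by_rows` and its parameters are hypothetical, and it uses `DataFrame.iloc` so that the slices are always positional regardless of the index.

```python
import numpy as np
import pandas as pd


def split_by_rows(frame, chunk_size):
    """Return consecutive row slices of `frame`, each with at most `chunk_size` rows."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    # iloc slices by position, so this works for any index type.
    return [frame.iloc[start:start + chunk_size]
            for start in range(0, len(frame), chunk_size)]


df = pd.DataFrame(np.random.rand(1_150_000, 6))
chunks = split_by_rows(df, 200_000)
sizes = [len(chunk) for chunk in chunks]  # row count of each chunk
```

With 1,150,000 rows and a chunk size of 200,000, `sizes` holds five entries of 200,000 followed by one of 150,000, matching the shapes shown above.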
