簡體 English 中英

如何使用pandas數據框從磁盤讀取和寫入文件？

[英]How to read and write files from disk using the pandas dataframe?

原文 2017-09-15 21:13:29 9 1 python-3.x/ pandas

我將要處理非常大的數據文件（許多GB）。 我將不得不讀取這些文件並寫入這些文件。 因此，我將不能依靠RAM來存儲數據，並且需要從磁盤讀取和寫入文件。

我熟悉pandas庫提供的read_csv和to_csv選項。 但是，我不確定read csv函數是先讀取文件，然后將其存儲在RAM上還是直接從磁盤讀取文件。

使用熊貓從磁盤讀取和寫入文件的最佳方法是什么？

1 個解決方案

pandas.read_csv會將整個文件讀入內存。 如果只需要特定的列，則可以使用usecols參數指定列的子集，而pandas只加載那些列。

由於文件不適合內存，您可以使用split在磁盤上拆分文件，然后對塊執行所有操作。

一個簡單的替代方法是使用read_csv從dask.dataframe從DASK庫。

從文檔中：

A Dask DataFrame is a large parallel dataframe composed of many smaller Pandas dataframes, split along the index. These pandas dataframes may live on disk for larger-than-memory computing on a single machine, or on many different machines in a cluster.

如何使用 Pandas 從私有 GitHub 存儲庫中讀取 excel 數據框？

[英]How to read an excel dataframe from a private GitHub repository using pandas?

讀寫xlsx文件，從pandas dataframe到指定目錄

[英]Read and Write xlsx file, from pandas dataframe to specific directory

如何從url讀取數據到pandas dataframe

[英]How to read data from url to pandas dataframe

How can I read data from AWS- Aurora postgresql as python pandas DataFrame and write the same to Oracle table?

[英]How can I read data from AWS- Aurora postgresql as python pandas DataFrame and write the same to Oracle table?

如何使用 python 腳本將 Pandas 數據幀寫入 AWS Athena 表

[英]How to write a pandas dataframe to AWS Athena table using python script

如何在不使用熊貓或任何包的情況下編寫數據幀？

[英]How to write a dataframe without using pandas or any package?

如何在Pandas中使用quotechar從DAT文件讀取和寫入刺字符？

[英]How do I read and write the thorn character from a DAT file using quotechar in Pandas?

如何在python的S3中從pandas數據幀寫入鑲木地板文件

[英]How to write parquet file from pandas dataframe in S3 in python

將幾個文本文件的內容讀入 pandas Dataframe

[英]Read content of several text files into pandas Dataframe

以最新的修改時間將3個文件讀入Pandas Dataframe

[英]Read 3 files into Pandas Dataframe with latest modified time

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 如何使用 Pandas 從私有 GitHub 存儲庫中讀取 excel 數據框？讀寫xlsx文件，從pandas dataframe到指定目錄如何從url讀取數據到pandas dataframe How can I read data from AWS- Aurora postgresql as python pandas DataFrame and write the same to Oracle table? 如何使用 python 腳本將 Pandas 數據幀寫入 AWS Athena 表如何在不使用熊貓或任何包的情況下編寫數據幀？如何在Pandas中使用quotechar從DAT文件讀取和寫入刺字符？如何在python的S3中從pandas數據幀寫入鑲木地板文件將幾個文本文件的內容讀入 pandas Dataframe 以最新的修改時間將3個文件讀入Pandas Dataframe

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM