簡體 English 中英

讀取大型 csv 文件中特定列的最有效方法

[英]Most efficient way to read a specific column in large csv file

原文 2023-01-06 09:37:15 6 1 python/ pandas/ dataframe/ csv

大約有一個 CSV 文件。 大小為 2.5 GB，大約有 50 列和 450 萬行。

該數據集將用於不同的操作，但一次只使用幾列，因此我正在尋找一種高性能算法來只讀取 CSV 文件中的一列。

讀取一個塊中的文件大約需要 38 秒才能讀取一個 Pandas dataframe 中的文件。
```
 path = r"C:\my_path\my_csv.csv" pd.read_csv(path, header=0)
```
僅閱讀一個特定的列大約需要 14 秒

pd.read_csv(path, usecols=["my_specific_col"], header=0)

有沒有辦法減少閱讀時間？ 因為看起來列數對性能影響不大。

1 個解決方案

自 Pandas 的 1.4.0 版以來，有一個新的read_csv實驗引擎，它依賴於 Arrow 庫的 CSV 多線程解析器，而不是默認的 C 解析器。

所以，這可能有助於加快速度：

df = pd.read_csv(path, usecols=["my_specific_col"], header=0, engine="pyarrow")

讀取大型二進制文件python的最有效方法是什么

[英]What is the most efficient way to read a large binary file python

在python中解析大型.csv的最有效方法？

[英]Most efficient way to parse a large .csv in python?

使用 Python 讀取位於 S3 (AWS) 上的大型 CSV 文件（10 M+ 條記錄）的最有效方法是什么？

[英]What is the most efficient way to read a large CSV file ( 10 M+ records) located on S3 (AWS) with Python?

在.csv 中讀取和擴充（復制樣本並更改某些值）大型數據集的最有效方法是什么

[英]What is the most efficient way to read and augment (copy samples and change some values) large dataset in .csv

讀取CSV文件列中非空單元格的有效方法

[英]Efficient way to read non-empty cells in a column in CSV file

在python中僅讀取特定行的最有效的文件類型（非常大的文件）

[英]Most efficient file type to read in only specific rows, in python (very large files)

部分閱讀大型numpy文件的有效方法？

[英]Efficient way to partially read large numpy file?

Python：如何以有效的方式讀取 .csv 文件？

[英]Python: how to read .csv file in an efficient way?

在Python中修改大型文本文件的最后一行的最有效方法

[英]Most efficient way to modify the last line of a large text file in Python

搜索大型排序文本文件的最快和最有效的方法

[英]Quickest and most efficient way to search large sorted text file

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 讀取大型二進制文件python的最有效方法是什么在python中解析大型.csv的最有效方法？使用 Python 讀取位於 S3 (AWS) 上的大型 CSV 文件（10 M+ 條記錄）的最有效方法是什么？在.csv 中讀取和擴充（復制樣本並更改某些值）大型數據集的最有效方法是什么讀取CSV文件列中非空單元格的有效方法在python中僅讀取特定行的最有效的文件類型（非常大的文件）部分閱讀大型numpy文件的有效方法？ Python：如何以有效的方式讀取 .csv 文件？在Python中修改大型文本文件的最后一行的最有效方法搜索大型排序文本文件的最快和最有效的方法

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM