
Pandas: Reading in the first x chunks or lines of a large bz2 file using read_json

Original post: 2019-12-02 06:11:59 · Tags: python, json, pandas, while-loop

I am attempting to read in a bz2 file using pd.read_json:

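import pandas as pd

i = 0
while i < 3:
    i = i + 1
    # Each pass only creates a new reader over the whole file; no chunk is read here
    chunks = pd.read_json("file.bz2", lines=True, chunksize=100)
for c in chunks:
    # This loop runs over every chunk in the file, not just the first three
    print(c)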

This doesn't stop at 3 chunks. How do I read in only the first x chunks or the first x lines?

1 answer

Try:

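import pandas as pd

# With chunksize set, read_json returns an iterator of DataFrames rather than one DataFrame
chunks = pd.read_json("file.bz2", lines=True, chunksize=100)
i = 0
chunk_list = []
for chunk in chunks:
    # Stop once three chunks have been processed
    if i >= 3:
        break
    i += 1
    # do something with that chunk like this
    # (merge_df stands for another DataFrame you have already defined):
    result = pd.merge(chunk, merge_df)
    chunk_list.append(result)
df = pd.concat(chunk_list)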
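
The manual counter in the loop above can also be replaced by itertools.islice, and the "x lines" half of the question can be handled with the nrows argument of read_json. The following is only a sketch under a few assumptions: pandas 1.2 or newer (so that nrows exists and the chunk reader can be used in a with block), a file that has at least one chunk, and helper names read_first_chunks and read_first_lines that are made up for illustration.

```python
from itertools import islice

import pandas as pd


def read_first_chunks(path, n_chunks, chunksize=100):
    """Return the first n_chunks chunks of a line-delimited JSON file as one DataFrame."""
    # Compression is inferred from the file extension (for example .bz2)
    with pd.read_json(path, lines=True, chunksize=chunksize) as reader:
        # islice stops pulling from the reader after n_chunks chunks
        chunks = list(islice(reader, n_chunks))
    return pd.concat(chunks, ignore_index=True)


def read_first_lines(path, n_lines):
    """Return only the first n_lines records; nrows requires lines=True."""
    return pd.read_json(path, lines=True, nrows=n_lines)
```

For example, read_first_chunks("file.bz2", 3) should give the same 300 rows as read_first_lines("file.bz2", 300), since both stop reading before the rest of the file is parsed.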

Related questions

Reading first lines of bz2 files in python

unable to read large bz2 file

Pandas: Reading in several large .bz2 files and appending it

How to effectively read large (30GB+) TAR file with BZ2 JSON twitter files into PostgreSQL

what is an efficient way to load and aggregate a large .bz2 file into pandas?

How to parse WIkidata JSON (.bz2) file using Python?

large filesize difference between `pandas to_json` and `read_json`

Limit on bz2 file decompression using python?

How to read lines from arbitrary BZ2 streams for CSV?

Cannot load json file using pandas read_json in Jupyter lab


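Notice: The technical posts on this site are licensed under CC BY-SA 4.0. If you repost them, please cite this site's URL or the original address. For any questions, contact yoyou2525@163.com.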

