How do I decompress multiple json gzip files listed in a column?

Question

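I have a csv file with a column that contains the names of thousands of json.gz files. My goal is to iterate over every row in that column and decompress each json.gz file.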

index	location
0	'0_location_data.json.gz'
1	'1_location_data.json.gz'

My code:

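import pandas as pd
import itertools, gzip

# df is the DataFrame loaded from the csv file
jsonfilename = list(df['location'])

# Lazily open each gzip file in text mode
it = (gzip.open(f, 'rt') for f in jsonfilename)

# Chain the open files together and print them line by line
for line in itertools.chain.from_iterable(it):
    print(line)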

My error:

OSError: [Errno 22] Invalid argument: '0_location_data.json.gz'

My goal is to decompress all of these files so that I can then normalize them into a csv.

Answer 1

it = (gzip.open(f, 'rb') for f in jsonfilename)

Since gzip produces binary files, 'rb' may be the correct mode argument.
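For reference, gzip.open accepts both modes: 'rb' yields bytes and 'rt' yields decoded text, so with 'rb' each line still needs decoding before it is parsed as JSON. The following is only a minimal sketch of the loop, not a confirmed fix for the OSError. It assumes that the values in the column may carry stray quote characters or whitespace (as the table in the question suggests), that each file holds a single UTF-8 JSON document, and that the files live in one directory. The helper names clean_name, load_gzipped_json and load_all, and the base_dir parameter, are hypothetical.

```python
import gzip
import json
import os


def clean_name(raw):
    """Strip surrounding whitespace and literal quote characters.

    Assumption: the csv column may store names like "'0_location_data.json.gz'".
    """
    return str(raw).strip().strip("'\"")


def load_gzipped_json(path):
    """Open one .json.gz file in text mode and parse it.

    Assumption: the file contains a single UTF-8 JSON document.
    """
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return json.load(fh)


def load_all(names, base_dir="."):
    """Return the parsed content of every file, in the order given."""
    records = []
    for raw in names:
        path = os.path.join(base_dir, clean_name(raw))
        records.append(load_gzipped_json(path))
    return records
```

With the question's variables this would be called as load_all(jsonfilename). Using a with block closes each file as soon as it has been read, which matters when there are thousands of them. If the parsed records are dictionaries, pandas.json_normalize can flatten them into a DataFrame that is then written out with to_csv.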

Solution 1
0 2022-07-07 03:58:31
