由于 json.load 导致读取大 json 文件时出错

Question

我在读取大 json 文件时遇到问题。 JSONDecodeError：额外数据：第 1 行第 884 列（字符 883）。

文件 test2.json 在这里： https://github.com/SilverYar/TransportDataMiner

该错误是由于这些行代码行引起的：

import nltk
from nltk.stem.snowball import RussianStemmer
from nltk.corpus import stopwords
import nltk, string, json

with open('C:\\Creme\\token\\test2.json') as fin:
    text = json.load(fin)

我不明白如何解决它。 帮我修一下。

Answer 1

您的 json 文件的内容似乎无效，有多个对象但没有用“，”分隔。

例如，一个有效的 json object 应该是：

[{"title":"some text", "subtitle": "some text"},
 {"title":"some text", "subtitle": "some text"},
{"title":"some text", "subtitle": "some text"}]

读取它的一个简单技巧是读取文件并将字符串格式化为正确的 json 格式：

with open('test2.json', 'r') as fin:
    text = fin.read()
    formated_text = text.replace('}{', '},{')
    json_data = json.loads(f'[{formated_text}]')

print(len(json_data))
# 11772

Answer 2

好吧，我了解 unveriatlifdoafoh GH (PO opg )))_FI++ SE(G&B 在某种意义上的变化，解决旧的（架构、e2e 测试、开发工具等）和新的（可访问性、Typescript 集成、人工智能聊天机器人采用等） .) 问题是从整个 JS 会议的不同应用方法中获得见解。 strong text

由于 json.load 导致读取大 json 文件时出错

问题描述

1 个解决方案

解决方案1
1 2019-10-21 14:16:05

解决方案2
0 2019-10-21 14:21:06

由于 json.load 导致读取大 json 文件时出错

问题描述

1 个解决方案

解决方案1 1 2019-10-21 14:16:05

解决方案2 0 2019-10-21 14:21:06

解决方案1
1 2019-10-21 14:16:05

解决方案2
0 2019-10-21 14:21:06