簡體   English   中英

基於從python字典中分組刪除鍵值對

[英]Remove key-value pair based on grouping from dictionaries in python

我有一個包含多個字典的 JSON 文件 A.json。 我想從grouped by brand鍵“模型”中刪除常見的鍵值對。

例如,考慮品牌:“福特”:

{"Number": '123', "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang2":"3.00", "Mustang3":"1.00", "Mustang4":"1.64"}}

{"Number": '891', "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang8":"3.00", "Mustang3":"1.00", "Mustang6":"1.64"}}

兩個字典中通用的鍵model中的鍵是Mustang1Mustang3 所以我從模型中刪除了兩個鍵值對。 最終詞典將是:

 {"Number": '123', "brand": "Ford", "model":{"Mustang2":"3.00", "Mustang4":"1.64"}}
{"Number": '891', "brand": "Ford", "model":{"Mustang8":"3.00", "Mustang6":"1.64"}}

json

{"Number": '123', "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang2":"3.00", "Mustang3":"1.00", "Mustang4":"1.64"}}
{"Number": '321', "brand": "Toyota", "model":{"Camry":"2.64", "Prius":"3.00", "Corolla":"1.00", "Tundra":"1.64"}}
{"Number": '111', "brand": "Honda", "model":{"Accord":"2.64", "Civic":"3.00", "Insight":"1.00", "Pilot":"1.64"}}
{"Number": '891', "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang8":"3.00", "Mustang3":"1.00", "Mustang6":"1.64"}}
{"Number": '745', "brand": "Toyota", "model":{"Camry":"2.64", "Sienna":"3.00", "4Runner":"1.00", "Prius":"1.64"}}
{"Number": '325', "brand": "Honda", "model":{"Accord":"2.64", "Passport":"3.00", "HR-V":"1.00", "Pilot":"1.64"}}
{"Number": '745', "brand": "Accura", "model":{"TLX":"2.64", "MDX":"3.00"}}
{"Number": '325', "brand": "Accura", "model":{"TLX":"2.64", "MDX":"3.00"}}

預期結果:Result.json

{"Number": '123', "brand": "Ford", "model":{"Mustang2":"3.00", "Mustang4":"1.64"}}
{"Number": '321', "brand": "Toyota", "model":{"Corolla":"1.00", "Tundra":"1.64"}}
{"Number": '111', "brand": "Honda", "model":{"Civic":"3.00", "Insight":"1.00", "Pilot":"1.64"}}
{"Number": '891', "brand": "Ford", "model":{"Mustang8":"3.00", "Mustang6":"1.64"}}
{"Number": '745', "brand": "Toyota", "model":{"Sienna":"3.00", "4Runner":"1.00"}}
{"Number": '325', "brand": "Honda", "model":{"Passport":"3.00", "HR-V":"1.00", "Civic Type R":"1.64"}}
{"Number": '745', "brand": "Accura", "model":{}}
{"Number": '325', "brand": "Accura", "model":{}}

首先,您的A.json不是常規的 json 文件。 這是更正后的版本:

[{"Number": "123", "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang2":"3.00", "Mustang3":"1.00", "Mustang4":"1.64"}},
{"Number": "321", "brand": "Toyota", "model":{"Camry":"2.64", "Prius":"3.00", "Corolla":"1.00", "Tundra":"1.64"}},
{"Number": "111", "brand": "Honda", "model":{"Accord":"2.64", "Civic":"3.00", "Insight":"1.00", "Pilot":"1.64"}},
{"Number": "891", "brand": "Ford", "model":{"Mustang1":"2.64", "Mustang8":"3.00", "Mustang3":"1.00", "Mustang6":"1.64"}},
{"Number": "745", "brand": "Toyota", "model":{"Camry":"2.64", "Sienna":"3.00", "4Runner":"1.00", "Prius":"1.64"}},
{"Number": "325", "brand": "Honda", "model":{"Accord":"2.64", "Passport":"3.00", "HR-V":"1.00", "Pilot":"1.64"}},
{"Number": "745", "brand": "Accura", "model":{"TLX":"2.64", "MDX":"3.00"}},
{"Number": "325", "brand": "Accura", "model":{"TLX":"2.64", "MDX":"3.00"}}]

文件的內容應該用json模塊解析:

import io # to test without a file
f = io.StringIO(json_text) # json_text is a string containing the text above

import json
ds = json.load(f)

其次,你要按品牌建立set常用模型:

common_by_brand = {}
for d in ds:
    if d["brand"] in common_by_brand:
        common_by_brand[d["brand"]] &= set(d["model"])
    else:
        common_by_brand[d["brand"]] = set(d["model"])
    # {'Ford': {'Mustang1', 'Mustang3'}, 'Toyota': {'Camry', 'Prius'}, 'Honda': {'Accord', 'Pilot'}, 'Accura': {'TLX', 'MDX'}}

第三,只需遍歷列表並刪除那些常見模型:

for d in ds:
    common = common_by_brand[d["brand"]]
    d["model"] = {k: v for k, v in d["model"].items() if k not in common}
# [{'Number': '123', 'brand': 'Ford', 'model': {'Mustang2': '3.00', 'Mustang4': '1.64'}}, {'Number': '321', 'brand': 'Toyota', 'model': {'Corolla': '1.00', 'Tundra': '1.64'}}, {'Number': '111', 'brand': 'Honda', 'model': {'Civic': '3.00', 'Insight': '1.00'}}, {'Number': '891', 'brand': 'Ford', 'model': {'Mustang8': '3.00', 'Mustang6': '1.64'}}, {'Number': '745', 'brand': 'Toyota', 'model': {'Sienna': '3.00', '4Runner': '1.00'}}, {'Number': '325', 'brand': 'Honda', 'model': {'Passport': '3.00', 'HR-V': '1.00'}}, {'Number': '745', 'brand': 'Accura', 'model': {}}, {'Number': '325', 'brand': 'Accura', 'model': {}}]

四、將結果以json格式寫入文件:

g = io.StringIO()
json.dump(ds, g)
print (g.getvalue())

格式化輸出:

[{"Number": "123", "brand": "Ford", "model": {"Mustang2": "3.00", "Mustang4": "1.64"}},
{"Number": "321", "brand": "Toyota", "model": {"Corolla": "1.00", "Tundra": "1.64"}},
{"Number": "111", "brand": "Honda", "model": {"Civic": "3.00", "Insight": "1.00"}},
{"Number": "891", "brand": "Ford", "model": {"Mustang8": "3.00", "Mustang6": "1.64"}},
{"Number": "745", "brand": "Toyota", "model": {"Sienna": "3.00", "4Runner": "1.00"}},
{"Number": "325", "brand": "Honda", "model": {"Passport": "3.00", "HR-V": "1.00"}},
{"Number": "745", "brand": "Accura", "model": {}},
{"Number": "325", "brand": "Accura", "model": {}}]

首先,您需要使用json 內置庫在 python 中加載 json 。

然后,有幾種方法可以實現這一點。 例如,您可以迭代每個 dict 並在每次迭代時更新一個Counter 然后您刪除已被多次計數的每個鍵。

最后,您再次使用json庫將生成的 dict 轉儲到新文件中。

我假設您將使用標准的JSON格式。 您需要檢查字典中keyvalue type何時為dict類型。 isinstance()方法可用於此目的。 您可以使用以下代碼片段:

for key,value in your_json.items():
    if isinstance(value, dict):
       your_json[key]={}

我希望這可能會奏效。 干杯:)

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM