繁体   English   中英

在python中递归生成列表的子集

[英]Recursively generate subset of list in python

我有一个类似于以下内容的json文件:

[
  {
     "category1":"0120391123123"
  },
  [
     {
        "subcategory":"0120391123123"
     },
     [
        {
           "subsubcategory":"019301948109"
        },
        [
           {
              "subsubsubcategory":"013904123908"
           },
           [
              {
                 "subsubsubsubcategory":"019341823908"
              }
           ]
        ]
     ]
  ],
  [
     {
        "subcategory2":"0934810923801"
     },
     [
        {
           "subsubcategory2":"09341829308123"
        }
     ]
  ],
  [
     {
        "category2":"1309183912309"
     },
     [
        {
           "subcategory":"10293182094"
        }
     ]
  ]
]

我还有一份我想在原始列表中找到的类别列表。 如果类别存在于categoriesToFind中,我还想查找所有子类别并返回它们。

categoriesToFind = ['019301948109', '1309183912309']

finalCategories = []

def findCategories(currentList, isFirstIteration):
    for x in currentList:
        if type(x) is dict and (next(iter(x.values())) in categoriesToFind or not isFirstIteration):
            finalCategories.append(next(iter(x.values())))
            if len(currentList) < currentList.index(x) + 1:
                findCategories(currentList[currentList.index(x) + 1], False)

findCategories(data, True)

我希望finalCategories包含以下内容:

['019301948109', '013904123908', '019341823908', '1309183912309', '10293182094']

您可以使用生成器的递归:

categoriesToFind = ['019301948109', '1309183912309']
d = [{'category1': '0120391123123'}, [{'subcategory': '0120391123123'}, [{'subsubcategory': '019301948109'}, [{'subsubsubcategory': '013904123908'}, [{'subsubsubsubcategory': '019341823908'}]]]], [{'subcategory2': '0934810923801'}, [{'subsubcategory2': '09341829308123'}]], [{'category2': '1309183912309'}, [{'subcategory': '10293182094'}]]]
def get_subcategories(_d, _flag):
   flag = None
   for i in _d:
     if isinstance(i, dict):
       _val = list(i.values())[0]
       if _val in categoriesToFind or _flag:
         yield _val
         flag = True
     else:
        yield from get_subcategories(i, _flag or flag)

print(list(get_subcategories(d, False)))

输出:

['019301948109', '013904123908', '019341823908', '1309183912309', '10293182094']

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM