繁体   English   中英

如何在具有嵌套数据的大查询中查找重复项

[英]how to find duplicates in Big query having nested data

我有如下表结构,需要在 Google 大查询中查找重复记录

|- Main category
|- Field 1
|- Field 2
|- Field 3

+- sub category
    |- Field 1
    |- Field 2
    |- Field 3
    +- sub sub category
        |- Field 1
        |- Field 2
        |- Field 3

你的意思是这样的? 本质上,从数组中创建一个 json 字符串,将它们分组并排除 counts=1。

with test as (

    select * from unnest([
        struct(1 as id, [1,2,3] as subc),
        (2, [3,4,5]),
        (3, [1,2,3])
    ])
)

select 
    to_json_string(subc) as dupl
from test
group by 1
having count(dupl)>1

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM