繁体   English   中英

在 bigquery 中聚合列和构建数组

[英]Aggregate columns and build array in bigquery

从这样的表开始

WITH table AS (
SELECT 1001 as ID, 1 As Color_blue, 0  AS Color_red,  0 AS Color_black UNION ALL
SELECT 1002 as ID, 0 As Color_blue, 0  AS Color_red,  1 AS Color_black UNION ALL
SELECT 1003 as ID, 0 As Color_blue, 1  AS Color_red,  0 AS Color_black UNION ALL
SELECT 1004 as ID, 0 As Color_blue, 0  AS Color_red,  1 AS Color_black )

SELECT *
FROM table

在此处输入图像描述

我想聚合所有功能列并构建一个数组,如下所示:

select ID, array<float64>[Color_blue, Color_red, Color_black] features
from table

在此处输入图像描述

但是,我想动态地包含所有列名称,而不是在我的查询中对它们进行硬编码。 output 应该保持不变。 假设我不知道列名(仅“ID”),如何创建具有所有功能的浮点数数组?

以下是 BigQuery 标准 SQL

#standardSQL
SELECT id, 
  ARRAY(
    SELECT CAST(val AS FLOAT64) 
    FROM UNNEST(SPLIT(TRIM(FORMAT('%t', t), '()'))) val 
    WHERE CAST(id AS STRING) != val
  ) AS features
FROM `project.dataset.table` t   

当应用于您问题中的示例数据时 - 结果是

Row id      features     
1   1001    1.0  
            0.0  
            0.0  
2   1002    0.0  
            0.0  
            1.0  
3   1003    0.0  
            1.0  
            0.0  
4   1004    0.0  
            0.0  
            1.0  

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM