类似于 Spark 中的 groupByKey() 但使用 SQL 查询

Question

我试图使

 ID    CATEGORY  VALUE
'AAA'    'X'      123
'AAA'    'Y'      456
'BBB'    'X'      321
'BBB'    'Y'      654

进入

 ID     VALUE_X   VALUE_Y
'AAA'     123       456
'BBB'     321       654

仅使用 SQL 查询。 这有点类似于在 pyspark 中使用 groupByKey()。

有没有办法做到这一点？

Answer 1

只需使用条件聚合。 一种方法是：

select id,
       max(case when category = 'X' then value end) as x_value,
       max(case when category = 'Y' then value end) as y_value
from t
group by id;

在 Postgres 中，这将使用标准filter子句来表达：

select id,
       max(value) filter (where category = 'X'),
       max(value) filter (where category = 'Y')
from t
group by id;

类似于 Spark 中的 groupByKey() 但使用 SQL 查询

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-03-15 00:29:53

类似于 Spark 中的 groupByKey() 但使用 SQL 查询

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-03-15 00:29:53

解决方案1
1 已采纳 2021-03-15 00:29:53