[英]Athena get the minimum value in each group and corresponding other column values
Input Table输入表
user id action date collection
aaa 1 view 2020-09-01 {some JSON data_1}
aaa 1 view 2020-09-02 {some JSON data_2}
aaa 1 view 2020-09-03 {some JSON data_3}
bbb 2 view 2020-09-08 {some JSON data_22}
bbb 2 view 2020-09-09 {some JSON data_23}
ccc 2 view 2020-09-01 {some JSON data_99}
ddd 3 view 2020-09-01 {some JSON data_88}
Output_Table输出表
user id action date collection
aaa 1 view 2020-09-01 {some JSON data_1}
bbb 2 view 2020-09-08 {some JSON data_22}
ccc 2 view 2020-09-01 {some JSON data_99}
ddd 3 view 2020-09-01 {some JSON data_88}
if we see input table and output_table,如果我们看到输入表和输出表,
i want similar to this我想要类似的
group by (user,id,action) then i need min(date) and corresponding collection value
Can anyone suggest an idea?任何人都可以提出一个想法吗?
One option is to flter with a subquery:一种选择是使用子查询进行过滤:
select t.*
from mytable t
where t.date = (
select min(t1.date) from mytable t1 where t1.user = t.user
)
Another solution is to use window functions to rank records having the same user
by date
, then use that information to filter the resultset:另一种解决方案是使用窗口函数按
date
具有相同user
的记录进行排名,然后使用该信息过滤结果集:
select *
from (
select t.*, row_number() over(partition by user order by date) rn
from mytable t
) t
where rn = 1
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.