[英]count distinct window function Databricks
I am implementing count distinct window functions in Databricks.我正在 Databricks 中实现计数不同的 window 函数。
select *,count(distinct Marks) over(partition by Name) from data
It seems that count distinct is not supported in Databricks, how can I replicate the same query in databricks. Databricks 似乎不支持 count distinct,我如何在 databricks 中复制相同的查询。
Using collect_set
+ size
functions:使用
collect_set
+ size
函数:
select *, size(collect_set(Marks)) over(partition by Name) from data
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.