[英]Select unique from two columns with group by
I have a table:我有一张桌子:
CREATE TABLE stats_test
(
id1 bigint,
id2 bigint,
date timestamp with time zone
);
And data inside:和里面的数据:
id1 | id2 | date
-----+-----+------------------------
1 | 2 | 2020-12-01 00:00:00+00
2 | 1 | 2020-12-01 00:00:00+00
3 | 4 | 2020-11-01 00:00:00+00
4 | 3 | 2020-11-01 00:00:00+00
1 | 3 | 2020-12-01 00:00:00+00
1 | 3 | 2020-11-01 00:00:00+00
With this query i get results:通过此查询,我得到结果:
SELECT EXTRACT(YEAR FROM date), EXTRACT(MONTH FROM date),
COUNT(DISTINCT id1) AS unique_id1, COUNT(DISTINCT id2) AS unique_id2
FROM stats_test GROUP BY EXTRACT(YEAR FROM date), EXTRACT(MONTH FROM date);
date_part | date_part | unique_id1 | unique_id2
-----------+-----------+------------+------------
2020 | 11 | 3 | 2
2020 | 12 | 2 | 3
How to get another column with count unique ids from the set of both column (id1, id2) grouped by year and month?如何从按年和月分组的两个列(id1,id2)的集合中获取具有计数唯一ID的另一列?
date_part | date_part | unique_id1 | unique_id2 | unique_both_ids
-----------+-----------+------------+------------+----------------
2020 | 11 | 3 | 2 |
2020 | 12 | 2 | 3 |
count(distinct..)
only allows a single expression (so count(distinct id1,id2)
is rejected), but you can use an anonymous row expression to overcome that limitation: count(distinct..)
只允许单个表达式(因此count(distinct id1,id2)
被拒绝),但您可以使用匿名行表达式来克服该限制:
select extract(year from date) as year,
extract(month from date) as month,
count(distinct id1) as unique_id1,
count(distinct id2) as unique_id2,
count(distinct (id1,id2)) as unique_both_ids
from stats_test
group by extract(year from date), extract(month from date);
Note that 1,2 and 2,1 would be considered two different things.请注意,1,2 和 2,1 将被视为两个不同的事物。 If you want them to be treated the same use:
count(distinct (least(id1,id2), greatest(id1,id2)))
如果您希望它们被视为相同的用途:
count(distinct (least(id1,id2), greatest(id1,id2)))
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.