簡體   English   中英

Postgres 中用於查找最后一個值的聚合

[英]Aggregation in Postgres for finding last value

我在 postgres 中有一個包含聚合數據的表,這些表具有以下字段

search_term --> 一個特定的搜索詞
日期 --> 執行搜索的日期
search_count --> 使用此搜索詞執行了多少次搜索
min_result_count --> 搜索詞返回的最小結果數是多少
max_results_count --> 搜索詞返回的最大結果數是多少
last_result_count --> 上次搜索時返回的搜索結果數
zero_result_count --> 這個搜索詞有多少次沒有結果

其中datesearch_term組合是唯一的,這意味着搜索詞不會在日期重復,而是會更新值。

我正在嘗試編寫一個為期 7 天的 sql 查詢以獲得以下記錄
搜索詞
min_result_count
max_result_count
zero_result_count
last_result_count

我可以使用聚合 MIN、MAX、SUM 找到所有值,但我無法找到 last_result_count 的值,因為這需要我只獲取最后一個值。

這是一張具有預期結果的同一張表

search_term    search_count    min_rc    max_rc    zero_count    last_rc    date
---------------------------------------------------------------------------------------
term1          10              10        20        0              4        01-01-2020
term1          10              11        21        0              5        02-01-2020
term1          10              12        22        0              6        03-01-2020
term1          10              13        23        0              7        04-01-2020
term1          10              14        24        0              8        05-01-2020

term2          10              24        25        0              9        01-01-2020
term2          10              23        26        0              10       02-01-2020
term2          10              22        27        0              11       03-01-2020
term2          10              21        28        0              12       04-01-2020
term2          10              0         29        3              0        04-01-2020

如果我運行查詢 05-01-2020,我應該得到

search_term    search_count    min_rc    max_rc    zero_count    last_rc
-------------------------------------------------------------------------
term1          50              10        24        0              8      
term2          50              0         29        3              0     

如果我運行查詢 04-01-2020,我應該得到

search_term    search_count    min_rc    max_rc    zero_count    last_rc
-------------------------------------------------------------------------
term1          40              10        23        0              7      
term2          40              21        28        0              12     

如果我運行查詢 03-01-2020,我應該得到

search_term    search_count    min_rc    max_rc    zero_count    last_rc
-------------------------------------------------------------------------
term1          30              10        23        0              6      
term2          30              22        27        0              11     
  • rc 代表 result_count

依此類推,派生 last_result_count 的任何幫助都會非常有幫助

您可以為此使用ROW_NUMBER window function。 ROW_NUMBER使用您的 int 對您的數據進行排序,然后生成一個數字。

ROW_NUMBER()OVER(PARTITION BY date,search_term ORDER BY LAST_RC) AS ROW_NUMBERED_COLUMN

然后,您可以對數據進行分組並使用MAX(ROW_NUMBERED_COLUMN)

您可以使用如下所示的window_functions

Select search_term ,
SUM(search_count) OVER (partition by search_term order BY date)  as search_count,
MIN(min_rc) OVER (partition by search_term order BY date)  as min_rc,
MAX(max_rc) OVER (partition by search_term order BY date)  as max_rc,
zero_count,
last_rc , 
DATE 
from t
ORDER BY search_term,date 

結果集:

search_term    search_count    min_rc    max_rc    zero_count    last_rc   date
term1          10              10        20         0              4       01-01-2020
term1          20              10        21         0              5       02-01-2020
term1          30              10        22         0              6       03-01-2020
term1          40              10        23         0              7       04-01-2020
term1          50              10        24         0              8       05-01-2020
term2          10              24        25         0              9       01-01-2020
term2          20              23        26         0              10      02-01-2020
term2          30              22        27         0              11      03-01-2020
term2          50              0         29         0              12      04-01-2020
term2          50              0         29         3              0       04-01-2020

更新后的版本*

SELECT search_term,search_count, min_rc, max_rc, zero_count, last_rc
FROM
(SELECT search_term ,
        SUM(search_count) OVER (partition by search_term order BY date) as search_count,
        MIN(min_rc) OVER (partition by search_term order BY date) as min_rc,
        MAX(max_rc) OVER (partition by search_term order BY date) as max_rc,
        zero_count,
        last_rc,
        RANK() OVER (partition by search_term order BY date desc) as rnk,
        date
 FROM t
 WHERE date <= '05-01-2020'
 ) A 
 WHERE A.rnk = 1

另一種更簡單的方法,我在您發表評論后意識到您想要什么。

SELECT search_term ,
SUM(search_count) as search_count,
MIN(min_rc) as min_rc,
MAX(max_rc) as max_rc,
SUM(zero_count) as zero_count,
(SELECT last_rc FROM t as a WHERE a.search_term = t.search_term AND a.date = 
 t.date ORDER BY date desc LIMIT 1) AS last_rc,
MAX(date) as date
FROM t
WHERE date <= '05-01-2020'
GROUP BY search_term
ORDER BY search_term

使用 window function last_value 更簡單

Select search_term ,
SUM(search_count) as search_count,
MIN(min_rc) as min_rc,
MAX(max_rc) as max_rc,
SUM(zero_count) as zero_count,
LAST_VALUE(last_rc) OVER (Partition by search_term ORDER BY date desc) AS last_rc,
MAX(date) as date
FROM t
WHERE date <= '03-01-2020'
GROUP BY search_term
ORDER BY search_term

使用任何更新版本的結果集。

search_term search_count    min_rc  max_rc  zero_count  last_rc
term1       50              10      24      0           8
term2       50              0       29      3           0

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM