Select up to N rows for each unique value of a column
I have a table with the following columns:
First Name,
Last Name,
Age
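Let's assume we have some rows in this table.
I want a result set that contains at most N records for each age. (The records can be chosen at random.)
Can you suggest how to do this?
For example, if N = 3, then we would have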
2 records with age = 25
3 records with age = 26
3 records with age = 27
I would use the ROW_NUMBER function like this:
DECLARE @TopN INT;
SET @TopN = 3;
SELECT ...
FROM
(
SELECT ...,
RowNum = ROW_NUMBER() OVER(PARTITION BY t.Age ORDER BY t.LastName, t.FirstName)
FROM MySchema.MyTable AS t
) src
WHERE src.RowNum <= @TopN
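To try the pattern without a SQL Server instance, here is a minimal sketch using Python's built-in sqlite3 module (window functions need SQLite 3.25 or later). The table name `people`, the sample rows, and the helper `top_n_per_age` are hypothetical and exist only for illustration; the query follows the same ROW_NUMBER() ... PARTITION BY age pattern as above.

```python
import sqlite3

# Hypothetical sample data: 2 people aged 25, 4 aged 26, 5 aged 27.
ROWS = [
    ("Ann", "Adams", 25), ("Bob", "Baker", 25),
    ("Cid", "Clark", 26), ("Dee", "Davis", 26),
    ("Eve", "Evans", 26), ("Fay", "Frank", 26),
    ("Gus", "Green", 27), ("Hal", "Hayes", 27), ("Ivy", "Irwin", 27),
    ("Jon", "Jones", 27), ("Kim", "Klein", 27),
]


def top_n_per_age(rows, top_n):
    """Return at most top_n rows per age, ordered by last and first name."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE people (first_name TEXT, last_name TEXT, age INTEGER)"
    )
    conn.executemany("INSERT INTO people VALUES (?, ?, ?)", rows)
    query = """
        SELECT src.first_name, src.last_name, src.age, src.row_num
        FROM (
            SELECT p.*,
                   -- Number the rows inside each age group, starting at 1.
                   ROW_NUMBER() OVER (PARTITION BY p.age
                                      ORDER BY p.last_name, p.first_name) AS row_num
            FROM people AS p
        ) AS src
        WHERE src.row_num <= ?
        ORDER BY src.age, src.last_name, src.first_name
    """
    result = conn.execute(query, (top_n,)).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for row in top_n_per_age(ROWS, 3):
        print(row)
```

With this sample data and N = 3, the age groups come back with 2, 3 and 3 rows, which matches the shape described in the question.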
If you have the AdventureWorks database installed (I used AdventureWorks2008), you can test with the following script:
-- Because the Person.Person table doesn't have an `Age` column,
-- I create a new table (dbo.Persons) with the following columns:
-- BusinessEntityID, LastName, FirstName and Age
SELECT p.BusinessEntityID, p.LastName, p.FirstName,
1 + ABS(CHECKSUM(NEWID())) % 100 AS Age
INTO dbo.Persons
FROM Person.Person p;
GO
/*
ALTER TABLE dbo.Persons
ADD CONSTRAINT PK_Persons_BusinessEntityID
PRIMARY KEY (BusinessEntityID)
*/
DECLARE @TopN INT;
SET @TopN = 3;
SELECT src.BusinessEntityID, src.LastName, src.FirstName, src.Age, src.RowNum
FROM
(
SELECT p.BusinessEntityID, p.LastName, p.FirstName, p.Age,
RowNum = ROW_NUMBER() OVER(PARTITION BY p.Age ORDER BY p.LastName, p.FirstName)
FROM dbo.Persons AS p
) src
WHERE src.RowNum <= @TopN
ORDER BY src.Age, src.LastName, src.FirstName;
-- DROP TABLE dbo.Persons
Results:
BusinessEntityID LastName FirstName Age RowNum
---------------- --------- ---------- --- ------
...
10905 Allen Kaitlyn 30 1
15052 Alonso Gina 30 2
5505 Alonso Jessie 30 3
20216 Alexander Alyssa 31 1
3789 Allen Wyatt 31 2
2798 Alonso Alfredo 31 3
16850 Adams Gabriel 32 1
4747 Adams Ian 32 2
7761 Alexander Jacqueline 32 3
...
You can use the ROW_NUMBER() function to emulate this behavior:
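SELECT t.*
FROM (SELECT s.*,
             -- ORDER BY 1 sorts by a constant, so which rows are kept per age is arbitrary.
             -- SQL Server rejects an integer constant here; use ORDER BY (SELECT NULL) there.
             ROW_NUMBER() OVER (PARTITION BY age ORDER BY 1) AS rk
      FROM some_table s
     ) t
WHERE rk <= 3;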
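The question allows the rows kept for each age to be random. One way to get that, sketched here under assumptions, is to order each partition by a random value: random() in SQLite, with NEWID() being the commonly used counterpart in SQL Server. The contents of `some_table` and the helper `random_n_per_age` are hypothetical, and the sketch again relies on Python's built-in sqlite3 module with SQLite 3.25 or later.

```python
import sqlite3


def random_n_per_age(ages, top_n):
    """Return (id, age) pairs with at most top_n randomly chosen rows per age."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE some_table (id INTEGER PRIMARY KEY, age INTEGER)")
    conn.executemany(
        "INSERT INTO some_table (age) VALUES (?)", [(a,) for a in ages]
    )
    query = """
        SELECT t.id, t.age
        FROM (
            SELECT s.*,
                   -- A random sort key per row makes the kept rows a random sample.
                   ROW_NUMBER() OVER (PARTITION BY s.age ORDER BY random()) AS rk
            FROM some_table AS s
        ) AS t
        WHERE t.rk <= ?
    """
    picked = conn.execute(query, (top_n,)).fetchall()
    conn.close()
    return picked


if __name__ == "__main__":
    # Hypothetical input: 2 rows with age 25, 4 with age 26, 5 with age 27.
    print(random_n_per_age([25] * 2 + [26] * 4 + [27] * 5, 3))
```

Which ids are returned changes from run to run, but each age never contributes more than N rows.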