SQL 中的 Pivot 表具有多列

Question

I have this data我有这个数据

ID  Month   PRODUCT VALUE_1 VALUE_2
1234    1   a        34 12
1233    2   B        54 
1245    3   c        23 42
1236    4   d        12 8
1238    1   a        56 5
1239    2   B        42 1
1234    3   c        32 6
1233    4   d           3
1245    1   a        8  6
1236    2   B        5  2
1238    3   c        1  6
1239    4   d        2  
1234    1   a        15 4
1233    2   c        8  12
1245    3   d        15 6
1236    4   b         1 1
1238    1   c        14 10
1239    2   d        13 6
1234    3   c        13 17
1233    4   b        15 5
1245    1   c        18 11
1236    2   d        12 15
1238    3   c        8  12
1239    4   a       17  4

and trying to reach to this:并试图达到这一点：

SUM   a                 b               c               d           
ID    a_1   a_2 a_3 a_4 b_1 b_2 b_3 b_4 c_1 c_2 c_3 c_4 d_1 d_2 d_3 d_4
1233    0   0   0   0   0   54  0   15  0   8   0   0   0   0   0   0
1234    49  0   0   0   0   0   0   0   0   0   45  0   0   0   0   0
1236    0   0   0   0   0   5   0   1   0   0   0   0   0   12  0   12
1238    56  0   0   0   0   0   0   0   14  0   9   0   0   0   0   0
1239    0   0   0   17  0   42  0   0   0   0   0   0   0   13  0   2
1245    8   0   0   0   0   0   0   0   18  0   23  0   0   0   15  0

Basically, I want to see each product value aggregated for each month;基本上，我想查看每个月汇总的每个产品价值；

In python would be the equivalent to, just one line of code.在 python 中，只需一行代码。

data.pivot_table(index=['ID'], columns=["Product","Month"],
                            values=["Value_1","Value_2"], 
                            aggfunc=np.sum, 
                            dropna = False, fill_value=0)

I've tried this:我试过这个：

WITH pivot_table as (SELECT * FROM
(
SELECT ID, PRODUCT,Value_1,Value_2, Month  
FROM DATASET  
) New_dataset
PIVOT (
    --#2 aggregation
    SUM(Value_1) as value1 , SUM(Value_2) as value2
    --#3 Pivot_column
    FOR PRODUCT IN ('a','b','c','d') 
))
#as pivot_table
SELECT ID ,Month, SUM(value1_a) as a_1, SUM(value2_a) as a_2 FROM pivot_table
GROUP BY ID, Month
ORDER BY value_1

I've got so far to this table, which ignores the blanks, and still needs to be transposed我已经到了这个表，它忽略了空格，仍然需要转置

 ID     MONTH  value1_a
1234    1      49
1238    1      56
1239    4      17
1245    1      8

But I will still need to run for all the others and concatenate?但是我仍然需要为所有其他人运行并连接？ or do i have to write all the products?还是我必须写所有的产品？ and then transpose?然后转置？ SQL might be able to do this in one go right? SQL 可能能够在一个 go 中做到这一点，对吧？ or am I thinking to much on python way?还是我对 python 的方式有很多想法？

Answer 1

I will still need to run for all the others and concatenate?我仍然需要为所有其他人运行并连接吗？ or do i have to write all the products?还是我必须写所有的产品？ and then transpose?然后转置？ SQL might be able to do this in one go right? SQL 可能能够在一个 go 中做到这一点，对吧？

Below solution makes it下面的解决方案使它

execute immediate (             
select '''select * from (select id, lower(product) || '_' || month product_month, value_1 from `project.dataset.table`)
  pivot(sum(value_1) for product_month in ("''' ||  string_agg(product_month, '", "')  || '''"))
  order by id
'''
from (
  select product || '_' || month product_month 
  from (select distinct lower(product) product from `project.dataset.table`)
  cross join (select distinct month from `project.dataset.table`) 
  order by product_month) 
);

If applied to sample data in your question - output is如果应用于您问题中的示例数据 - output 是

SQL 中的 Pivot 表具有多列

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-06-09 18:35:15

SQL 中的 Pivot 表具有多列

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-06-09 18:35:15

解决方案1
1 已采纳 2021-06-09 18:35:15