How to delete 90% of records from each group of a table (postgres)

Question

I have a table called 'sales' in postgres which has a column called 'region'. I am trying to find out a way to delete 90% of records from each 'region' of the same table.

I am using the below query. But the same is not working in postgres and also the table does not have a primary/unique key column

delete from table
 ( select row_number() over (partition by region) as PAR 
   from sales
 )b  
where PAR >= 
 ( select S*0.1 as ninety
   from 
    ( select region, count(*) as S 
      from sales 
      group by region
    )a
and b.region = a.region

can anyone provide any better solution to this.

Answer 1

If you have an unique id in the table, you can do:

delete
    from t
    using (select t.*,
                  row_number() over (partition by region order by region) as seqnum,  -- I always include order by
                  count(*) over (partition by region) as cnt
           from t
          ) tt
    where t.id = tt.id and
          tt.seqnum < 0.9 * cnt;

How to delete 90% of records from each group of a table (postgres)

Question

1 answers

solution1
0 ACCPTED 2018-11-09 11:41:39

How to delete 90% of records from each group of a table (postgres)

Question

1 answers

solution1 0 ACCPTED 2018-11-09 11:41:39

solution1
0 ACCPTED 2018-11-09 11:41:39