在 Postgres 中使用公共列进行索引

Question

Suppose I have 4 attributes in my Postgres db a,b,c,d.假设我的 Postgres db a、b、c、d 中有 4 个属性。

I want to create an index on a and b, these are the two options I can go ahead with我想在 a 和 b 上创建索引，这是我可以继续使用的两个选项

indexing a and b separately分别索引 a 和 b
creating index as (c,a) and (c,b)创建索引为 (c,a) 和 (c,b)

Will creating index in the 2nd form optimize the performance of db?以第二种形式创建索引会优化数据库的性能吗？ And how does indexing work in that case?在这种情况下，索引如何工作？

Answer 1

An index can only be efficiently used by a query if a prefix of its columns is applied to the query's WHERE -condition.只有将其列的前缀应用于查询的WHERE条件时，查询才能有效地使用索引。 The order of the columns in the CREATE INDEX -statement matters. CREATE INDEX语句中列的顺序很重要。 Suppose we have a table假设我们有一张桌子

CREATE TABLE my_table(
   a INT,
   b INT,
   c INT,
   d INT
);

and indexes和索引

CREATE INDEX idx_a ON my_table (a);
CREATE INDEX idx_b ON my_table (b);
CREATE INDEX idx_ca ON my_table (c, a);
CREATE INDEX idx_cb ON my_table (c, b);

then query然后查询

SELECT * FROM my_table WHERE a = 42;

will only use idx_a (while all other indexes won't fit to this query),只会使用idx_a （而所有其他索引都不适合此查询），

SELECT * FROM my_table WHERE b = 42;

will only use idx_b (while all other indexes won't fit to this query),只会使用idx_b （而所有其他索引都不适合此查询），

SELECT * FROM my_table WHERE a = 42 and c = 23;

will use idx_ca ( idx_a would fit, too, but it has probably less performance),将使用idx_ca （ idx_a也适合，但性能可能较差），

SELECT * FROM my_table WHERE b = 42 and c = 23;

will use idx_cb ( idx_b would fit, too, but it has probably less performance) and将使用idx_cb （ idx_b也适合，但它的性能可能较低）和

SELECT * FROM my_table WHERE c = 23;

can eather use idx_ca or idx_cb (the optimizer will decide by statistical information).可以使用idx_ca或idx_cb （优化器将根据统计信息决定）。

So it depends on your queries.所以这取决于你的查询。 If they always include a condition on c then idx_ca will presumably have better query performance in comparison to idx_a .如果它们总是在c上包含一个条件，那么idx_ca与idx_a相比可能具有更好的查询性能。

Of course you shouldn't create indexes on suspicion that actually aren't needed.当然，您不应该创建实际上不需要的怀疑索引。 In Postgres you can use EXPLAIN ( docs ) to see how the optimizer makes use of the indexes you have defined.在Postgres中，您可以使用EXPLAIN ( docs ) 查看优化器如何使用您定义的索引。

Answer 2

As already said, it really depends ot the queries on the table.如前所述，它实际上取决于表上的查询。

Postgres can in certain situation use several index for a query. Postgres 在某些情况下可以使用多个索引进行查询。 See this very good answer: https://dba.stackexchange.com/a/91269/216879看到这个很好的答案： https ://dba.stackexchange.com/a/91269/216879

If it is possible You can create the 4 indexes, and check after some times which of your index are not used by querying the pg_stat_user_indexes table.如果可能，您可以创建 4 个索引，并在一段时间后通过查询pg_stat_user_indexes表来检查哪些索引未使用。

More litterature about index utilization from the offical documentation: https://www.postgresql.org/docs/current/indexes-bitmap-scans.html#:~:text=Fortunately%2C%20PostgreSQL%20has%20the%20ability,conditions%20across%20several%20index%20scans .更多关于官方文档中索引利用率的文献： https ://www.postgresql.org/docs/current/indexes-bitmap-scans.html#:~:text=Fortunately%2C%20PostgreSQL%20has%20the%20ability,conditions %20across%20several%20index%20scans 。

在 Postgres 中使用公共列进行索引

问题描述

2 个解决方案

解决方案1
0 2022-07-19 09:31:29

解决方案2
0 2022-07-21 19:33:33

在 Postgres 中使用公共列进行索引

问题描述

2 个解决方案

解决方案1 0 2022-07-19 09:31:29

解决方案2 0 2022-07-21 19:33:33

解决方案1
0 2022-07-19 09:31:29

解决方案2
0 2022-07-21 19:33:33