How to generate statistics of a table and columns of a table in Snowflake?
Is there any function available, like Generate Statistics in Netezza, to generate column metadata (duplicates, unique values, min value, max value, etc.) in Snowflake?
No, not really.
You have the TABLES view, which contains size (storage) and number of rows, but the rest of the information (including the COLUMNS view) is related to schema metadata and not data metadata.
On the other hand, the table structure itself (aka micro-partitions) contains table metadata that makes e.g. the MIN() and MAX() functions very efficient. Some of the table statistics may be cached globally (i.e. in the Cloud Services part of the Snowflake architecture).
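Since there is no built-in equivalent, the statistics listed in the question can be computed with ordinary aggregates. Below is a minimal sketch, not a Snowflake feature: it uses Python's sqlite3 module as a local stand-in so that it runs anywhere, and the table name orders, the column name customer_id, and the helper profile_column are all hypothetical. The aggregates it relies on (COUNT, COUNT(DISTINCT ...), MIN, MAX) are also available in Snowflake, so the SELECT itself should carry over with little or no change.

```python
import sqlite3


def profile_column(conn, table, column):
    """Return row count, distinct count, duplicate count, min and max for one column."""
    # Identifiers cannot be bound as query parameters, so they are interpolated.
    # Only pass trusted table and column names to this helper.
    query = f"""
        SELECT COUNT(*)                                    AS row_count,
               COUNT(DISTINCT {column})                    AS distinct_count,
               COUNT({column}) - COUNT(DISTINCT {column})  AS duplicate_count,
               MIN({column})                               AS min_value,
               MAX({column})                               AS max_value
        FROM {table}
    """
    row = conn.execute(query).fetchone()
    keys = ("row_count", "distinct_count", "duplicate_count", "min_value", "max_value")
    return dict(zip(keys, row))


# Hypothetical sample data standing in for a real table (assumption, not from the thread).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer_id INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?)",
    [(3,), (7,), (7,), (12,), (None,)],
)

stats = profile_column(conn, "orders", "customer_id")
print(stats)
# {'row_count': 5, 'distinct_count': 3, 'duplicate_count': 1, 'min_value': 3, 'max_value': 12}
```

Note that duplicate_count here counts non-NULL values beyond the first occurrence of each distinct value; NULLs are ignored by COUNT(column), which is why the one NULL row does not count as a duplicate.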
Thank you for the question on stats gathering in Snowflake. Some information:
All these auto-magic features happen without manual user intervention (which is why Snowflake is known as a self-tuning, simple-to-use data warehousing platform).