COPY INTO: Is there a way to show the number of records skipped while loading data into Snowflake?

Question

I am using COPY INTO <table> from an external location, and there is an option to continue loading the data when a row has corrupted data. Is there an option to show how many rows were skipped while loading, like the one in Teradata TPT?

Answer 1

Assuming that you are not doing transformations in your COPY INTO command, you can use the VALIDATE() function after the load to get the skipped records and the reason why they were not loaded:

https://docs.snowflake.com/en/sql-reference/functions/validate.html

In the example below, t1 is the table being loaded. You can also pass a specific query ID as job_id if you know it:

select * from table(validate(t1, job_id => '_last'));
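To target a specific load rather than the most recent one, the same statement can be built with a query ID in place of '_last'. A minimal Python sketch: `build_validate_sql` is a hypothetical helper (not part of any Snowflake library), the query ID in the usage line is a made-up placeholder, and the check that a query ID contains only hex digits and hyphens is an assumption.

```python
import re

# Hypothetical helper, not part of any Snowflake library.
def build_validate_sql(table: str, job_id: str = "_last") -> str:
    """Return a VALIDATE() query for the given table and COPY INTO job."""
    # The table name is interpolated into SQL, so accept only simple,
    # optionally qualified identifiers (db.schema.table at most).
    if not re.fullmatch(r"[A-Za-z_][A-Za-z0-9_$]*(\.[A-Za-z_][A-Za-z0-9_$]*){0,2}", table):
        raise ValueError(f"unexpected table name: {table!r}")
    # Assumption: a query ID contains only hex digits and hyphens.
    # '_last' refers to the most recent load, as in the answer above.
    if job_id != "_last" and not re.fullmatch(r"[0-9A-Fa-f-]+", job_id):
        raise ValueError(f"unexpected job id: {job_id!r}")
    return f"select * from table(validate({table}, job_id => '{job_id}'))"

print(build_validate_sql("t1"))
# Placeholder query ID, for illustration only.
print(build_validate_sql("t1", "01234567-89ab-cdef-0123-456789abcdef"))
```

The returned string can then be run through whatever client you already use for the COPY INTO itself.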

Answer 2

The COPY INTO command outputs the following columns:

ROWS_PARSED: Number of rows parsed from the source file
ROWS_LOADED: Number of rows loaded from the source file
ERROR_LIMIT: If the number of errors reaches this limit, then abort
ERRORS_SEEN: Number of error rows in the source file

The number of rows skipped can be calculated as ROWS_PARSED - ROWS_LOADED. I am using pyodbc, so how you parse these columns may differ depending on how you are scripting.
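The subtraction can be sketched in Python roughly as follows. This assumes the COPY INTO result was fetched through a DB-API cursor such as pyodbc's (`cursor.description` and `cursor.fetchall()`), that the column names match the list above case-insensitively, and that the result contains one row per source file. `summarize_copy_result` is a hypothetical helper name, and the sample rows are made up for illustration.

```python
from typing import Iterable, Sequence

# Hypothetical helper: totals the COPY INTO output across all result rows.
def summarize_copy_result(columns: Sequence[str], rows: Iterable[Sequence]) -> dict:
    """Return total parsed, loaded, and skipped row counts."""
    names = [c.lower() for c in columns]
    parsed_idx = names.index("rows_parsed")
    loaded_idx = names.index("rows_loaded")
    parsed = loaded = 0
    for row in rows:
        parsed += int(row[parsed_idx])
        loaded += int(row[loaded_idx])
    # Skipped rows = ROWS_PARSED - ROWS_LOADED, summed over all files.
    return {"parsed": parsed, "loaded": loaded, "skipped": parsed - loaded}

# With pyodbc, after cursor.execute(<your COPY INTO statement>):
#   columns = [d[0] for d in cursor.description]
#   rows = cursor.fetchall()
# Made-up result rows for illustration, one per file:
columns = ["file", "status", "rows_parsed", "rows_loaded", "error_limit", "errors_seen"]
rows = [
    ("a.csv", "PARTIALLY_LOADED", 100, 97, 100, 3),
    ("b.csv", "LOADED", 50, 50, 50, 0),
]
print(summarize_copy_result(columns, rows))
# prints {'parsed': 150, 'loaded': 147, 'skipped': 3}
```

Looking columns up by name rather than by position keeps the helper working if the driver returns additional columns around the four listed above.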
