Amazon Athena: How do I store query results without the column headers?

Question

I ran a simple query from the Athena dashboard on CSV data. The result was a CSV file with column headers. When storing the results, Athena writes the column headers to S3 as well. How can I skip storing the header column names? I have to create a new table from the results, and removing them each time is repetitive.

Answer 1

From an Eric Hammond post on the AWS Forums:

...
  WHERE
    date NOT LIKE '#%'
...

I found this works! The steps I took:

1. Ran an Athena query, with the output going to Amazon S3.
2. Created a new table pointing to this output, based on "How do I use the results of my Amazon Athena query in another query?", changing the path to the correct S3 location.
3. Ran a query on the new table with the above WHERE <datefield> NOT LIKE '#%'.

However, subsequent queries store even more data in that S3 directory, which confuses any later executions.
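
The steps above could be sketched roughly as follows. This is only an illustration: the table name query_results, the column names, and the S3 path are hypothetical placeholders, and the second condition in the WHERE clause is an added assumption (not part of the quoted post) for the case where the header row does not start with '#' but simply repeats the column name.

```sql
-- Run each statement separately in the Athena console.
-- Hypothetical table over the directory that holds the earlier query output.
CREATE EXTERNAL TABLE IF NOT EXISTS query_results (
  `datefield` string,
  `field2` string
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
LOCATION 's3://your-bucket/your-results-prefix/';

-- Read the data back while filtering out header-like rows.
SELECT *
FROM query_results
WHERE datefield NOT LIKE '#%'     -- filter quoted from the forum post
  AND datefield <> 'datefield';   -- assumption: header text equals the column name
```

Because every later query that writes to the same S3 prefix adds files there, pointing the table at a prefix used only for this one result set may avoid the confusion described above.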

Answer 2

Try "skip.header.line.count"="1". This feature has been available on AWS Athena since 2018-01-19. Here's a sample:

CREATE EXTERNAL TABLE IF NOT EXISTS tableName (
  `field1` string,
  `field2` string,
  `field3` string
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
  'separatorChar' = ',',
  'quoteChar' = '\"',
  'escapeChar' = '\\'
)
LOCATION 's3://fileLocation/'
TBLPROPERTIES ('skip.header.line.count'='1')
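
Once the table exists, a quick check along these lines may help confirm that the header row is no longer read as data. It reuses tableName and field1 from the sample above and assumes the header text in the file is the same as the column name, which may not hold for every data set.

```sql
-- Count rows whose first column still contains the header text.
-- If the header line is being skipped, this count should not include it.
SELECT count(*) AS header_rows
FROM tableName
WHERE field1 = 'field1';
```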

You can refer to this question: Aws Athena - Create external table skipping first row

Answer 1
1 2017-07-13 14:04:45

Answer 2
0 2018-03-23 07:42:13
