AWS Lambda function and Athena to create a partitioned table

Here are my requirements. Every day I receive a CSV file into an S3 bucket. I need to partition that data and store it as Parquet so I can eventually map a table on top of it (with Athena). I was thinking of using an AWS Lambda function that is triggered whenever a file is uploaded, but I'm not sure what the steps are to do that.

There are (as usual in AWS) several ways to do this; the first two that come to mind are:

  1. using a CloudWatch Event with an S3 PutObject (object-level) action as the trigger, and the Lambda function that you have already created as the target.
  2. starting from the Lambda function itself, where it is slightly easier to add suffix-filtered triggers (e.g. for any .csv file): go to the function configuration in the Console, add a trigger in the Designer section, choose S3, and set the options you want, e.g. bucket, event type, prefix, suffix. A sketch of wiring up such a trigger programmatically follows this list.
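
As a rough illustration of the second approach, here is a minimal boto3 sketch (not the only way) of attaching an S3 "object created" notification, filtered to .csv keys, to an existing Lambda function. The bucket name, function ARN, and statement id below are placeholders; the Lambda's resource policy must also allow S3 to invoke it, which the first call handles.

```python
import boto3

BUCKET = "my-csv-bucket"  # placeholder bucket name
FUNCTION_ARN = "arn:aws:lambda:eu-west-1:123456789012:function:csv-to-parquet"  # placeholder ARN

lambda_client = boto3.client("lambda")
s3_client = boto3.client("s3")

# Allow S3 to invoke the function (one-time permission on the Lambda side).
lambda_client.add_permission(
    FunctionName=FUNCTION_ARN,
    StatementId="s3-invoke-csv-to-parquet",
    Action="lambda:InvokeFunction",
    Principal="s3.amazonaws.com",
    SourceArn=f"arn:aws:s3:::{BUCKET}",
)

# Register the notification: fire on object creation, only for *.csv keys.
s3_client.put_bucket_notification_configuration(
    Bucket=BUCKET,
    NotificationConfiguration={
        "LambdaFunctionConfigurations": [
            {
                "LambdaFunctionArn": FUNCTION_ARN,
                "Events": ["s3:ObjectCreated:*"],
                "Filter": {
                    "Key": {
                        "FilterRules": [{"Name": "suffix", "Value": ".csv"}]
                    }
                },
            }
        ]
    },
)
```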

In either case, you will need to write the Lambda function that does the work you described, and it will need IAM access to the bucket to pull the files and process them.
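
For the function body itself, here is a minimal handler sketch, assuming the AWS SDK for pandas (awswrangler) is available to the function (e.g. via a layer) and that the CSV has a "date" column to partition on; the output path, Glue database, table name, and partition column are placeholders, not something prescribed by the question.

```python
from urllib.parse import unquote_plus

import awswrangler as wr

OUTPUT_PATH = "s3://my-parquet-bucket/table/"  # placeholder output location
GLUE_DATABASE = "my_database"                  # placeholder Glue/Athena database
GLUE_TABLE = "my_table"                        # placeholder table name


def lambda_handler(event, context):
    # The S3 event carries the bucket and (URL-encoded) key of the uploaded CSV.
    record = event["Records"][0]["s3"]
    bucket = record["bucket"]["name"]
    key = unquote_plus(record["object"]["key"])

    # Read the new CSV straight from S3 into a DataFrame.
    df = wr.s3.read_csv(f"s3://{bucket}/{key}")

    # Write Parquet partitioned by the "date" column and register/refresh the
    # table in the Glue catalog so Athena can query it right away.
    wr.s3.to_parquet(
        df=df,
        path=OUTPUT_PATH,
        dataset=True,
        partition_cols=["date"],
        database=GLUE_DATABASE,
        table=GLUE_TABLE,
        mode="append",
    )
    return {"rows_written": len(df)}
```

With this pattern the function's IAM role needs read access to the source bucket, write access to the output bucket, and Glue catalog permissions so the partitions are registered for Athena.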
