如何從 AWS Lambda 中的 s3 存儲桶讀取 csv 文件？

Question

我正在嘗試讀取上傳到 s3 存儲桶上的 csv 文件的內容。 為此，我從觸發 lambda 函數的事件中獲取存儲桶名稱和文件鍵，並逐行讀取。 這是我的代碼：

import json
import os
import boto3
import csv

def lambda_handler(event,  context):
    for record in event['Records']:
        bucket = record['s3']['bucket']['name']
        file_key = record['s3']['object']['key']
        s3 = boto3.client('s3')
        csvfile = s3.get_object(Bucket=bucket, Key=file_key)
        csvcontent = csvfile['Body'].read().split(b'\n')
        data = []
        with open(csvfile['Body'], 'r') as csv_file:
          csv_file = csv.DictReader(csv_file)
          data = list(csv_file)

我在 CloudWatch 上遇到的確切錯誤是：

[ERROR] TypeError: expected str, bytes or os.PathLike object, not list
Traceback (most recent call last):
  File "/var/task/lambda_function.py", line 19, in lambda_handler
    with open(csvcontent, 'r') as csv_file:

有人可以幫我解決這個問題嗎？ 感謝您提供的任何幫助，因為我是 lambda 的新手

Answer 1

要以正確且易於檢索的索引格式從 s3 存儲桶中獲取 CSV 文件數據，下面的代碼對我有很大幫助：

key = 'key-name'
bucket = 'bucket-name'
s3_resource = boto3.resource('s3')
s3_object = s3_resource.Object(bucket, key)

data = s3_object.get()['Body'].read().decode('utf-8').splitlines()

lines = csv.reader(data)
headers = next(lines)
print('headers: %s' %(headers))
for line in lines:
    #print complete line
    print(line)
    #print index wise
    print(line[0], line[1])

Answer 2

csvfile = s3.get_object(Bucket=bucket, Key=file_key)
csvcontent = csvfile['Body'].read().split(b'\n')

在這里，您已經檢索了文件內容並將其拆分為多行。 我不確定您為什么要再次open某些內容，您可以將csvcontent傳遞給您的閱讀器：

csv_data = csv.DictReader(csvcontent)

Answer 3

csvfile['Body']類型是StreamingBody ，因此您不能使用open xx with .

此代碼已從流中讀取所有數據。

csvcontent = csvfile['Body'].read().split(b'\n')

所以只需解析該行以獲得更有用的內容。

如何從 AWS Lambda 中的 s3 存儲桶讀取 csv 文件？

問題描述

3 個解決方案

解決方案1
11 2020-04-10 11:27:14

解決方案2
4 已采納 2019-07-02 09:31:53

解決方案3
1 2019-07-02 09:39:04

如何從 AWS Lambda 中的 s3 存儲桶讀取 csv 文件？

問題描述

3 個解決方案

解決方案1 11 2020-04-10 11:27:14

解決方案2 4 已采納 2019-07-02 09:31:53

解決方案3 1 2019-07-02 09:39:04

解決方案1
11 2020-04-10 11:27:14

解決方案2
4 已采納 2019-07-02 09:31:53

解決方案3
1 2019-07-02 09:39:04