用于批量转换的 AWS SageMaker 部署

Question

我正在尝试在 Sage Maker 中使用 XGBoost 模型，并使用它来使用批量转换对存储在 S3 中的大型数据进行评分。

我使用现有的 Sagemaker Container 构建模型，如下所示：

estimator = sagemaker.estimator.Estimator(image_name=container, 
                                          hyperparameters=hyperparameters,
                                          role=sagemaker.get_execution_role(),
                                          train_instance_count=1, 
                                          train_instance_type='ml.m5.2xlarge', 
                                          train_volume_size=5, # 5 GB 
                                          output_path=output_path,
                                          train_use_spot_instances=True,
                                          train_max_run=300,
                                          train_max_wait=600)

estimator.fit({'train': s3_input_train,'validation': s3_input_test})

下面的代码是用来做Batch Transform的

 The location of the test dataset
batch_input = 's3://{}/{}/test/examples'.format(bucket, prefix)

# The location to store the results of the batch transform job
batch_output = 's3://{}/{}/batch-inference'.format(bucket, prefix)

transformer = xgb_model.transformer(instance_count=1, instance_type='ml.m4.xlarge', output_path=batch_output)

transformer.transform(data=batch_input, data_type='S3Prefix', content_type='text/csv', split_type='Line')

transformer.wait()

当模型在 Jupyter 中构建时，上述代码在开发环境（Jupyter notebook）中运行良好。 但是，我想部署模型并调用其端点进行批量转换。

SageMaker 端点创建的大多数示例用于对单个数据进行评分，而不是用于批量转换。

有人可以指出如何在 SageMaker 中部署和使用批量转换的端点吗？ 谢谢

Answer 1

以下链接有一个示例，说明如何在 SageMaker 中调用存储模型来运行批量转换作业。

批量转换参考

用于批量转换的 AWS SageMaker 部署

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-10-29 19:29:17

用于批量转换的 AWS SageMaker 部署

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-10-29 19:29:17

解决方案1
1 已采纳 2020-10-29 19:29:17