简体   繁体   English

如何在 sagemaker 上自动缩放 SKLearn 作业

[英]How to autoscale a SKLearn job on sagemaker

I want to launch a SKLearn job using sagemaker .我想使用sagemaker启动 SKLearn 作业。 The way I do this is as follows:我这样做的方式如下:

from sagemaker.sklearn.estimator import SKLearn

FRAMEWORK_VERSION = '0.23-1' 
script_path = 'main.py'

sklearn = SKLearn(
    entry_point=os.path.join(script_path),
    framework_version=FRAMEWORK_VERSION,
    instance_type='ml.m5.2xlarge',
    source_dir='src',
    output_path='my/output/path',
)

I am not sure if the instance_type that I have chosen is enough (in terms of memory etc) for my application though.我不确定我选择的 instance_type 是否足够(就 memory 等而言)我的应用程序。

Is there a way to "let sagemaker" decide on the instance type?有没有办法“让 sagemaker”决定实例类型?

Or, is there a way to choose an instance_type and if along the way it is about to run out of memory, the sagemaker to automatically scale up?或者,有没有办法选择一个instance_type,如果沿途快要用完memory,sagemaker会自动扩容?

Automatic scale-up feature for Training doesn't exist in SageMaker at this time.目前,SageMaker 中不存在用于训练的自动扩展功能。

On a separate note, for selecting the right instance type for inference, we have an instance recommender service ( https://docs.aws.amazon.com/sagemaker/latest/dg/inference-recommender.html ).另外,为了选择正确的推理实例类型,我们有一个实例推荐服务 ( https://docs.aws.amazon.com/sagemaker/latest/dg/inference-recommender.html )。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 在 Sagemaker 中延迟执行 Sagemaker.sklearn.processing.SKLearnProcessor.run 作业 - Delay in executing Sagemaker.sklearn.processing.SKLearnProcessor.run job, in Sagemaker 如何为 SageMaker 训练作业准备 docker 图像 - How to prepare docker image for SageMaker training job 如何运行 sagemaker 处理和训练工作? - How to run sagemaker processing and training job? 如何在 Sagemaker 脚本模式下恢复训练作业? - How can I resume a training job in Sagemaker script mode? SageMaker:如何手动运行 Model 质量监控作业? - SageMaker: How to manually run a Model Quality Monitor Job? 如何根据预定义的计划运行 AWS Sagemaker Studio 作业 - How to run AWS Sagemaker Studio job based on pre defined schedule 如何将 stepfunction executionId 解析为 SageMaker 批量转换作业名称? - How to parse stepfunction executionId to SageMaker batch transform job name? 如何在 AWS sagemaker 上部署预训练的 sklearn model? (端点停留在创建) - How do I deploy a pre trained sklearn model on AWS sagemaker? (Endpoint stuck on creating) 如何通过自定义推理代码在 sagemaker 管道中运行批量转换作业? - how to run a batch transform job in sagemaker pipeline via custom inference code? sagemaker 批处理作业的最佳设置是什么? - what is a optimal setting for a sagemaker batch job?
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM