Savings Plans Applicability to ML instances

0

ML instances are not specifically listed under the available instance types for Savings Plans: https://aws.amazon.com/savingsplans/pricing/ But there is nothing that explicitly excludes them.

Are ML instances (https://aws.amazon.com/sagemaker/pricing/instance-types/) covered under the compute or instance savings plans?

2 réponses
0
Réponse acceptée

Savings plans only apply to EC2 (compute and EC2 instance savings plans) and Fargate instances (compute savings plans). They do not apply to other services, such as SageMaker.

For cost optimization, SageMaker supports spot instances for training jobs, and for the hosting of real-time inference endpoints, look at:

  • right-sizing and auto-scaling the endpoint
  • Using elastic inference or inf1 instances (support for inf1 in SageMaker is on the roadmap)
  • Compiling models with Neo to optimize model performance (for rightsizing)
  • Using inference pipelines instead of multiple model endpoints
  • Hosting in multi-model endpoints
  • Hosting in self-managed compute, such as EC2 with RI, Lambda, or Fargate/spot.

Elastic inference, Neo, inference pipelines and multi-model endpoints work well in some, but not all use-cases, depending on use case, model framework, complexity, size, etc. so it will require some investigation and experimentation to figure out of they’re right for you, but they can help reduce inference cost significantly.

répondu il y a 4 ans
0

Since the accepted answer was posted 2 years ago, we've actually launched Savings Plans for SageMaker already, so for more information check out this page: https://aws.amazon.com/savingsplans/ml-pricing/

profile pictureAWS
EXPERT
répondu il y a 2 ans
profile picture
EXPERT
vérifié il y a 2 mois

Vous n'êtes pas connecté. Se connecter pour publier une réponse.

Une bonne réponse répond clairement à la question, contient des commentaires constructifs et encourage le développement professionnel de la personne qui pose la question.

Instructions pour répondre aux questions