SageMaker endpoint stuck in "Updating" status

0

I have a SageMaker inference endpoint that I want to delete, but I'm unable to delete it because it's been stuck in the "Updating" status for weeks at this point.

When I've tried deleting it, I get an error message that says I can't delete an endpoint that's updating: Cannot update in-progress endpoint

I've tried the following:

  • Updating the endpoint with a new endpoint configuration to get it out of the Updating status. Didn't work--was unable to complete the update.
  • Deleting the endpoint's original endpoint configuration. Didn't work--endpoint is still stuck and unable to be deleted.

Does anyone have any ideas of how this happened and what I can do to force delete the endpoint?

AWS
질문됨 한 달 전108회 조회
1개 답변
0

Hi, Here are a few suggestions that may help:

1.- Retry Deleting the Endpoint: Even though the endpoint is in the "Updating" status, you can still try to delete it. The DeleteEndpoint API operation should be able to delete the endpoint, even if it's in an unusual state. Try deleting the endpoint again and see if it's successful this time.

2.- Delete the Endpoint Configuration: If you're still unable to delete the endpoint, try deleting the associated endpoint configuration first. The DeleteEndpointConfig API operation allows you to delete the endpoint configuration, which may then allow you to delete the stuck endpoint. https://docs.aws.amazon.com/sagemaker/latest/dg/realtime-endpoints-delete-resources.html

3.- Use the AWS CLI or SDK: You can try deleting the endpoint and configuration using the AWS CLI or an AWS SDK, such as the AWS SDK for Python (Boto3). This may provide more detailed error messages that could help identify the root cause.

Delete the endpoint using the AWS CLI

aws sagemaker delete-endpoint --endpoint-name <endpoint-name>

Delete the endpoint configuration using the AWS CLI

aws sagemaker describe-endpoint-config --endpoint-config-name <endpoint-config-name>

4.- Check for Dependent Resources: Ensure that there are no other resources, such as models or other configurations, that are still dependent on the endpoint you're trying to delete. These dependencies may be preventing the deletion from completing successfully. Example command to delete model via AWS CLI aws sagemaker describe-endpoint-config --endpoint-config-name <endpoint-config-name> https://docs.aws.amazon.com/sagemaker/latest/dg/realtime-endpoints-delete-resources.html

I will recommend to try again, even point 1 and 2, if any of this steps works please contact AWS Support. They can investigate the issue further.

profile pictureAWS
Arifc
답변함 한 달 전
AWS
지원 엔지니어
검토됨 한 달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠