cannot Launch instance (kernel) with custom Sagemaker Image

1

The custom image has been used for a long time.

For some reason, when I created new instance by using custom image from sagemaker studio, I keep get "Failed to start kernel" error.

Same image could work fine in the past.

Also I noticed that if failed first, and try with other instance, it could work. Some times if you kept trying on same instance there was a chance worked as well.

I hope sagemaker team aware of this issue.

질문됨 9달 전331회 조회
4개 답변
1

I'm experiencing the same error. My image has also been used for months without any problems. It started yesterday.

Also, cloudwatch logs don't show any errors.

As a temporary fix, you can start an instance with a default kernel (data science 3.0) and then switch to your custom image.

Gonzalo
답변함 9달 전
  • Thanks, this works :)

    But for some instance type (by default, only allow 1 running app per domain), I cannot use this way to switch because after I created one by default kernel, it won't let you create second one with custom image.

    Hope Sagemaker team notice this issue and implement the fix asap.

1

Enter image description here

답변함 9달 전
0
수락된 답변

I have been worked with Sagemaker support and they implemented the changes, the instance should be created correctly with custom images.

during the issue, @Gonzalo's temp solution was working perfectly

답변함 9달 전
0

I also tested running the notebook from "notebook job". Looks like the kernel from job scheduler can start kernel without custom image without error.

답변함 9달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠