SageMaker issue: cannot start kernel

0

Hello, I have a problem with SageMaker: I tried to start kernel with ml.g5.12xlarge instance, but it failed with InternalFailure error. After a couple of unsuccessful attempts I started to get another error about account-level service limit. My limit is 1, and it says I already have 1 instance running. However, I do not see any running instances in the list of running instances. So now I am stuck. Could someone from AWS technical support help me please?

InternalFailure error

Failed to start kernel Failed to launch app [pytorch-1-13-cpu-py-ml-g5-12xlarge-9b5704dda36ab08eb9a29af41aed]. ResourceLimitExceeded: The account-level service limit 'Studio KernelGateway Apps running on ml.g5.12xlarge instance' is 1 Apps, with current utilization of 1 Apps and a request delta of 1 Apps. Please contact AWS support to request an increase for this limit. (Context: RequestId: 03518865-dde3-493e-8751-b631cf7e80ea, TimeStamp: 1682699325.895717, Date: Fri Apr 28 16:28:45 2023)

irina
已提問 1 年前檢視次數 341 次
1 個回答
0

Hi - I would suggest to check on endpoint and any notebook instance types in your account.

You can then always request a quota increase using these steps

profile pictureAWS
專家
已回答 1 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南