使用 AWS re:Post 即表示您同意 AWS re:Post 使用條款

Unable to load trained model in Sagemaker

0

I have trained a few models in sagemaker however I am unable to load them for prediction.

I am picking model details from: Sagemaker > Inference > Models > Container 1 section: Image_uri = value in image model_data = Value in model data location

then passing these values into sagemaker Model function.

When I deploy this model, it gives error: ping health check failed for AllTraffic production variant. This error doesn't come when I train a new model and deploy it.

已提問 2 年前檢視次數 641 次
1 個回答
0

The cause for issues like this are due to a mismatch between the base model between the training and inference endpoints. A solution similar to below would help resolve your issue.

Github repo : https://github.com/marshmellow77/sm-extend-container/blob/main/02_extend_container.ipynb. talks about how to extend the existing Hugging Face DLCs by pulling them from the public ECR and running a simple Dockerfile on top of them that will install the latest available version of transformers.

AWS
已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南