Questions tagged with Amazon SageMaker Deployment

SageMaker Deployment Issues: TypeError: Not A String

I'm trying to launch codegen25-7b-multi on SageMaker and hitting issues on 2xlarge, 8xlarge, and 12xlarge instances. All throw the same errors: Error #1...

Tags: Amazon SageMaker, Amazon SageMaker Deployment

260 views, asked 9 months ago by texnoob

CUDA out of memory - Starcoder

I'm running into issues deploying Starcoder on SageMaker. I'm getting the following errors in CloudWatch, even with the instance type ml.g5.8xlarge. Error 1: Error: ShardCannotStart ...

Accepted Answer

Tags: Amazon SageMaker, Amazon SageMaker Deployment

367 views, asked 10 months ago by texnoob

Deploy LLM serverlessly on SageMaker

Is it possible (and efficient) to deploy an LLM serverlessly using SageMaker? I'm concerned about the performance and costs involved. The ML application doesn't receive a lot of requests.

Tags: Amazon SageMaker, Amazon SageMaker Deployment

2020 views, asked 10 months ago by Xuan Nguyen

Deploy HuggingFace pretrained model (multiple bin files) to AWS SageMaker

I have been trying to deploy this HuggingFace model ( https://huggingface.co/bigcode/starcoderplus/tree/main ) to AWS SageMaker, without success. The error message from CloudWatch is "No safetensors...

Tags: Amazon SageMaker, Amazon SageMaker Deployment

934 views, asked 10 months ago by Jeremy

Issue with AWS Signature Calculation

I am encountering an issue while trying to calculate the AWS Signature for my requests. I have been following the AWS documentation and various examples, but I keep getting the following error: "The...

Tags: Amazon SageMaker, Amazon SageMaker Deployment

780 views, asked a year ago by rePost-User-7403195

GPU instance support for models with 11B parameters

What type of instances can support models with 11B parameters? I need to use this model for inference jobs.

Tags: Amazon SageMaker Deployment

160 views, asked a year ago by Sandeep Agarwal

Hyperparameter evaluation

I need some clarification about hyperparameters: a) ways to evaluate which hyperparameters give the best result and how they are linked to the model; b) ways to version-control parameters for training jobs.

Accepted Answer

Tags: Amazon SageMaker Deployment

186 views, asked a year ago by Sandeep Agarwal

In SageMaker, how do I get the value from an HTTP header?

I have a machine learning classification model that was trained outside of SageMaker. The model is in Scikit-learn format. To run this model, the preprocessing step requires the binary content of a...

Tags: Amazon SageMaker Deployment

426 views, asked a year ago by Ken

SageMaker Autoscaling Delay

I am setting up autoscaling for a real-time inference endpoint in SageMaker. I set up a load test using Locust, and by setting relatively high numbers (i.e., 100 users, with 10 users spawned per second)...

Accepted Answer

Tags: AWS Auto Scaling, Amazon SageMaker Deployment

705 views, asked a year ago by seicaratteri

Deploy ML time series models effectively

Hi team! I need to deploy a large number of machine learning models (time series models) and I'm looking for an effective way to do it. In detail, the problem is to build a platform capable of serving many time...

Accepted Answer

Tags: Amazon SageMaker, Machine Learning & AI, Amazon SageMaker Deployment

607 views, asked a year ago by Quan Dang

Not able to create an endpoint

I am able to train and tune the model, but at deployment time the endpoint is not created and fails after some time. The error is "FileNotFoundError: [Errno 2] No...

Tags: Amazon SageMaker, Amazon SageMaker Studio Lab, Amazon SageMaker Deployment

235 views, asked a year ago by rePost-User-1788057

Async inference Docker restarts after less than 20 minutes, no helpful logs found

I have async inference on SageMaker, with BYOC. A job may take 20 minutes or more, and I have already set InvocationTimeoutSeconds to 3600 seconds. The problem is, when I start a new...

Accepted Answer

Tags: Amazon SageMaker, Amazon SageMaker Deployment

262 views, asked a year ago by rePost-User-9519218
