Unanswered Questions tagged with Amazon SageMaker
I have set up a few steps in a SageMaker pipeline to process and train data. But in the event that I need a custom script for inference, I want to be able to run just the model re-packaging step. Is it...
0 answers, 0 votes, 106 views, asked a year ago
I have a trained YOLOv5 model, exported to TensorFlow, that I'm trying to compile and deploy to my device as a Greengrass Neo edge component.
The device configuration is:
device: NVIDIA Jetson AGX...
0 answers, 0 votes, 92 views, asked a year ago
Hello,
Regardless of the instance type, I can't launch a TensorFlow 2.11.0 CPU/GPU kernel in the Frankfurt data center. I get this error:
Starting notebook kernel...
Had error starting kernel 1...
0 answers, 1 vote, 154 views, asked a year ago
Hi all,
I set up a SageMaker Ground Truth labelling task. All the labelled objects end up with a "Failed" status, though it is possible to retrieve the labels from S3. Eventually, nothing apart from the job...
0 answers, 0 votes, 142 views, asked a year ago
I'm trying to label a 3D point cloud using the Ground Truth 3D Point Cloud labeling tool. The image is an immersive 3D object. Therefore, in order to make the UI usable, I need to be able to rotate...
0 answers, 0 votes, 35 views, asked a year ago
NVMLError_FunctionNotFound: I was trying to deploy a PyTorch model in an ml.g4dn.xlarge instance...
I was trying to deploy a PyTorch model on an 'ml.g4dn.xlarge' instance for real-time inference.
framework_version='1.13.1'
py_version='py39'
However, I kept getting...
0 answers, 0 votes, 140 views, asked a year ago
Our SageMaker Studio service is deeply broken in one of our AWS accounts. Our original domain hit an "Update_Failed" status when we attempted to attach a new custom Docker image....
0 answers, 0 votes, 112 views, asked a year ago
I am trying to set up a Random Cut Forest model with a Data Quality job attached.
I managed to train and deploy the model with the "data_capture" feature enabled.
```python
# Training
rcf =...
```
0 answers, 0 votes, 189 views, asked a year ago
Hi MLOps Gurus,
I'd like some guidance on the situation below.
I am currently working on a SageMaker project where I'm using the MLOps template for model building, training, and deployment. I...
0 answers, 0 votes, 102 views, asked a year ago
Hi there, nice to meet you all,
I've been trying to train an object detection model (using built-in algorithms, TensorFlow) following the [jumpstart...
0 answers, 0 votes, 140 views, asked a year ago
Hi there,
I'm having trouble with autoscaling for a SageMaker async endpoint. In particular, I have 3 CloudWatch alarms that trigger the scaling policy:
- ApproximateBacklogSizePerInstance < 4.5...
0 answers, 0 votes, 169 views, asked a year ago
Hi all,
I have a model that I've fine-tuned from the TF MobileResnet model available in SageMaker. The model is working well and returns correct inference values when I host it using the SageMaker...
0 answers, 0 votes, 83 views, asked a year ago