Questions tagged with Amazon SageMaker Model Training
Good day!
**My main purpose:**
Easy way to collect information about different failure scenarios in SageMaker TrainingJob.
**What do I use currently?**
SageMaker SKLearn Estimator (TrainingJobs...
4 answers, 0 votes, 935 views, asked 2 years ago
Hello!
We started using SageMaker JupyterLab to run a few deep learning experiments we previously ran on Google Colab Pro+. The training starts fine and everything seems to work; however, the progress...
1 answer, 0 votes, 1091 views, asked 2 years ago
Hi,
I am working on a Hugging Face example in SageMaker labs, and I got the error below:
"ResourceLimitExceeded: An error occurred (ResourceLimitExceeded) when calling the CreateTrainingJob operation:...
1 answer, 0 votes, 709 views, asked 2 years ago
I ran neural network training with TensorFlow on Studio Lab and got:
```
Epoch 145/4000
1941/1941 - 10s - ... - 10s/epoch - 5ms/step
```
Then I tried to create a training job with script_mode with...
2 answers, 0 votes, 672 views, asked 2 years ago
Currently, we are trying to train an SK-Learn model from a Python script running on a local computer by uploading data to an S3 bucket.
```
from sagemaker.amazon.amazon_estimator import get_image_uri
# container...
```
0 answers, 0 votes, 101 views, asked 2 years ago
I recently tried smddp v1.4.0 on a SageMaker notebook instance (not SageMaker Studio), using an 8-GPU `ml.p3.16xlarge` instance, by directly using `smddp` as the backend in the training scripts. I...
0 answers, 0 votes, 115 views, asked 2 years ago
I was developing a machine learning model in SageMaker, and it takes about 2-3 days to finish executing my code. However, I was automatically logged out of the console, which killed the kernel that was...
1 answer, 0 votes, 2166 views, asked 2 years ago
Hello,
I am able to invoke my endpoint using the following command template:
> aws --profile 'insert_profile_name' sagemaker-runtime invoke-endpoint --endpoint-name 'insert_endpoint_name' --body...
1 answer, 0 votes, 547 views, asked 2 years ago
Stumbled upon this while trying to evaluate my xgboost model:
```
model = pickle.load(open("./data/xgboost-model", "rb"))
UnpicklingError: unpickling stack underflow
```
The model was trained...
1 answer, 0 votes, 942 views, asked 2 years ago
I'm creating a pipeline with multiple steps: one to preprocess a dataset, and another that takes the preprocessed dataset as input to train a BlazingText model for classification.
My first...
0 answers, 0 votes, 87 views, asked 2 years ago
The DeepAR documentation page mentions the option of using the Parquet format to provide datasets to DeepAR.
But the details are extremely terse.
I would like to use DeepAR with a fairly large dataset,...
0 answers, 0 votes, 136 views, asked 2 years ago
I am currently utilizing an ml.c4.2xlarge instance type for a DeepAR use case to run an Automated Model Tuning job. The data consists of 7157 time series with 152 timesteps in the training set and 52...
1 answer, 0 votes, 573 views, asked 2 years ago