If the real time inference in sagemaker does not return output in 1 min , how many times does it retries? and how can we stop those retry ?

0

I hosted one large model in Sagemaker using real-time inference but it is not giving any error in case of timeout and retrying by itself after 1 min 2 times. Also how much size is considered to be too large for sagemaker models ?

sparsh
asked 7 months ago60 views
No Answers

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions