Can we host AWS JumpStart Foundation models directly on AWS Inf1 or Inf2 Instances ?

0

As the title says, we can host LLM's and Stable diffusion models from jumpstart directly on SageMaker Inf1 or Inf2 chips ?

I tried doing that with Stable Diffusion 2 Model (i.e from studio notebook of AWS JumpStart Stable Diffusion, selected instance type as one of the AWS Inf chip). The endpoint got hosted as well but later failed at invoke endpoint step.

feita há 9 meses447 visualizações
2 Respostas
0

To deploy a model on Inf1 and Inf2 instances, you need to compile the model using AWS Neuron. In this documentation page you will find the updated list of Supported models for AWS Inferentia2, AWS Inferentia, and also AWS Trainium.

If you want to deploy Stable Diffusion on AWS Inferentia2, please see this blogpost for a full walkthrough.

Hope this helps.

profile pictureAWS
jnavrro
respondido há 9 meses
  • This is not correct. You can deploy some supported models from prebuilt JumpStart images that are precompiled for Trainium and Inferentia.

0

Some of the JumpStart models support Trainium and Inferentia instances. You'll notice this in the model description. Sometimes the model will say "Neuron" in the title, such as with Llama 2 and 3.

You can search for "neuron" in the JumpStart Studio page. I'm doing this now and it shows me 17 models that support this, including Llama3.

Each of these will vary in terms of what modes they support, training, hosting, and evaluation.

Once you've selected the model and the mode, you'll be prompted to select the instance type. For Neuron models this will be only Trainium and Inferentia instances with a variety of sizes.

You can do the same search on our product documentation page here.

To work with JumpStart models in the Python SDK, including for Llama 3, check out the steps here.

Searching for neuron in the JumpStart model hub

profile picture
Emily
respondido há 5 dias

Você não está conectado. Fazer login para postar uma resposta.

Uma boa resposta responde claramente à pergunta, dá feedback construtivo e incentiva o crescimento profissional de quem perguntou.

Diretrizes para responder a perguntas