How to use Neuron SDK

0

I am new to AWS Neuron SDK and the documentation seems confusing to me. There is no direct guide on how to install the SDK and use it to compile models. The examples are outdated and the installation process is not clear.

I want to compile my models for Inferential on a CICD pipeline and deploy it. The instructions all suggest running an inf instance for compilation. I followed this to install the Inferentia on my local environment. Here is what I have done:

python -m pip config set global.extra-index-url https://pip.repos.neuron.amazonaws.com
python -m pip install tensorflow-neuron[cc] "protobuf"
python -m pip install tensorboard-plugin-neuron

Then I looked at this example. The simpale_save has been deprecated. I could not fix it by sing the save method either.

Other examples have like this is hard to follow. I did not understand why the compilation process is so complicated in this example.

Is there any other documentations?

  • Hi Mosi. It would help to better understand what you are trying to accomplish. You mentioned CI/CD – are you trying to continuously develop a model over time and automate its deployment for testing or automate the deployment of an existing model based on incoming inference requests, or something else?

    I suggest dividing the work into two steps. First, manually getting a Neuron environment and instance properly set up such that you can successfully run the sample apps. The existing Neuron documentation should be helpful as the majority of the Neuron documentation relates to building a specific model, and then deploying the resulting artifact (NEFF file) under a long-lived EC2 instance to handle inference traffic.

    Second, tackle the CI/CD steps needed to automate the instance setup and model deployment. The Neuron documentation currently doesn’t prescribe model deployment so this would require some design and docs beyond what Neuron offers today. Some customer find that the automation offered by Sagemaker helps with this process as Sagemaker supports inf1 instances.

Mosi
질문됨 5달 전289회 조회
1개 답변
0

It looks like you tried to run the inf2 installation with inf1 example code. These run different versions of the tensorflow integration. Please take a look at the inf2 tutorials e.g. https://awsdocs-neuron.readthedocs-hosted.com/en/latest/src/examples/tensorflow/tensorflow-neuronx/tfneuronx-roberta-base-tutorial.html with inf2.

The correct installation procedures are listed for inf1, linked from the tutorial.

AWS
답변함 5달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠