ThrottlingException on AWS Bedrock when using meta.llama3-70b-instruct-v1:0

1

Hi,

When I use meta.llama3-70b-instruct-v1:0 i consistently get throttlingExceptions. I am nowhere near the limit (in fact, every request I make gets throttled). I have checked by enabling cloudwatch logs and s3 logs, and there are no requests getting through. if I switch to any other model everything works fine.

Hristo
질문됨 한 달 전288회 조회
3개 답변
0

Hello, ThrottlingException while invoking models in on-demand mode, despite requests being below the documented quota limit can arise because the on-demand mode utilizes a shared capacity pool across multiple customers. Consequently, during periods of high demand when the base model processes a substantial number of requests, throttling may occur even if you have the necessary limits in place.

It's important to note that individual accounts can be throttled below their expected rates due to the shared capacity pool being utilized by all customers during high-demand periods. The internal team is actively working on long-term solutions to expand capacity and address this issue, but a specific timeline is currently unavailable.

To mitigate this issue, you can consider implementing retry mechanisms or exponential backoffs. However, switching to provisioned throughput might be the most effective option, as it provides reserved capacity specifically for your account. This approach ensures consistent performance by avoiding the inherent peaks and valleys of the on-demand mode.

Additionally, you could try using a different AWS region to see if that alleviates the throttling issues.

If further assistance is needed please feel free to reach out to AWS Support.

zeekg
답변함 한 달 전
0

Have had the same issue with Llama 3. Had to pull it from our production application because of this. No other models have been an issue. Happy to find this question after wasting 3 days with the support center. Thank you OP and Zeekg

답변함 한 달 전
0

only issue here is that you cannot provision llama 3.8 capacity as of now. hopefully this gets fixed one way or the other.

Hristo
답변함 한 달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인