Assessing the Stability and Recovery from Potential Throttling of High Throughput Queries on DynamoDB with On-Demand Mode

Question

Hello,

I am currently working on an application where we are leveraging AWS Lambda in conjunction with DynamoDB for real-time data rendering purposes. Specifically, we are utilizing the table.query feature on DynamoDB, set in on-demand mode, while extensively using Global Secondary Indexes (GSI) for our query filtering.

Our system occasionally experiences high throughput situations, and there is a concern regarding the stability of this feature under such stress, especially in terms of maintaining performance consistency. We aim to understand better the behavior of DynamoDB under frequent, high-volume query operations and any potential throttling that could occur despite opting for on-demand capacity, as indicated by certain sources [1].

The critical points we are seeking insight into include:

Throttling Recovery Time: If throttling conditions are encountered, what is the typical recovery time, and how does DynamoDB manage sudden spikes in requests in on-demand mode? Understanding this is vital as it directly correlates with our real-time data rendering requirements.

Stability and Performance: Can DynamoDB sustain occasional high-throughput loads, specifically using the table.query function with GSI? Is the performance hit noticeable, and could it disrupt real-time operations?

Best Practices for Query Optimization: In the context of ensuring minimal latency and maintaining high availability and performance, would using table.scan be a more efficient approach compared to table.query in scenarios of high request volume? Are there recommended strategies for optimizing query patterns, especially when dealing with GSI and high-throughput situations?

Given the real-time nature of our application, the emphasis is on consistent, high-performance read operations. Any suggestions or insights into the architecture design, capacity planning (even in an on-demand setup), or practical experiences you could share would be immensely valuable.

Thank you for your time and assistance.

Best regards,

Ben

[1] "Why are on-demand tables getting throttled in Amazon DynamoDB?" Knowledge Center, AWS.

Answer

> Throttling Recovery Time: If throttling conditions are encountered, what is the typical recovery time, and how does DynamoDB manage sudden spikes in requests in on-demand mode? Understanding this is vital as it directly correlates with our real-time data rendering requirements.

On-demand lets you scale to twice your previous peak instantly. If you require more than twice your previous peak in short duration, you may be throttled. Recovery time can vary depending on your data model and throughput, but DynamoDB will typically take action to scale within minutes.

> Stability and Performance: Can DynamoDB sustain occasional high-throughput loads, specifically using the table.query function with GSI? Is the performance hit noticeable, and could it disrupt real-time operations?

Read and Writes consume separate capacity, so high read requests will not impact any real time operations. Furthermore, as DynamoDB is horizontally scalable, your performance should remain the same under high load conditions.

> Best Practices for Query Optimization: In the context of ensuring minimal latency and maintaining high availability and performance, would using table.scan be a more efficient approach compared to table.query in scenarios of high request volume? Are there recommended strategies for optimizing query patterns, especially when dealing with GSI and high-throughput situations?

Query is a much more efficient request than Scan, as Query targets items based on the partition key, whereas Scan reads all items in the table/index. Optimizations should start with a correct data model, which uses a high cardinality partition key which has well distributed traffic.

> Given the real-time nature of our application, the emphasis is on consistent, high-performance read operations. Any suggestions or insights into the architecture design, capacity planning (even in an on-demand setup), or practical experiences you could share would be immensely valuable.

Ensure you understand how to model correctly on DynamoDB, as that is the most important aspect of getting the performance you require, you can start with our data modelling course:

https://explore.skillbuilder.aws/learn/course/external/view/elearning/17754/amazon-dynamodb-data-modeling-techniques

If you have really high throughput out of the box, you may consider pre-warming your table/indexes so you set a high previous peak which helps you avoid throttling during unprecedented high load;

https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/HowItWorks.ReadWriteCapacityMode.html#HowItWorks.PreWarming

Assessing the Stability and Recovery from Potential Throttling of High Throughput Queries on DynamoDB with On-Demand Mode

Relevant content