Skip to content

Designing highly resilient architectures in AWS

Language: English
Learn about Resiliency and operational excellence
4
Do you have critical workloads running in AWS? Review these handpicked resources to find ways to ensure your applications are resilient to failures.

What is resilience?

Important concepts to understand resilience and the responsibility model.

Disaster Recovery (DR) vs High Availability (HA)

What is the difference between Disaster Recovery (DR) and High Availability (HA)?

Multi-AZ vs Multi-Region

Designing an application across multiple Availability Zones (AZ) or regions

Control plane and data plane

Learn about the control plane and data plane concepts

Video: AWS re:Invent 2018: Close Loops & Opening Minds: How to Take Control of Systems, Big & Small

Come hear about the techniques we use to build stable and scalable control planes at Amazon.

Video: AWS re:Invent 2019: Beyond five 9s: Lessons from our highest available data planes

Come hear lessons from how AWS has built and architected Amazon Route 53 and the AWS authentication system, designed to survive cataclysmic failures, enormous load increases, and more.

Architecture for Compartmentalization

What is compartmentalization and do you need it?

Observability

Detection is key in ensuring your applications are resilient, but is only effective if you're monitoring the right metrics, logs, and traces — key data points in observability.

Chaos Engineering

Continuous testing is imperative in understanding how your system will react to unknowns.

Solutions

AWS Services available to help you build resilient applications.