
Enhance resiliency and optimize recovery through DR.PET AWS Recovery Framework

6 minute read
Content level: Intermediate

In this post, we explore how organizations can overcome the common challenge of creating and validating effective disaster recovery plans. We introduce Dr.PET, an entitlement for AWS Enterprise Support (ES) customers: a proactive engagement model in which Technical Account Managers provide expert guidance to help customers implement, validate, and continuously improve their disaster recovery strategies.

In today's digital-first world, Disaster Recovery (DR) is a mission-critical component for business continuity. AWS enables customers to build resilient architectures and implement comprehensive DR strategies through various programs. The Dr.PET framework (Diving Resilience: Planning - Execution - Testing) provides a prescriptive approach that helps customers achieve their recovery time objectives (RTO) and recovery point objectives (RPO) while optimizing costs through pay-as-you-go pricing models. Using AWS services like AWS Resilience Hub, AWS Fault Injection Service and AWS Backup, customers can automate DR workflows, maintain business continuity, and meet their compliance requirements at scale.

The Dr.PET Framework is led by Technical Account Managers (TAMs) at AWS for Enterprise Support customers, who serve as trusted advisors throughout the implementation journey. This TAM-led methodology ensures customers develop resilient, cost-effective disaster recovery solutions tailored to their specific business requirements. TAMs bring a unique combination of technical expertise and business acumen to Dr.PET implementations, serving as the bridge between AWS's capabilities and your organization's specific recovery requirements.

Dr.PET Program Overview

Dr.PET is designed exclusively for AWS Enterprise Support customers, providing these organizations with advanced resilience assessment capabilities aligned with their comprehensive support package. TAMs work closely with customers to identify their critical workloads through a structured discovery process. Customers must designate at least one business-critical workload for assessment. This focused approach allows the Dr.PET framework to deliver targeted insights where they matter most to your business operations.

Dr.PET provides a structured approach to ensuring business continuity during disruptions by focusing on five key pillars.

How Dr.PET Works

The Dr.PET Assessment follows a structured three-phase methodology to help customers build resilient disaster recovery solutions on AWS.

Figure: Dr.PET Resilience Improvement Journey

Prioritize Mission-Critical Workloads

Start with what matters most. When implementing disaster recovery on AWS, many builders rush to protect everything equally. This approach increases costs and complexity without delivering proportional value. Instead:

  1. Identify truly mission-critical workloads that directly impact your customers' experience and business continuity
  2. Apply appropriate protection levels based on workload importance
  3. Optimize resources by aligning DR investments with business impact

This customer-obsessed approach ensures you're protecting what matters most while maintaining cost efficiency. Remember, effective DR isn't about equal protection; it's about strategic protection that maximizes resilience where it counts.

What Matters?

  • Which systems directly impact revenue or customer trust? Identify workloads that customers depend on most.
  • Which applications cannot tolerate downtime? Understand where even brief outages create unacceptable friction.
  • Where would data loss be catastrophic? Recognize scenarios where recovery would be impossible or damage irreparable.
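The three questions above can be turned into a rough scoring exercise. The following Python sketch is one hypothetical way to do that: the `Workload` fields, the 0-5 scales, the equal weighting, and the tier thresholds are all illustrative assumptions, not part of the Dr.PET framework, and the workload names are made up.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    name: str
    revenue_impact: int        # 0-5: effect on revenue or customer trust
    downtime_intolerance: int  # 0-5: how poorly the workload tolerates outages
    data_loss_severity: int    # 0-5: how damaging lost data would be

    def score(self) -> int:
        # Equal weighting is an assumption; adjust the weights to your business.
        return (self.revenue_impact
                + self.downtime_intolerance
                + self.data_loss_severity)


def tier(workload: Workload) -> str:
    """Map a score (0-15) to a hypothetical protection tier."""
    s = workload.score()
    if s >= 12:
        return "mission-critical"
    if s >= 7:
        return "important"
    return "standard"


def prioritize(workloads):
    """Return workloads ordered from most to least critical."""
    return sorted(workloads, key=lambda w: w.score(), reverse=True)


# Illustrative inventory; names and ratings are invented for this example.
workloads = [
    Workload("checkout-api", 5, 5, 4),
    Workload("internal-wiki", 1, 1, 2),
    Workload("order-database", 4, 4, 5),
]

for w in prioritize(workloads):
    print(f"{w.name}: score={w.score()}, tier={tier(w)}")
```

A ranking like this is only a starting point for the discovery conversation; the ratings themselves should come from business stakeholders rather than from the engineering team alone.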

Disaster Recovery in AWS Cloud

Disaster Recovery (DR) in cloud computing, particularly with Amazon Web Services (AWS), focuses on ensuring that systems can recover quickly and reliably from various failure events, such as hardware malfunctions, natural disasters, or cyber-attacks, with minimal impact on end-users. AWS offers a wide range of services and architectural patterns that support DR planning. These include:

  • Availability Zones: One or more discrete data centers with redundant power, networking, and connectivity within each AWS Region, providing isolation and redundancy
  • Multi-region deployments: Architectures that distribute applications and data across multiple geographic regions for increased resilience
  • Auto Scaling Groups: Automatically adjust capacity to maintain steady, predictable performance at the lowest possible cost
  • AWS Backup: A fully managed backup service that simplifies the process of backing up data across AWS services
  • AWS Elastic Disaster Recovery: A service that enables rapid recovery of on-premises and cloud-based applications using affordable storage, minimal compute, and point-in-time recovery

Effective DR starts with defining measurable objectives that align technical implementation with business requirements. Two metrics matter most:

Recovery Time Objective (RTO)

RTO defines how quickly your system must be operational after a failure. It answers the question: "How long can this workload be unavailable before causing significant business impact?" A shorter RTO requires faster failover and usually brings higher costs (for example, warm standby environments and automation).
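As a minimal sketch of how an RTO can be checked after a DR drill, the Python snippet below compares a measured recovery time against the objective. The function names, the one-hour RTO, and the timestamps are hypothetical values chosen for illustration.

```python
from datetime import datetime, timedelta


def recovery_time(outage_start: datetime, service_restored: datetime) -> timedelta:
    """Elapsed time between the start of an outage and full restoration."""
    if service_restored < outage_start:
        raise ValueError("service_restored must not precede outage_start")
    return service_restored - outage_start


def meets_rto(actual: timedelta, rto: timedelta) -> bool:
    """True when the measured recovery time is within the objective."""
    return actual <= rto


# Hypothetical drill: failover began at 02:00 and service was back at 02:45.
rto = timedelta(hours=1)
actual = recovery_time(datetime(2025, 1, 15, 2, 0), datetime(2025, 1, 15, 2, 45))
print(f"Recovered in {actual}; RTO met: {meets_rto(actual, rto)}")
```

Recording this comparison for every test run makes it easier to see whether recovery times are trending toward or away from the objective.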

Recovery Point Objective (RPO)

RPO defines how much data your business can afford to lose during a disruption, measured as a window of time. It answers the question: "How much recent data can we recreate or live without?" A tighter RPO requires more frequent data replication or backups.
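The link between backup frequency and RPO can be illustrated with simple arithmetic. The sketch below assumes a periodic backup model in which each recovery point reflects the data as of the moment the backup started, and a backup that is still running cannot be used for recovery. Under those assumptions, the worst case is a failure just before a backup completes. The function names and the sample intervals are hypothetical; continuous replication would need a different model.

```python
from datetime import timedelta


def worst_case_data_loss(backup_interval: timedelta,
                         backup_duration: timedelta = timedelta(0)) -> timedelta:
    """Largest possible gap between a failure and the last usable recovery point.

    Assumes backups start every `backup_interval`, take `backup_duration`
    to finish, and are unusable until they complete.
    """
    return backup_interval + backup_duration


def meets_rpo(backup_interval: timedelta,
              backup_duration: timedelta,
              rpo: timedelta) -> bool:
    """True when the worst-case data loss stays within the objective."""
    return worst_case_data_loss(backup_interval, backup_duration) <= rpo


rpo = timedelta(hours=1)
duration = timedelta(minutes=10)

# Hourly backups that take 10 minutes can lose up to 70 minutes of data.
print(meets_rpo(timedelta(hours=1), duration, rpo))     # False
# Backing up every 45 minutes brings the worst case down to 55 minutes.
print(meets_rpo(timedelta(minutes=45), duration, rpo))  # True
```

The point of the example is that a backup interval equal to the RPO is not sufficient on its own once backup duration is taken into account.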

If you’re an AWS Enterprise Support customer, your TAM and Solutions Architect can help you define, validate, and test these objectives using tools such as AWS Resilience Hub and the AWS Well-Architected Framework.

How to Initiate Dr.PET

If you're looking to strengthen your disaster recovery strategy, connect with your Technical Account Manager (TAM) to implement Dr.PET on AWS. The framework can substantially improve the resilience of your AWS environment and is available at no additional cost as part of your Enterprise Support entitlement. Contact your AWS account team today to schedule a Dr.PET assessment and implementation planning session.

Conclusion

AWS provides a robust suite of tools and services for implementing Disaster Recovery (DR) strategies that minimize downtime, protect data integrity, and support business continuity. Whether you opt for a simple backup and restore approach or a more sophisticated multi-region active-active solution, AWS offers the flexibility, scalability, and security to meet your specific recovery needs. By using AWS's global infrastructure, automation capabilities, and cost-effective solutions, businesses can substantially reduce the risk of data loss and service disruption. As you begin your DR journey on AWS, remember that the key is to continuously assess and refine your recovery plans so that your organization stays prepared for the unexpected. Build a resilient disaster recovery strategy that enables your business to recover quickly, stay agile, and maintain your customers' trust.


About the authors


Karthikeyan KM is a Senior Technical Account Manager supporting Enterprise Users at AWS. With over 20 years of IT experience, he focuses on designing secure, reliable, and scalable solutions while ensuring operational excellence. He is passionate about helping customers accelerate their digital transformation journeys through efficient cloud architectures that align with their business objectives.


Bhanusree (Bhanu) Vadlamudi is a Technical Account Manager (TAM) at Amazon who is passionate about building trust-based relationships with customers and understanding their technical needs. Bhanu partners closely with customers to provide technical guidance, architectural recommendations, and best practices that enable them to achieve their goals on the AWS platform. Bhanu enjoys spending time with family, going for hikes, and traveling.