The issue you're experiencing, with RDS connections dropping and then recovering on their own, could be related to several factors. Here are some insights and troubleshooting steps to consider:
- Network connectivity: There might have been a temporary network issue in the Ohio region affecting multiple EC2 instances and RDS databases. This could explain why all your sites went down simultaneously.
- RDS maintenance or updates: AWS occasionally performs maintenance on RDS instances, which can cause brief interruptions. Check your RDS event logs to see if there were any scheduled maintenance activities during that time.
- Resource constraints: If your RDS instances were experiencing high load or resource constraints, it could lead to connection drops. DevOps Guru for RDS, if enabled, can help detect performance issues like high DB load that might degrade service quality.
- SSL/TLS certificate issues: Although less likely if it affected multiple RDS instances, expired SSL/TLS certificates can cause connection problems. Ensure your certificates are up to date.
- Proxy issues: If you're using RDS Proxy, there could be temporary connection problems between the proxy and the DB instances.
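For the certificate point above, a quick way to see how close a certificate is to expiring is to convert its "not after" timestamp into days remaining. This is only a minimal sketch: `days_until_expiry` is a hypothetical helper name, and it assumes you already have the expiry string in the format Python's `ssl` module reports (for example from `SSLSocket.getpeercert()["notAfter"]`).

```python
import ssl
from datetime import datetime, timezone
from typing import Optional


def days_until_expiry(not_after: str, now: Optional[datetime] = None) -> float:
    """Return the number of days until a certificate expires.

    `not_after` is assumed to look like "Jan  5 09:34:43 2018 GMT",
    the format used by ssl.SSLSocket.getpeercert()["notAfter"].
    A negative result means the certificate has already expired.
    """
    expires_at = ssl.cert_time_to_seconds(not_after)  # seconds since the epoch (UTC)
    current = (now or datetime.now(timezone.utc)).timestamp()
    return (expires_at - current) / 86400


# Hypothetical expiry string, shown only to illustrate the call.
remaining = days_until_expiry("Jan  1 00:00:00 2030 GMT")
if remaining < 30:
    print(f"Certificate expires in {remaining:.0f} days; plan a rotation.")
```

The 30-day threshold is an arbitrary illustration; pick whatever lead time suits your rotation process.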
To troubleshoot and prevent future occurrences:
- Check the AWS Health Dashboard for any reported issues in the Ohio region during that time.
- Review your RDS and EC2 CloudWatch metrics for any anomalies around the time of the incident.
- Enable Performance Insights on your RDS instances if not already done. This will provide more detailed analysis if DevOps Guru detects any issues in the future.
- Implement retry logic in your application to handle temporary connection drops gracefully.
- Consider using RDS Proxy to manage database connections more efficiently and improve application resilience.
- Regularly update your SSL/TLS certificates and ensure they're not near expiration.
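As an illustration of the retry-logic suggestion, here is a minimal sketch of retrying an operation with exponential backoff and jitter. The names `call_with_retry` and `operation` are hypothetical, and the default retryable exception types are an assumption: in a real application you would pass the transient connection errors raised by your own database driver.

```python
import random
import time


def call_with_retry(
    operation,
    retryable=(ConnectionError, TimeoutError),  # assumption: swap in your driver's errors
    max_attempts=5,
    base_delay=0.5,
    max_delay=8.0,
    sleep=time.sleep,
):
    """Call `operation()` and retry on transient errors.

    The wait before each retry is a random value between 0 and an
    exponentially growing cap (base_delay, 2x, 4x, ... up to max_delay).
    The last failure is re-raised once max_attempts is reached.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except retryable:
            if attempt == max_attempts:
                raise
            cap = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(random.uniform(0, cap))


# Usage sketch: wrap whatever opens the connection or runs the query.
# `connect_to_database` is a placeholder for your own function.
# connection = call_with_retry(connect_to_database)
```

The random jitter keeps many clients from reconnecting at the same instant after a brief outage, which could otherwise add load just as the database recovers.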
If the issue persists or recurs, you may want to contact AWS support for a more in-depth investigation of your specific setup and the root cause of the connection drops.
Sources
How DevOps Guru for RDS works - Amazon DevOps Guru
Troubleshooting for RDS Proxy - Amazon Relational Database Service
Troubleshoot Amazon RDS expired SSL/TLS certificate connection issues | AWS re:Post
Troubleshooting applications on Amazon RDS - Amazon Relational Database Service
In terms of what's available from AWS Health, what date did this occur?
The Health Dashboard, either the Service (public) view or your account-specific events (when logged in to the console), could give you an indication of service events affecting your resources, for example if EC2, RDS, or another related service posted about an ongoing issue at that time.