If you see "task blocked for more than 122 seconds" errors after upgrading EKS nodes from Amazon Linux 2 (AL2) to Amazon Linux 2023 (AL2023) with EKS 1.31, several differences between AL2 and AL2023 could be responsible.
AL2023 uses a new node initialization process called nodeadm, which follows a YAML configuration schema and differs from the process used in AL2. Node settings that were applied through the AL2 initialization process may therefore not carry over unless they are expressed in the nodeadm configuration, which could change the environment your Java application runs in.
Additionally, AL2023 requires Instance Metadata Service Version 2 (IMDSv2) and sets the default hop count for managed node groups to 1, which prevents containers from reaching node metadata. If your Java application relies on node metadata or credentials obtained from it, those requests may hang or time out, which could contribute to the blocking issue.
For troubleshooting this specific issue:
- Check whether your application needs access to node metadata or credentials. If so, either increase HttpPutResponseHopLimit to 2 in your EC2 launch template or use Amazon EKS Pod Identity for credential access.
- Ensure you're using Amazon VPC CNI add-on version 1.16.2 or later, which is required for AL2023.
- If you're using any third-party add-ons that interact with node metadata, check whether they are affected by the hop count limitation.
- Review your Java application's resource usage and threading model, because AL2023 may handle resource allocation differently from AL2. For example, AL2023 uses cgroup v2 where AL2 uses cgroup v1, and older JVM releases without cgroup v2 support may misread container memory and CPU limits.
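To test the first check above, one option is to probe IMDSv2 from inside the affected pod. The following is a minimal sketch, not an official AWS tool: it assumes Python 3 is available in the container image, and the function names (build_token_request, imds_token_reachable) are illustrative. It sends the standard IMDSv2 token request (an HTTP PUT to the link-local metadata address with a TTL header) and reports whether a token came back.

```python
import urllib.request

# Link-local address of the EC2 Instance Metadata Service token endpoint.
IMDS_TOKEN_URL = "http://169.254.169.254/latest/api/token"
TTL_HEADER = "X-aws-ec2-metadata-token-ttl-seconds"


def build_token_request(ttl_seconds: int = 60) -> urllib.request.Request:
    """Build the IMDSv2 session-token request (an HTTP PUT with a TTL header)."""
    return urllib.request.Request(
        IMDS_TOKEN_URL,
        method="PUT",
        headers={TTL_HEADER: str(ttl_seconds)},
    )


def imds_token_reachable(timeout_seconds: float = 2.0) -> bool:
    """Return True if an IMDSv2 token can be fetched from this network namespace.

    Hypothetical helper for illustration. A timeout or connection error is
    reported as False; it does not by itself prove the hop limit is the cause.
    """
    try:
        request = build_token_request()
        with urllib.request.urlopen(request, timeout=timeout_seconds) as resp:
            return resp.status == 200 and bool(resp.read())
    except OSError:
        # URLError, HTTPError, and socket timeouts are all OSError subclasses.
        return False
```

Calling imds_token_reachable() on the node and again inside the pod gives a rough comparison. If it returns True on the node but False in the pod, the hop limit of 1 is a plausible cause, and raising HttpPutResponseHopLimit to 2 or moving to EKS Pod Identity is worth trying.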
AL2 is approaching end of life, so it is worth resolving these issues on AL2023 rather than rolling back. These configuration adjustments should help resolve the blocked-task errors on AL2023.
Sources
FAQs on Amazon Linux 2023 with Amazon EKS | AWS re:Post
AL2023.7 upgrades from AL2 - Amazon Linux 2023
