Help troubleshooting ECS Cluster update; appears to fail without error msg

0

I've been running a tast (webserver) on ECS for quite a while, through 585 revisions. Yesterday, I followed my usual procedure to update the cluster: a.) build a new container, b.) push to ECR, c.) create new revision in ECS console, d.) update cluster to new revision 586, e.) watch for new log stream to appear in CloudWatch to verify startup and check for errors. However, in this case no log stream appeared. Backing up to the ECS console, the task running the new revision had disappeared. I rebuilt and tested the container locally, re-pushed a new version to ECR, created a new task revision, and updated again, with the same result. The new task simply disappears from the Cluster->Tasks tab in the ECR console. No log stream is created for the task. The Cluster->Tasks->Logs tab only shows output from the last running task, Revision 585. The Cluster->Service->Events log shows the 'Start Task' event but no errors or terminations. No ECS Fargate instance is created, so nothing to exec into. How can I find out what's broken?

1 Answer
1

Look at stopped tasks. Usually it will indicate issues here at the top or reason why it failed.

profile picture
EXPERT
answered 7 months ago
  • Thanks Gary Mclean! Found the error message as you said via ECS-> Cluster-> Tasks with the filter for Stopped tasks. In this case it shows CannotPullContainerError: tcp: lookup 445523203977.dkr.ecr.us-west-2.amazonaws.co25m on 10.0.0.2:53: no such host. The Task indicates an ENI: eni-0976fd0bd24381f1d. Looking at that raises an error that this ENI does not exist. Not sure what ENI let a Fargate Task pull a container from my ECR registry. Not sure if I should try to fix this or recreate a new Task Definition.

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions