[BUG] Unable to retrieve attribute for AWS::OpenSearchService::Domain, with error message Internal error occurred, due to throttling

0

Posting https://github.com/aws/aws-cdk/issues/18239 here for increased visibility.

Stacks using attributes of Opensearch domains will randomly fail to deploy. The message shown in Cloudformation is similar to "Unable to retrieve Arn attribute for AWS::OpenSearchService::Domain, with error message Internal error occurred".

The actual cause as per discussion in that issue is that the DescribeDomain calls are being throttled, and Cloudformation is not handling the throttling sensibly.

This is a pernicious bug because of it's unpredictability. In my case, I've deployed my application successfully many times, and have never seen this issue before. I hit this issue whilst upgrading from Elasticsearch 7.10 to Opensearch 1.2; I was making no other changes. I'm stuck unable to rollback cleanly either, as it's not possible to go from Opensearch 1.2 to Elasticsearch 7.10. So I'm having to skip rolling back the cluster, and attempt to roll forward until either it succeeds, or I'm forced to hardcode the domain attributes I should be looking up.

According to the CDK issue, the internal tracking code for this issue is V498467686.

已提问 2 年前115 查看次数
没有答案

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则