How should I delete an AFI that is stuck in 'pending'?


I created several AFIs at the end of August, and they are all still stuck in 'pending'. The related thread is https://repost.aws/questions/QU-CsdCyAqTxedKYkG9PuwtA/pending-create-fpga-images-using-aws-ec-2

When I tried to delete the stuck AFIs, the commands were rejected with OperationNotPermitted. How can I delete the AFIs regardless of their state?

Thank you.

Asked 2 years ago, 269 views
6 answers

Dear Chen, I have been unable to delete those AFIs for 3 weeks now. If an AWS administrator can remove them manually, I'd like to have all of them deleted.

The log below is the output of aws ec2 describe-fpga-images --filter "Name=state,Values=pending"

Thank you

Steven

{
    "FpgaImages": [
        {
            "FpgaImageId": "afi-0b1e36b58c43fb122",
            "FpgaImageGlobalId": "agfi-0742d5f554e5eb457",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-31T00:29:06+00:00",
            "UpdateTime": "2022-08-31T00:29:06+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0b1cd25dd8300762b",
            "FpgaImageGlobalId": "agfi-08c81a0873387fad7",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-30T04:38:43+00:00",
            "UpdateTime": "2022-08-30T04:38:43+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-098d0f359d2dfecbf",
            "FpgaImageGlobalId": "agfi-0cc8380b5ad01cbd0",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-28T04:00:02+00:00",
            "UpdateTime": "2022-08-28T04:00:02+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0ffea842aa12ac079",
            "FpgaImageGlobalId": "agfi-004c7b66611eee27b",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-24T23:06:33+00:00",
            "UpdateTime": "2022-08-24T23:06:33+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-07526d4edb613f3fe",
            "FpgaImageGlobalId": "agfi-02c6fd68a4bb8c681",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T08:09:01+00:00",
            "UpdateTime": "2022-08-25T08:09:01+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0d1b75d427423d770",
            "FpgaImageGlobalId": "agfi-07d807d4721e4acd8",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-29T05:21:34+00:00",
            "UpdateTime": "2022-08-29T05:21:34+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-06d840b7c87a67081",
            "FpgaImageGlobalId": "agfi-02a72d5444f21c979",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T06:12:51+00:00",
            "UpdateTime": "2022-08-25T06:12:51+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0f4bf08ce52eec30d",
            "FpgaImageGlobalId": "agfi-0a4f156256f299198",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T07:43:01+00:00",
            "UpdateTime": "2022-08-25T07:43:01+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        }
    ]
}
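A listing like the one above can be condensed into AFI IDs and the number of days each has been pending. The following is a minimal sketch, assuming the JSON printed by describe-fpga-images has been captured as a string; the helper name `pending_afis` and the reference time `now` are hypothetical, and the sample keeps only two entries from the log, trimmed to the fields the code reads.

```python
import json
from datetime import datetime, timezone


def pending_afis(describe_output, now):
    """Return (FpgaImageId, whole days since CreateTime) pairs, oldest first."""
    images = json.loads(describe_output)["FpgaImages"]
    pending = [img for img in images if img["State"]["Code"] == "pending"]
    # CreateTime is an ISO 8601 string with a UTC offset, e.g. 2022-08-31T00:29:06+00:00
    pending.sort(key=lambda img: datetime.fromisoformat(img["CreateTime"]))
    return [
        (img["FpgaImageId"], (now - datetime.fromisoformat(img["CreateTime"])).days)
        for img in pending
    ]


# Two entries taken from the log above, reduced to the fields used here.
sample = """
{
    "FpgaImages": [
        {"FpgaImageId": "afi-0b1e36b58c43fb122",
         "State": {"Code": "pending"},
         "CreateTime": "2022-08-31T00:29:06+00:00"},
        {"FpgaImageId": "afi-0ffea842aa12ac079",
         "State": {"Code": "pending"},
         "CreateTime": "2022-08-24T23:06:33+00:00"}
    ]
}
"""

# Hypothetical "current" time, chosen only for illustration.
now = datetime(2022, 9, 21, tzinfo=timezone.utc)
for afi_id, days in pending_afis(sample, now):
    print(afi_id, days)
```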
Answered 2 years ago
  • Hi Steven,

    Thank you for providing the information. I'll pass it on to the development team, and we'll find out what is causing the AFI creation to stay pending and how to resolve it for you. I'll keep you posted as soon as I have a solution.

    Thanks,

    Chen


Hi Steven,

Could you please let us know which region these pending AFIs were created in?

Thanks

AWS
Answered 2 years ago
  • Hi Chen. I created those AFIs in N. Virginia (us-east-1). Thanks.


Hi Steven,

Can you please check the current state of those pending AFIs? They should be unblocked now, and you can go ahead and delete them if needed. Please let me know if there is any problem.
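One way to check the current state is to summarise a parsed describe-fpga-images response by state code. This is only a sketch: the helper names `state_summary` and `no_longer_pending` are hypothetical, and the states in the sample response are made up for illustration (only the AFI IDs come from this thread).

```python
from collections import Counter


def state_summary(response):
    """Count AFIs per state code in a parsed describe-fpga-images response."""
    return Counter(image["State"]["Code"] for image in response["FpgaImages"])


def no_longer_pending(response):
    """Return the IDs of AFIs whose state code is anything other than 'pending'."""
    return [
        image["FpgaImageId"]
        for image in response["FpgaImages"]
        if image["State"]["Code"] != "pending"
    ]


# Hypothetical response: the IDs appear in the thread, the states are invented.
response = {
    "FpgaImages": [
        {"FpgaImageId": "afi-0b1e36b58c43fb122", "State": {"Code": "failed"}},
        {"FpgaImageId": "afi-0b1cd25dd8300762b", "State": {"Code": "failed"}},
        {"FpgaImageId": "afi-098d0f359d2dfecbf", "State": {"Code": "pending"}},
    ]
}

print(dict(state_summary(response)))
print(no_longer_pending(response))
```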

Thanks,

Chen

AWS
Answered 2 years ago
  • Hi Chen. I was able to delete them all. Thank you. By the way, could you please tell me why those AFIs were stuck in pending for so long? My assumption is that if the xclbin file is too large (about 1 GB) for the implementation to proceed on the AWS server, the process gets stuck. Am I right?

    Thank you. Steven


Hi,

Please refer to https://github.com/aws/aws-fpga/blob/master/hdk/docs/delete_fpga_image.md for detailed instructions on deleting an AFI. It sounds like the IAM policies might not be set up properly, so you don't have permission to delete those AFIs. In addition, please note that shared AFIs cannot be deleted. I'm not sure if this applies here, but it might be worth double-checking as well. Please let me know if you still have trouble deleting the AFIs.
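As a rough illustration of the deletion step, the sketch below builds one `aws ec2 delete-fpga-image` command line per AFI ID. The helper name `delete_commands` is hypothetical, and the default region is an assumption based on the region mentioned in this thread. With `--dry-run`, the call only checks whether the caller has the required permissions and does not delete anything, which can help separate an IAM problem from other causes.

```python
import shlex


def delete_commands(afi_ids, region="us-east-1", dry_run=True):
    """Build one `aws ec2 delete-fpga-image` command line per AFI ID."""
    commands = []
    for afi_id in afi_ids:
        args = [
            "aws", "ec2", "delete-fpga-image",
            "--fpga-image-id", afi_id,
            "--region", region,
        ]
        if dry_run:
            # Permission check only; no AFI is deleted while this flag is set.
            args.append("--dry-run")
        commands.append(shlex.join(args))
    return commands


# IDs taken from the log posted in this thread.
for command in delete_commands(["afi-0b1e36b58c43fb122", "afi-0b1cd25dd8300762b"]):
    print(command)
```

Once the dry run behaves as expected, the same helper can be called with `dry_run=False` to produce the real delete commands.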

Thanks,

Chen

AWS
Answered 2 years ago

Hi Steven,

We don't believe the large xclbin file size caused it; a large xclbin file should be fine. Please keep us posted if you see this happen again.

Thanks,

Chen

AWS
Answered 2 years ago

Hi Chen.

In the end, I created a new xclbin and tried to make a new awsxclbin from it, and the awsxclbin's state has also been 'pending' for 9 hours.

Could you please try to make an awsxclbin from my xclbin?

I can provide my xclbin, which has been modified so that it cannot actually be run.

Please contact steeven.kim@blue-dot.io if you can.

The region is N. Virginia.
"FpgaImages": [
        {
            "FpgaImageId": "afi-0092bd7a65eb6908c",
            "FpgaImageGlobalId": "agfi-074215586b74cb0d4",
            "Name": "",
            "Description": "",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-10-06T23:49:51+00:00",
            "UpdateTime": "2022-10-06T23:49:51+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        }
    ]

Special thanks.

Steven

Answered 2 years ago
