How should I delete afi that is stuck in 'pending'

0

I'd created several AFIs that are all stuck in 'pending' from the end of Aug. the relative thread is https://repost.aws/questions/QU-CsdCyAqTxedKYkG9PuwtA/pending-create-fpga-images-using-aws-ec-2

when I've tried to delete all stuck AFIs and the commands was banned because of OperationNotPermitted. How can I delete the AFIs regardless of their states?

Thank you.

asked 2 years ago264 views
6 Answers
0

Dear Chen. I'm still in trouble deleting those AFIs for 3 weeks. If those AFIs are able to be killed manually by AWS manager, I'd like to delete all AFIs.

below log is printed by aws ec2 describe-fpga-images --filter "Name=state,Values=pending"

Thank you

Steven

{
    "FpgaImages": [
        {
            "FpgaImageId": "afi-0b1e36b58c43fb122",
            "FpgaImageGlobalId": "agfi-0742d5f554e5eb457",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-31T00:29:06+00:00",
            "UpdateTime": "2022-08-31T00:29:06+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0b1cd25dd8300762b",
            "FpgaImageGlobalId": "agfi-08c81a0873387fad7",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-30T04:38:43+00:00",
            "UpdateTime": "2022-08-30T04:38:43+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-098d0f359d2dfecbf",
            "FpgaImageGlobalId": "agfi-0cc8380b5ad01cbd0[[j",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-28T04:00:02+00:00",
            "UpdateTime": "2022-08-28T04:00:02+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0ffea842aa12ac079",
            "FpgaImageGlobalId": "agfi-004c7b66611eee27b",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-24T23:06:33+00:00",
            "UpdateTime": "2022-08-24T23:06:33+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-07526d4edb613f3fe",
            "FpgaImageGlobalId": "agfi-02c6fd68a4bb8c681",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T08:09:01+00:00",
            "UpdateTime": "2022-08-25T08:09:01+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0d1b75d427423d770",
            "FpgaImageGlobalId": "agfi-07d807d4721e4acd8",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-29T05:21:34+00:00",
            "UpdateTime": "2022-08-29T05:21:34+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-06d840b7c87a67081",
            "FpgaImageGlobalId": "agfi-02a72d5444f21c979",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T06:12:51+00:00",
            "UpdateTime": "2022-08-25T06:12:51+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        },
        {
            "FpgaImageId": "afi-0f4bf08ce52eec30d",
            "FpgaImageGlobalId": "agfi-0a4f156256f299198",
            "Name": "Modified--Confidential",
            "Description": "Modified--Confidential",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-08-25T07:43:01+00:00",
            "UpdateTime": "2022-08-25T07:43:01+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        }
    ]
}
answered 2 years ago
  • Hi Steven,

    Thank you for providing us the information. I'll bring this information back to the development team and we'll find out what's causing the pending AFI creation and how to resolve it for you. I'll keep you posted as soon as I have the solution handy.

    Thanks,

    Chen

0

Hi Steven,

Could you please let us know which region these pending AFIs were created in?

Thanks

AWS
answered 2 years ago
  • Hi Chen. I've created those AFIs in North Virginia (us east-1). Thanks

0

Hi Steven,

Can you please check the current state of those pending AFIs? They should be unblocked and you can go ahead delete them if needed. Please let me if there is any problem there.

Thanks,

Chen

AWS
answered 2 years ago
  • Hi Chen. I was able to delete them all, so I did it. Thank you. By the way, could you please tell me the reason why those AFIs were stuck in pending for long time? In my assumption, if the file size (about 1GB) of xclbin is too big to proceed the implementation in aws server, the process is stuck. Am I right?

    Thank you. Steven

0

Hi,

Please refer to https://github.com/aws/aws-fpga/blob/master/hdk/docs/delete_fpga_image.md for the detailed instructions on deleting AFI. It sounds like the IAM policies might not be set up properly so you don't have permission to delete those AFIs. In addition, please note that shared AFIs are not allowed to be deleted. I'm not sure if this applies here, but it might be worth to double check as well. Please let me know if you still have trouble deleting AFIs.

Thanks,

Chen

AWS
answered 2 years ago
0

Hi Steven,

We don't believe the large file size of xclbin caused it, large xclbin file be fine. Please keep us posted if you see this happens to you again.

Thanks,

Chen

AWS
answered 2 years ago
0

Hi Chen.

After all, I've created new xclbin and I tried to make a new awsxclbin, and the awsxclbin's state is also 'pending' for 9 hours.

Could you please try to make a awsxclbin by using my xclbin?

I can provide my xclbin which was hacked not to be available for a run.

please contact steeven.kim@blue-dot.io if you can.

Region is north virginia
"FpgaImages": [
        {
            "FpgaImageId": "afi-0092bd7a65eb6908c",
            "FpgaImageGlobalId": "agfi-074215586b74cb0d4",
            "Name": "",
            "Description": "",
            "State": {
                "Code": "pending"
            },
            "CreateTime": "2022-10-06T23:49:51+00:00",
            "UpdateTime": "2022-10-06T23:49:51+00:00",
            "OwnerId": "034954464192",
            "Tags": [],
            "Public": false,
            "DataRetentionSupport": false
        }
    ]

Special thanks.

Steven

answered 2 years ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions