I ran the single-instance training job with the PennyLane Lightning-GPU simulator on an ml.p3.16xlarge instance,
following the notebook at the URL below, using the Amazon Braket SDK from the command line on my local machine:
https://github.com/aws/amazon-braket-examples/blob/main/examples/hybrid_jobs/5_Parallelize_training_for_QML/Parallelize_training_for_QML.ipynb
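For context, here is a minimal sketch of the kind of call involved. It is not a copy of parallelize_training.py: the module name `qml_script`, the entry point `qml_script.train_single`, the choice of the `PL_PYTORCH` container image, and the helper names `build_job_kwargs` and `submit` are assumptions for illustration. Only the instance type and the device string differ between the GPU run and the CPU run.

```python
def build_job_kwargs(instance_type, device):
    """Collect the job settings as a plain dict (hypothetical helper)."""
    return {
        "device": device,
        "source_module": "qml_script",             # assumed module name
        "entry_point": "qml_script.train_single",  # assumed entry point
        "instance_type": instance_type,
        "wait_until_complete": True,
    }


def submit(kwargs):
    """Create the hybrid job; needs the Braket SDK and AWS credentials."""
    # Imports are deferred so the dicts below can be built without the SDK.
    from braket.aws import AwsQuantumJob, AwsSession
    from braket.jobs.config import InstanceConfig
    from braket.jobs.image_uris import Framework, retrieve_image

    kwargs = dict(kwargs)
    instance_config = InstanceConfig(instanceType=kwargs.pop("instance_type"))
    # Assumption: the PennyLane/PyTorch container is used for the job.
    image_uri = retrieve_image(Framework.PL_PYTORCH, AwsSession().region)
    return AwsQuantumJob.create(
        instance_config=instance_config, image_uri=image_uri, **kwargs
    )


gpu_kwargs = build_job_kwargs("ml.p3.16xlarge", "local:pennylane/lightning.gpu")
cpu_kwargs = build_job_kwargs("ml.m5.2xlarge", "local:pennylane/lightning.qubit")
# job = submit(gpu_kwargs)  # the GPU variant is the one that fails for me
```

The dict is built separately from the submission so the two configurations can be compared without contacting AWS.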
I get the following output:
Traceback (most recent call last):
  File "/media/user1/QML/amazon-braket-examples/examples/hybrid_jobs/5_Parallelize_training_for_QML/parallelize_training.py", line 27, in <module>
    job = AwsQuantumJob.create(
  File "/home/user1/.conda/envs/braket/lib/python3.10/site-packages/braket/aws/aws_quantum_job.py", line 198, in create
    job_arn = aws_session.create_job(**create_job_kwargs)
  File "/home/user1/.conda/envs/braket/lib/python3.10/site-packages/braket/aws/aws_session.py", line 211, in create_job
    response = self.braket_client.create_job(**boto3_kwargs)
  File "/home/user1/.conda/envs/braket/lib/python3.10/site-packages/botocore/client.py", line 508, in _api_call
    return self._make_api_call(operation_name, kwargs)
  File "/home/user1/.conda/envs/braket/lib/python3.10/site-packages/botocore/client.py", line 915, in _make_api_call
    raise error_class(parsed_response, operation_name)
botocore.errorfactory.AccessDeniedException: An error occurred (AccessDeniedException) when calling the CreateJob operation: This account is not authorized to use this resource. In order to access additional resources, please contact customer support
AWS Support tells me that my approved service quota is:
Service: SageMaker Notebook Instances
Region: US East (Northern Virginia)
Resource Type: Training Job Instances
Limit name: ml.p3.16xlarge
New limit value: 2
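The quota above is listed under SageMaker, while the failing call is Braket's CreateJob. A minimal sketch for listing quotas under Amazon Braket instead, assuming Braket tracks hybrid job instance limits under the Service Quotas service code `braket` (I have not confirmed this for my account). The helper names `matching_quotas` and `fetch_quotas` are hypothetical; `us-east-1` is the region code for US East (Northern Virginia).

```python
def matching_quotas(quotas, needle):
    """Return (name, value) pairs for quotas whose name mentions `needle`."""
    return [(q["QuotaName"], q["Value"]) for q in quotas if needle in q["QuotaName"]]


def fetch_quotas(service_code, region):
    """Page through the Service Quotas API; needs boto3 and AWS credentials."""
    import boto3  # deferred so matching_quotas can be used without boto3

    client = boto3.client("service-quotas", region_name=region)
    paginator = client.get_paginator("list_service_quotas")
    quotas = []
    for page in paginator.paginate(ServiceCode=service_code):
        quotas.extend(page["Quotas"])
    return quotas


# Example usage (makes real API calls, so it is left commented out):
# for name, value in matching_quotas(fetch_quotas("braket", "us-east-1"), "ml.p3.16xlarge"):
#     print(name, value)
```

If nothing is printed, or the value is 0, the approved quota may not apply to Braket hybrid jobs, but that is only a guess until support confirms it.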
What could be causing the AwsQuantumJob.create error? The same training job ran successfully on an ml.m5.2xlarge instance with the lightning.qubit simulator. Only the combination of the ml.p3.16xlarge instance and the lightning.gpu simulator fails.