Hello,
I have an AWS Glue job that is only supposed to perform an SQL query on the current status. Unfortunately, I always get the following error: "Error Category: QUERY_ERROR; AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: com.amazonaws.SdkClientException: Unable to execute HTTP request: Connect to glue.eu-central-1.amazonaws.com:443 [glue.eu-central-1.amazonaws.com/10.0.0.16, glue.eu-central-1.amazonaws.com/10.0.1.167] failed: connect timed out"
Now I have read in a few articles that the VPC endpoint must be configured for this. Unfortunately, this has hardly any effect on my problem.
I have the following setup for debugging:
- 1x VPC
- 1x RDS in VPC
- 1x AWS Glue with Dataconnection in the VPC
- 1 x EC2 instance in the VPC
- 1 x VPC interface endpoint to aws glue
When I execute telnet glue.eu-central-1.awsamazon.com 443
in the ec2 instance I get the following error:
telnet: glue.eu-central-1.awsamazon.com: Name or service not known
If I run the command without an active VPC endpoint I get the same error but much later. I also see the same in the AWS glue runs. A run with the VPC endpoint runs for about 2.5 minutes a run without VPC endpoint runs for about 8.5 minutes