Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
1732 results
I have AWS Glue Crawlers that crawls databases for table's metadata in Snowflake via JDBC connection. It will crawl table's fields and types to AWS Glue but it will not bring the fields...
1) RDS oracle DB is created in account A and publicly access is enabled
2) Oracle DB can be accessed via SQL Developer using
Hostname = *<Endpoint>* of oracle DB in step 1
Port = 1521
SID = xx
3)...
Hello,
I’m seeking guidance and suggestions on cost-effective methods for scanning a couple of DynamoDB tables, each with sizes of up to 3 TB and 5 TB.
Our goal is to join these tables based on...
Hi,
I'm trying to use SFTP connector in the marketplace to connect to a SFTP server using AWS Glue. I have set up a sftp server using AWS Transfer Family, that has only username and a key file....
Under AWS Glue > Data Catalog > Connections > Create connection, a JDBC connection is created. Would like to test the connection, however, no IAM role is available for selection to proceed with the...
I am using an AWS Glue to write an ETL pipeline that gets data from an S3 bucket, processes them and writes them back to the bucket while also creating a Glue catalog table. The code I am using is the...
i'm now trying to use AWS CLI to set the jobRunQueuing param after job is created, however, below is not working :
aws glue update-job --job-name my-job --job-update '{"JobRunQueuingEnabled":...
Glue job has a new feature to use job queuing for sequencially running job run requests when max concurrency limit has reached.
This feature can be enabled by setting jobRunQueuingEnabled to true from...
I met a error from
```
Caused by: java.net.SocketTimeoutException: connect timed out
```
while script went to
```
# Script generated for node Amazon Redshift
AmazonRedshift_nodexxxxxxxxxxx =...
Hi folks,
I have a partitioned table in Athena that uses dynamic partition projection, enabled with the following table properties:
```
projection.account.type injected
projection.region.type ...
I have created a dynamodb table that stores some data, then created a glue crawler that crawls to store the metadata of this table so I can query it using Athena. I am seeing the dynamodb table got...
Would like a clear step by step install of the connector. I was able to download docker image but where does that go? First time using Glue and setting it up this way.