Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Browse through the questions and answers listed below or filter and sort to narrow down your results.

I have been trying to send data from Salesforce to Redshift using AppFlow. Every time I set up the flow, I get a 'Connector timed out' error. I have tried both serverless and cluster. I am...
2 answers, 0 votes, 703 views
asked 2 months ago
I have a Glue database called `edb_iris_iceberg_test`. It has an Iceberg-type table called `sample_data_iceberg_2`. Below is the table DDL - ``` CREATE TABLE...
1 answer, 0 votes, 260 views
suraj
asked 2 months ago
I have a PySpark script where I read data from an ETL table and post it to RDS; sample code below. I read the data into a DataFrame and use overwrite mode to update the data. This deletes the old...
1 answer, 0 votes, 261 views
asked 2 months ago
I have a Glue job that is supposed to read from a 1.4 GB DynamoDB table, process it, and write to Redshift. The job always fails with: 'An error occurred while calling o181.pyWriteDynamicFrame....
0 answers, 0 votes, 470 views
asked 2 months ago
Hi, I am trying to read a CSV file and then write it as a Delta file to S3 in an AWS Glue notebook. I am getting the error: Caused by: java.lang.ClassNotFoundException: delta.DefaultSource. I am using the following: from...
Accepted Answer | AWS Glue
1 answer, 0 votes, 248 views
views
asked 2 months ago
I have an AWS Glue table with one partition named dt. I can add data to S3 using Athena via this Glue table, and I can also query it. But I am not able to query the data using the Redshift query editor. I...
1 answer, 0 votes, 533 views
shivank
asked 2 months ago
I'm trying to find out if Trino on EMR supports access controls maintained in Lake Formation. My catalog is AWS Glue. I couldn't find any documentation on the Lake Formation or EMR side that would talk...
1 answer, 0 votes, 447 views
Saawgr
asked 2 months ago
Hi everyone, I changed the KMS key in the Glue Catalog settings, so I need to delete my tables and then re-create them by running crawlers. It seems that deleting and recreating tables causes the bookmark...
Accepted Answer | AWS Glue
1 answer, 0 votes, 147 views
gh02
asked 2 months ago
Hey guys, I created ETL jobs with 'Change schema' nodes, renamed the source keys, and dropped the keys I didn't need. When I edit these jobs, I see that the renames and the dropped keys are not saved. Do...
1 answer, 0 votes, 165 views
asked 2 months ago
Hi everyone, I got stuck with something. If a bookmark is reset, it means that all data will be scanned. How can I estimate the cost for that? Let me explain, and can you guide me with the following...
1 answer, 0 votes, 183 views
gh02
asked 2 months ago
I have set up a default IAM role in the step "Admins: Grant access to AWS Glue and set a default IAM role." But in a new Glue job, this IAM role does not appear by default. What is missing? Is...
1 answer, 0 votes, 237 views
asked 2 months ago
Hi everyone. I ran an experiment and found that in Glue, if we delete a table and re-create it with a crawler, it affects the Glue bookmark (for ETL jobs). It is like resetting the bookmark. Am I correct? And...
Accepted Answer | AWS Glue
1 answer, 0 votes, 197 views
gh02
asked 2 months ago