Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

I have a Glue job that is supposed to read from a 1.4 GB DynamoDB table, process it, and write to Redshift. The job always fails with: 'An error occurred while calling o181.pyWriteDynamicFrame....
0
answers
0
votes
470
views
asked 2 months ago
Hi, I am trying to read a CSV file and then write it as a Delta file to S3 in an AWS Glue notebook. I get the error: Caused by: java.lang.ClassNotFoundException: delta.DefaultSource. I am using the following: from...
Accepted Answer
AWS Glue
1
answers
0
votes
248
views
asked 2 months ago
I have an AWS Glue table with one partition named dt. I can add data to my S3 bucket using Athena via this Glue table and can also query it, but I am not able to query the data using the Redshift query editor. I...
1
answers
0
votes
533
views
shivank
asked 2 months ago
I'm trying to find out if Trino on EMR supports access controls maintained in Lake Formation. My catalog is AWS Glue. I couldn't find any documentation on the Lake Formation or EMR side that would talk...
1
answers
0
votes
445
views
Saawgr
asked 2 months ago
Hi everyone, I changed the KMS key in the Glue Catalog settings, so I need to delete my tables and then re-create them by running crawlers. It seems that deleting and re-creating tables causes the bookmark...
Accepted Answer
AWS Glue
1
answers
0
votes
146
views
gh02
asked 2 months ago
Hey guys, I created ETL jobs with 'Change schema' nodes, renamed the source keys, and dropped the keys I didn't need. When I edit these jobs, I see that the renames and the dropped keys are not saved. Do...
1
answers
0
votes
165
views
asked 2 months ago
Hi everyone, I got stuck on something. If the bookmark is reset, all data will be scanned. How can I estimate the cost of that? Let me explain; can you guide me with the following...
1
answers
0
votes
182
views
gh02
asked 2 months ago
I have set up a default IAM role in the step "Admins: Grant access to AWS Glue and set a default IAM role." But in a new Glue job, this IAM role does not appear by default. What is missing? Is...
1
answers
0
votes
233
views
asked 2 months ago
Hi everyone. I ran an experiment and found that in Glue, if we delete a table and re-create it with a crawler, it affects the Glue bookmark (for ETL jobs). It is like resetting the bookmark. Am I correct? And...
Accepted Answer
AWS Glue
1
answers
0
votes
194
views
gh02
asked 2 months ago
I need to write a Logs Insights query to find all of the job run IDs associated with a Glue job within a certain time frame, but when I write what I believe is the correct query, I come up with 0 of 0...
1
answers
0
votes
159
views
Ben
asked 2 months ago
Hello there! I am trying to create a kind of push_down_predicate from a sql_query using the concepts from https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-pushdown.html. I...
2
answers
0
votes
183
views
asked 2 months ago
I am trying to create an AWS Glue routine that consumes a database table from the Data Catalog and a CSV, and then joins them on two columns (one from each table). After that, I added a regex...
1
answers
0
votes
141
views
asked 2 months ago