Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

1732 results
I have set up default IAM role in the step - "Admins: Grant access to AWS Glue and set a default IAM role." But in a new glue job, this IAM role is not appearing by default. What is missing? Is...
1
answers
0
votes
273
views
asked 3 months ago
Hi everyone. I did one experiment and found out in Glue if we delete a table and re-create it by crawler it has effect on glue bookmark (for ETL jobs). it is like reset bookmark. am I correct? and...
Accepted AnswerAWS Glue
1
answers
0
votes
226
views
profile picture
gh02
asked 3 months ago
I need to write a Log Insights query to find all of the job run ID's associated with a Glue job within a certain time frame and when I write what I believe is the correct query I come up with 0 of 0...
1
answers
0
votes
170
views
Ben
asked 3 months ago
Hello there! I am trying to create a kind of push_down_predicate from a sql_query using the conceptions from [https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-pushdown.html](). I...
2
answers
0
votes
222
views
asked 3 months ago
I am trying to create an aws glue rotine which consum an database table from datacatalog and an csv, in this way join this table based on two columns (on from each table). After that i added an regex...
1
answers
0
votes
157
views
asked 3 months ago
Hi everyone, I need a bash script which gives me the maximum runtime of each crawler. it seems that the 'LastRuntimeSeconds' just gives the latest run time.how can I have the max of run time for each...
1
answers
0
votes
145
views
profile picture
gh02
asked 3 months ago
I am creating an ETL pipeline over AWS Visual Glue Jobs going from a Glue Catalog => Change Schema => Flatten => S3 Bucket. However, I am getting the same error consistently during this process:...
Accepted AnswerAWS Glue
1
answers
0
votes
235
views
Ben
asked 3 months ago
I've been using an AWS Glue interactive Jupyter Notebook to write a script. This script reads JSON data from an S3 bucket, transforms the data types, and writes the output as a Parquet file back to...
1
answers
0
votes
1079
views
profile picture
asked 3 months ago
Do I still get charged if I stop the Spark Session in the AWS Glue Interactive Session Notebook? Still don't understand how AWS Glue Interactive Session Notebook charges me. On the website, it...
Accepted AnswerAWS GlueAWS Billing
1
answers
0
votes
369
views
asked 3 months ago
Hi, I am trying to test something. Consider that the key that was defined in aws glue catalog is deleted (CMK). so I don't have access the tables and DBs. I want to delete tables and DBs and re-create...
1
answers
0
votes
181
views
profile picture
gh02
asked 3 months ago
Hi From one day to another, all my Jupyter Notebooks jobs links in AWS Glue got like the screenshot attached ![Broken link](/media/postImages/original/IMieROVZoTTXeimkxmSBKotg) Any idea?
2
answers
0
votes
137
views
asked 3 months ago
I’m getting an error from my glue job (see error below). I’m using kinesis to stream data to amazon redshift, and doing some transformations with Glue using the visual ETL. As data sources, I’m using...
Accepted AnswerAWS Glue
1
answers
0
votes
185
views
asked 3 months ago