Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

I need to write a Log Insights query to find all of the job run ID's associated with a Glue job within a certain time frame and when I write what I believe is the correct query I come up with 0 of 0...
1
answers
0
votes
161
views
Ben
asked 2 months ago
Hello there! I am trying to create a kind of push_down_predicate from a sql_query using the conceptions from [https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-pushdown.html](). I...
2
answers
0
votes
188
views
asked 2 months ago
I am trying to create an aws glue rotine which consum an database table from datacatalog and an csv, in this way join this table based on two columns (on from each table). After that i added an regex...
1
answers
0
votes
144
views
asked 2 months ago
Hi everyone, I need a bash script which gives me the maximum runtime of each crawler. it seems that the 'LastRuntimeSeconds' just gives the latest run time.how can I have the max of run time for each...
1
answers
0
votes
133
views
profile picture
gh02
asked 2 months ago
I am creating an ETL pipeline over AWS Visual Glue Jobs going from a Glue Catalog => Change Schema => Flatten => S3 Bucket. However, I am getting the same error consistently during this process:...
Accepted AnswerAWS Glue
1
answers
0
votes
214
views
Ben
asked 2 months ago
I've been using an AWS Glue interactive Jupyter Notebook to write a script. This script reads JSON data from an S3 bucket, transforms the data types, and writes the output as a Parquet file back to...
1
answers
0
votes
902
views
profile picture
asked 2 months ago
Do I still get charged if I stop the Spark Session in the AWS Glue Interactive Session Notebook? Still don't understand how AWS Glue Interactive Session Notebook charges me. On the website, it...
Accepted AnswerAWS GlueAWS Billing
1
answers
0
votes
359
views
asked 2 months ago
Hi, I am trying to test something. Consider that the key that was defined in aws glue catalog is deleted (CMK). so I don't have access the tables and DBs. I want to delete tables and DBs and re-create...
1
answers
0
votes
171
views
profile picture
gh02
asked 2 months ago
Hi From one day to another, all my Jupyter Notebooks jobs links in AWS Glue got like the screenshot attached ![Broken link](/media/postImages/original/IMieROVZoTTXeimkxmSBKotg) Any idea?
2
answers
0
votes
126
views
asked 2 months ago
I’m getting an error from my glue job (see error below). I’m using kinesis to stream data to amazon redshift, and doing some transformations with Glue using the visual ETL. As data sources, I’m using...
Accepted AnswerAWS Glue
1
answers
0
votes
159
views
asked 3 months ago
Consider CloudFront Logs injected by AWS are stored in S3 with pattern: `s3://aws-cloudfront-log-[AWS Account ID]/[Any prefix I preferred]/E[CloudFront Distribution...
1
answers
0
votes
295
views
Tom Lo
asked 3 months ago
following sample/example provided by aws here -...
1
answers
0
votes
147
views
asked 3 months ago