Questions tagged with AWS Glue
Hi,
I've created a Glue Crawler to determine the data structure of an XML file uploaded to S3 and write the table into the Data Catalog.
I tried 2 approaches:
1. Use Glue default classifier - this is preferred...
0
answers
0
votes
18
views
asked 2 hours ago
Hello.
I am trying to configure a specific IAM permission for a user. I need a permission that only allows reading tables from the existing Data Catalog.
So, I have configured this policy:
```
{
"Version":...
```
0
answers
0
votes
65
views
asked 10 hours ago
I've successfully set up AWS Glue with an RDS database serving as the data source and a Snowflake database as the data target. In this setup, I've configured AWS Glue crawlers to catalog the metadata...
0
answers
0
votes
38
views
asked 20 hours ago
I am running a Glue job of type Python shell (Python 3.9) with Glue version 3.0. I am passing 8 arguments to the Glue job and accessing them using getResolvedOptions(args, options). One of the...
2
answers
0
votes
89
views
asked 3 days ago
Hi,
I have created a Glue job that is triggered by S3 events, using the design below:
S3 Bucket ---> SQS ---> Lambda ---> Trigger Glue job from Lambda.
I am facing the error below when multiple files are...
2
answers
0
votes
48
views
asked 4 days ago
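The S3 ---> SQS ---> Lambda ---> Glue design described in this question could be sketched roughly as follows. This is a minimal illustration, not the poster's code: the job name `my-glue-job` and the argument names `--source_bucket` and `--source_key` are hypothetical, and the sketch assumes each SQS message body is a standard S3 event notification. It starts one job run per uploaded object.

```python
import json
from urllib.parse import unquote_plus


def extract_s3_objects(sqs_event):
    """Return (bucket, key) pairs from an SQS event whose bodies are S3 notifications."""
    objects = []
    for record in sqs_event.get("Records", []):
        body = json.loads(record["body"])
        # Some messages (for example S3 test events) carry no "Records" key.
        for s3_record in body.get("Records", []):
            bucket = s3_record["s3"]["bucket"]["name"]
            # Object keys arrive URL-encoded in S3 notifications.
            key = unquote_plus(s3_record["s3"]["object"]["key"])
            objects.append((bucket, key))
    return objects


def start_runs(glue_client, job_name, sqs_event):
    """Start one Glue job run per S3 object and return the run IDs."""
    run_ids = []
    for bucket, key in extract_s3_objects(sqs_event):
        response = glue_client.start_job_run(
            JobName=job_name,
            # Hypothetical argument names; the job script would read them itself.
            Arguments={"--source_bucket": bucket, "--source_key": key},
        )
        run_ids.append(response["JobRunId"])
    return run_ids


def lambda_handler(event, context):
    # boto3 is imported here so the helpers above can be tested without it.
    import boto3

    # "my-glue-job" is a placeholder job name.
    return start_runs(boto3.client("glue"), "my-glue-job", event)
```

Passing the Glue client in as a parameter keeps the parsing and dispatch logic testable with a stub object in place of a real AWS client.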
Hello,
I work as a data engineer and business intelligence specialist for a fintech startup. We've entered into a new agreement with a supplier to provide a technological solution for managing their...
0
answers
0
votes
147
views
asked 4 days ago
Hello, I am relatively new to Glue and encountering some challenges with Glue ETL.
Our setup involves a datalake that retrieves data from a backend database as its source. This datalake is...
1
answers
0
votes
98
views
asked 6 days ago
Hello,
I have Parquet files in S3 that I parse using a Glue Crawler and query in Athena. I found that some files have two columns "x" and "y" that have the type **int64** while other files have them as...
1
answers
0
votes
109
views
asked 7 days ago
Unable to push Glue job to GitHub. Empty connections list is now allowed if connection is specified.
Hi,
I am trying to push the Glue job to a GitHub repo. I have added the access permissions to my role as specified in...
3
answers
0
votes
71
views
asked 7 days ago
I am using an AWS Lake Formation workflow to create a data lake following [this](https://docs.aws.amazon.com/lake-formation/latest/dg/getting-started-tutorial-jdbc.html) guide. Everything is set up as...
0
answers
0
votes
71
views
asked 7 days ago
I am trying to connect to a Redshift database from Python shell jobs. I tried packages like psycopg2, redshift connector and pg. All of them gave a similar error, hence I'm assuming the problem is to establish...
1
answers
0
votes
120
views
asked 7 days ago
I've imported a dataset of JSON objects that all have a consistent schema. The Glue crawler finished successfully and created a table. I can select * with a limit in Athena, but when I select all rows I get...
0
answers
0
votes
134
views
asked 7 days ago