Questions tagged with AWS Glue
I'm running an EMR Serverless Spark job that uses Delta OSS to handle Delta tables. I previously resolved a configuration issue with EMR Serverless and AWS Glue Data Catalog...
1
answers
0
votes
276
views
asked a month ago
Why don't Glue Job and Glue Workflow have version control and aliases like Lambda?
I tried to build data orchestration with S3, Glue Job, and Glue Workflow. After developing it, I found that Glue Job and Glue Workflow don't have version control and alias...
0
answers
0
votes
185
views
asked a month ago
Hi team, first post, let me know if it provides a good explanation.
I'd like to know a way to minimize the effort for data ingestion.
We have two options as follows:
(1) CSV files from a file...
0
answers
0
votes
311
views
asked 2 months ago
I get the error "InvalidInputException: Unable to resolve any valid connection" when I test my AWS Glue connection to my MongoDB Atlas database.
I can connect with an identical string, user and...
1
answers
0
votes
79
views
asked 2 months ago
Kinesis Firehose lets you configure S3 as a destination and, in the Parquet section, select a Glue catalog table in Iceberg format. However, I had little to no luck querying the data.
Does...
1
answers
0
votes
138
views
asked 2 months ago
I am trying to connect an AWS Glue crawler to a PostgreSQL database on RDS. Both the crawler and the database are in the same region. Steps followed:
1. Created a connection with JDBC URL, username and...
1
answers
0
votes
137
views
asked 2 months ago
Hi all,
I have shared a Glue table (S3) with another account where I can already query it via Athena.
Now I added Lake Formation permissions for the database and table to the role that I am using...
1
answers
0
votes
137
views
asked 2 months ago
I want to use AWS Glue Data Catalog as a metastore. I'm running an EMR Serverless job that inserts and updates data in a Delta Table. I've successfully populated Delta tables on my localhost...
2
answers
0
votes
143
views
asked 2 months ago
Hi, I'm implementing a case where either column can be null, but not both in the same record. I'm implementing this rule:
(ColumnValues "col_1" = NULL) or (ColumnValues "col_2" = NULL)
I'm seeing below...
1
answers
0
votes
67
views
asked 2 months ago
Glue Job - S3 to S3
Hi Team,
I am working on a Glue job to copy/move files from one bucket to another. Could you please help me with your thoughts?
1. Using Python, how to copy/move the unzipped file to target...
0
answers
0
votes
86
views
asked 2 months ago
Following this post: https://repost.aws/knowledge-center/glue-reduce-cloudwatch-logs
I have created the following Glue job:
```
from awsglue.context import GlueContext
from pyspark.context import...
```
1
answers
0
votes
98
views
asked 2 months ago
Hi,
I am testing Amazon DataZone features and therefore set up a domain together with another associated account.
I enabled the DataLake blueprint in both accounts. I have 2 projects (producer,...
2
answers
0
votes
130
views
asked 2 months ago