Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
1732 results
I have been writing CloudFormation Stack using `yaml` and deploying it to AWS Infrastructure ( For legacy reasons, I can not switch to CDK unfortunately ;))
Following yaml code is a part of the...
Got this error when trying to insert from temp internal table to external table.
ERROR: Invalid DataCatalog response for external table "reportdb"."logs_aggregated": Cannot deserialize Table. Error:...
I have plenty of databases listed in Glue. I am using a policy with limited resource access so that I can only see specific type of databases.
These are my policies with respective permission:...
Hi,
We have a glue job with these details: version 4, worker_type G.4X with 20 number of workers. It runs Python script. When executing, it fails with below error:
:...
I am trying to connect my AWS Glue notebook in Sagemaker Studio to Redshift Serverless, but I keep encountering a connection timeout error.
The network mode is: Public internet access. To this mode, I...
I'm trying to refresh a materialized view with a glue job, connecting to Redshift cluster using boto3 authenticating with a database username. The execution timeouts with no errors in CloudWatch.
I'm...
Hi
I am facing **ERROR : Internal Service Exception** while trying to crawl the S3 bucket folder using the Glue crawler.
Carwler Target is the Glue catalog tables.
Earlier it worked for one crawler...
I'm confused by AWS documentation regarding compatibility with delta tables. We need to delete a column that is the "column mapping" feature supported in delta-lake 1.2.0 and we do it through spark...
Hi,
I am creating a Glue job to copy files from Source S3 to another target S3. The source S3 and Glue Job are in same AWS account. But the target bucket is different account.
1. I can read the file...
I am using Glue Version 4 notebook in Glue Studio. Also tried Script version in the console.
All of them do not recognize Lake Formation hybrid opt-in APIs. Throws below error.
"AttributeError:...
I have been implementing a small ETL job using Pyspark.
**I plan to deploy it to AWS Glue and will use an S3 bucket. to read and write my files instead of local file, once it is ready.**
This ETL...
I am using AWS GLUE ETL job that is fetching data from Mongo DB and putting it to AWS Glue catalog table but the issue is everytime the job runs it is creating the duplicate entries.(If there are 1000...