Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

1735 results
I created a custom visual transform component and put the needed json and python files in S3. The component loaded up as expected. Later, I needed to do some more adjustments to the parameters...
2
answers
0
votes
186
views
EdwardR
asked 3 months ago
I have a glue pyspark script that processes DDB data exported to S3 and writes it to Redshift. Initially, it was using below logic: ``` redshiftConnectionOptions = { "postactions": "BEGIN; MERGE...
1
answers
0
votes
232
views
asked 3 months ago
I just can't understand what I'm doing wrong. I have a table. ``` CREATE EXTERNAL TABLE test ( originalrequest string, requeststarted string ) PARTITIONED BY ( req_start_partition...
Accepted AnswerAmazon AthenaAWS Glue
2
answers
0
votes
375
views
profile picture
Smotrov
asked 3 months ago
I have been writing CloudFormation Stack using `yaml` and deploying it to AWS Infrastructure ( For legacy reasons, I can not switch to CDK unfortunately ;)) Following yaml code is a part of the...
2
answers
0
votes
277
views
profile picture
asked 3 months ago
Got this error when trying to insert from temp internal table to external table. ERROR: Invalid DataCatalog response for external table "reportdb"."logs_aggregated": Cannot deserialize Table. Error:...
1
answers
0
votes
234
views
shivank
asked 3 months ago
I have plenty of databases listed in Glue. I am using a policy with limited resource access so that I can only see specific type of databases. These are my policies with respective permission:...
1
answers
0
votes
181
views
asked 3 months ago
Hi, We have a glue job with these details: version 4, worker_type G.4X with 20 number of workers. It runs Python script. When executing, it fails with below error: :...
Accepted AnswerAWS Glue
2
answers
0
votes
122
views
Sharif
asked 3 months ago
I am trying to connect my AWS Glue notebook in Sagemaker Studio to Redshift Serverless, but I keep encountering a connection timeout error. The network mode is: Public internet access. To this mode, I...
1
answers
0
votes
252
views
asked 3 months ago
I'm trying to refresh a materialized view with a glue job, connecting to Redshift cluster using boto3 authenticating with a database username. The execution timeouts with no errors in CloudWatch. I'm...
3
answers
0
votes
251
views
profile picture
asked 3 months ago
Hi I am facing **ERROR : Internal Service Exception** while trying to crawl the S3 bucket folder using the Glue crawler. Carwler Target is the Glue catalog tables. Earlier it worked for one crawler...
1
answers
0
votes
139
views
ravi_tb
asked 3 months ago
I'm confused by AWS documentation regarding compatibility with delta tables. We need to delete a column that is the "column mapping" feature supported in delta-lake 1.2.0 and we do it through spark...
1
answers
0
votes
291
views
Sergii
asked 3 months ago
Hi, I am creating a Glue job to copy files from Source S3 to another target S3. The source S3 and Glue Job are in same AWS account. But the target bucket is different account. 1. I can read the file...
5
answers
0
votes
466
views
Bharath
asked 3 months ago