Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
1734 results
Hi,
I have an RDS postgres db that i want to crawl using aws glue, i already set up the glue crawler job and the glue connection.
But i'm currently getting this error on executing the crawler...
This code is working for some of the assets in datazone but giving error for some assets, Do I need to change code for according to the asset type? or what can I change to fix the error?
Content = ...
I get this Error when run Glue ETL job:
`Error Category: RESOURCE_NOT_FOUND_ERROR; An error occurred while calling o228.pyWriteDynamicFrame....
* **Glue version**: 4.0
* **the Python codes that occurs the error:**
```
df.select([col(c).cast("string") for c in df.columns]).repartition(1).write.mode('overwrite').option('header',...
I have a crawler that I'm trying to have extract headers and data from a CSV file. When I run the crawler and then use Athena to query the table it returns the no data. It seems to only extract the...
Invoking a Glue Workflow from Step Functions got the following error when deploying Cloudformation:
```
Resource handler returned message: "Invalid State Machine Definition:...
I have 4 csvs that have same columns and I am able to crawl them as 1 data table. the issue I am facing is even after adding
areColumnsQuoted = true I am seeing each column value enclosed with double...
I have below code for setting up alarm for AWS glue job using CDK:
`
```
import { aws_cloudwatch as cloudwatch, aws_events as events } from 'aws-cdk-lib';
// jobName. This is our AWS Glue script to...
I want to create a crawler on my RDS database but I cannot create the role needed as it it disabled. The AWS console user I am using has admin level role.
![Enter image description...
I have written an ETL job in AWS Glue using the interactive notebook and I want to enable job bookmark to avoid reprocessing already processed data. The source data are in an S3 bucket, a Glue data...
Hi,
I am using a s3 bucket for data shuffling. The Glue job failed with the following error:
"An error occurred while calling o147.saveAsTable. Job aborted due to stage failure: ResultStage 5...
Hallo,
I wanted to add file pattern in AWS Glue ETL job python script where it should generate the files in s3 bucket with pattern dostrp*.csv.gz but could not find way how to provide this file...