Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
1735 results
Hello,
I want to add a file pattern to an AWS Glue ETL job Python script so that it generates files in an S3 bucket matching the pattern dostrp*.csv.gz, but I could not find a way to provide this file...
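A Spark-based job normally writes objects named part-..., so one common workaround is to write to a temporary prefix and rename the objects afterwards. The sketch below only plans the renames and is not a definitive solution: the function name `plan_renames`, the example prefixes, and the `dostrp_00000` numbering scheme are assumptions made for illustration. The actual copy and delete in S3 (for example with boto3's `copy_object` and `delete_object`) is left out.

```python
import fnmatch
import posixpath


def plan_renames(part_keys, out_prefix, stem="dostrp"):
    """Map Spark part-file keys to target keys matching '<stem>*.csv.gz'.

    All names here are hypothetical; adjust the stem and numbering as needed.
    """
    # Keep only data files; skip markers such as _SUCCESS.
    data_keys = sorted(
        key for key in part_keys
        if posixpath.basename(key).startswith("part-")
    )
    # Number the outputs so each target key is unique.
    return {
        key: posixpath.join(out_prefix, f"{stem}_{index:05d}.csv.gz")
        for index, key in enumerate(data_keys)
    }


if __name__ == "__main__":
    keys = [
        "tmp/run1/part-00001-abc.csv.gz",
        "tmp/run1/_SUCCESS",
        "tmp/run1/part-00000-abc.csv.gz",
    ]
    for source, target in plan_renames(keys, "exports/").items():
        # Every planned name should match the requested pattern.
        assert fnmatch.fnmatch(posixpath.basename(target), "dostrp*.csv.gz")
        print(source, "->", target)
```

Planning the renames separately from performing them keeps the naming logic testable without touching S3.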
I am developing a data pipeline for building a Redshift data warehouse as below:
1. Export DynamoDB data to S3 using 'export to S3' feature
2. In Glue, create a Spark DataFrame on the S3 exported...
I'm trying to build a regular expression for a Grok pattern. The goal is: given a sequence of digits, I need to assign the first eleven digits to a named capturing group, and then digits from...
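The first half of that goal can be prototyped with an ordinary regular expression before it is moved into a Grok pattern. This is a minimal sketch in Python, whose named-group syntax is `(?P<name>...)`; Grok patterns generally use the `(?<name>...)` form instead. The group names `first_eleven` and `remaining` are hypothetical, and capturing everything after the eleventh digit in a single group is an assumption, because the question is cut off before it says how the later digits should be split.

```python
import re

# First eleven digits go to one named group; any digits that follow go to a
# second group (an assumption, since the original requirement is truncated).
PATTERN = re.compile(r"(?P<first_eleven>\d{11})(?P<remaining>\d*)")


def split_digits(value):
    """Return the named groups as a dict, or None if the input does not fit."""
    match = PATTERN.fullmatch(value)
    return match.groupdict() if match else None


if __name__ == "__main__":
    print(split_digits("123456789012345"))
    print(split_digits("12345"))
```

Inputs shorter than eleven digits, or containing non-digit characters, return None rather than a partial match.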
Hi,
I am using S3 for data shuffling in my Glue job.
When I ran the notebook, it failed with a FileNotFoundException for some objects. However, I can see the objects in the shuffle S3 bucket. Any...
**Overview of Your AWS Architecture:**
- **Data Source:** RDS Aurora MySQL
- **ETL Process:** AWS Glue
- **Data Target:** Amazon Redshift
- **Reporting Tool:** Amazon QuickSight
**My Objective:**
i...
Hi, I would appreciate any help on how I can connect or test the connection to Oracle on-prem either from Glue or other services. I followed the steps from this article...
Hi, I need help connecting to the Oracle on-prem database via JDBC in AWS Glue. I am using a custom driver. Our Oracle version is 19c and I'm using ojdbc10. I've been getting different errors and this...
Hello team,
I have a use case to read data from an on-premises table to S3 using MWAA. Once the read completes, I need to update a flag in the on-premises table { a typical oracle update records from glue...
I am blocked trying to use the OpenSearch Glue connection to connect to an OpenSearch domain that is hosted in a VPC. I have configured the connection to OpenSearch with a security group that allows...
Currently, I am using the Spark Hudi DeltaStreamer job to read CSV and write to the Hudi table. Additionally, I sync the S3 bucket with the AWS Glue catalog. The DeltaStreamer job creates 2 tables...
Hello,
I am experiencing an issue with configuring AWS Glue to use Bitbucket for version control. Despite providing the necessary permissions and configuring the repository owner, I am unable to list...
Hello,
I am facing a weird issue with AWS Glue. I do have a NAT Gateway in the VPC, which should take care of the network issues, so I am not sure why the networking issue exists/persists. I...