Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

1732 results
Hi, I am using S3 for data shuffling in my Glue job. When I ran the notebook, it failed with a FileNotFoundException for some objects. However, I can see the objects in the shuffling S3 bucket. Any...
2 answers, 0 votes, 168 views
mykc asked 2 months ago
**Overview of Your AWS Architecture:**
- **Data Source:** RDS Aurora MySQL
- **ETL Process:** AWS Glue
- **Data Target:** Amazon Redshift
- **Reporting Tool:** Amazon QuickSight

**My Objective:** i...
1 answer, 0 votes, 180 views
Mouhcin asked 2 months ago
Hi, I would appreciate any help on how I can connect or test the connection to Oracle on-prem either from Glue or other services. I followed the steps from this article...
Accepted Answer (tags: AWS Glue)
1 answer, 0 votes, 180 views
JSM asked 2 months ago
Hi, I need help connecting to the Oracle on-prem database via JDBC in AWS Glue. I am using a custom driver. Our Oracle version is 19c and I'm using ojdbc10. I've been getting different errors and this...
0 answers, 0 votes, 123 views
JSM asked 2 months ago
Hello team, I have a use case to read data from an on-premises table to S3 using MWAA. Once the read completes, I need to update a flag in the on-premises table (a typical Oracle update of records from Glue...
0 answers, 0 votes, 321 views
Asked 2 months ago
I am blocked trying to use the OpenSearch Glue connection to connect to an OpenSearch domain that is hosted in a VPC. I have configured the connection to OpenSearch with a security group that allows...
2 answers, 0 votes, 259 views
Asked 2 months ago
Currently, I am using the Spark Hudi DeltaStreamer job to read CSV and write to the Hudi table. Additionally, the S3 bucket is synced with the AWS Glue catalog. The DeltaStreamer job creates 2 tables...
0 answers, 0 votes, 366 views
Pavlo asked 2 months ago
Hello, I am experiencing an issue with configuring AWS Glue to use Bitbucket for version control. Despite providing the necessary permissions and configuring the repository owner, I am unable to list...
Accepted Answer (tags: AWS Glue)
1 answer, 0 votes, 335 views
Asked 2 months ago
Hello, I am facing a weird issue with AWS Glue. I do have a NAT Gateway in the VPC, which should take care of the network issues, so I am not sure why the networking issue persists. I...
Accepted Answer (tags: Amazon VPC, AWS Glue)
1 answer, 0 votes, 281 views
pbocan asked 2 months ago
I'm running a Visual ETL job in the Glue service. I'm testing the service through the visual editor, and I started with a data source pointing to a DynamoDB table (beforehand I created a crawler, ran it, then I aws...
2 answers, 0 votes, 555 views
cfabres asked 2 months ago
I have an EventBridge rule that triggers when a new file is added to an S3 bucket, with the EventBridge target being a Glue workflow. Now I want to pass event data from EventBridge to my Glue workflow...
2 answers, 0 votes, 551 views
Toby asked 2 months ago
I zipped my modules into a zip file, uploaded it to S3, and added it to my PySpark and shell jobs under the `Python library path` parameter: ...
1 answer, 0 votes, 257 views
Asked 2 months ago