Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I have a basic example of 2 CSV files where I'm doing an INNER JOIN by one of the columns in the CSV.
I believe the results in the Data preview are not correct:
* ![Contacts...
1
answers
0
votes
226
views
asked 6 months agolg...
We are running Open search on a few EC2 instances and want to see if there is a way to use this cluster as a target for our AWS Glue jobs.
The cluster can be reached via a URL on port 9200 using...
1
answers
0
votes
117
views
asked 6 months agolg...
Getting started with AWS Glue at my new workplace (I previously used AirFlow). My colleagues are running scheduled jobs via Glue, but they usually put a python script in the Glue editor, then adjust...
2
answers
0
votes
294
views
asked 6 months agolg...
We're trying to set up a cross-account configuration where a glue job in Account A connects and pulls data from a DB in an Aurora MySQL RDS cluster in Account B, using IAM authentication.
We've...
2
answers
0
votes
367
views
asked 6 months agolg...
I am building a data pipeline to Load data into Redshift from an S3 data lake.
Data are stored in Parquet format on S3 and I would like to load them into the respective Redshift tables using an AWS...
1
answers
0
votes
447
views
asked 6 months agolg...
Getting error while trying to access the file stored in S3.
Error retrieving script
[s3.ap-south-1.amazonaws.com] getObject: UriParameterError: Expected uri parameter to have length >= 1, but found ""...
1
answers
0
votes
420
views
asked 6 months agolg...
I have a glue job that is reading a couple of CSV files form S3.
1. I manually choose examples files to infer the schema, that works fine.
2. Then I add a join action but I don't want to infer the...
1
answers
0
votes
225
views
asked 6 months agolg...
Hey,
We have a Glue crawler crawling a series of CSVs in S3 and capturing this in a database. This is surfaced in Redshift via Spectrum Schema.
The problem we have is that in Redshift, the...
1
answers
0
votes
187
views
asked 6 months agolg...
Scenario:
Source table: Glue Data Catalog table **study** crawled from MySQL with columns:
* id (int),
* code (varchar),
* desc (varchar)
* and 2 other columns not used in the job.
Target table:...
0
answers
0
votes
101
views
asked 6 months agolg...
Hey all, I wish to move files from a folder in my S3 bucket to a different folder. I get below error.
```
ClientError: An error occurred (AccessDenied) when calling the CopyObject operation: Access...
1
answers
0
votes
222
views
asked 6 months agolg...
We have a usecase where two appflows that should trigger one glue model,
The glue model takes more than 5 minutes to complete its run but by the time the glue job is running, we have a second job run...
0
answers
0
votes
67
views
asked 6 months agolg...
Hi Folks, I have a postgres RDS connection that I wish to connect with my AWS Glue. The version is PostgreSQL 12.14 on x86_64-pc-linux-gnu. I require to use a custom JDBC driver which I have uploaded...
1
answers
0
votes
351
views
asked 6 months agolg...