Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I set up a replication task with AWS Database Migration Service to implement full load + CDC from a RDS instance to a S3 bucket. Since I want to use Athena to query the data in S3, I set the option...
2
answers
0
votes
236
views
asked 2 months agolg...
I managed to use glue crawler to crawled data (parquet file) from s3, however the column with type "boolean" is recognised as "string" when checking the data schema. Although i can edit the schema on...
1
answers
0
votes
587
views
asked 2 months agolg...
I have a scenario where I need to move data from S3 to a Postgres database running on an EC2 instance. All of this is part of cdk app so I'm looking to add this as a step to the current step function....
1
answers
0
votes
288
views
asked 3 months agolg...
I'm unable to remove null rows in data using aws glue in either ways via visual or script...my data is getting un transformed even after running the schema
Accepted AnswerAWS Glue
1
answers
0
votes
183
views
asked 3 months agolg...
Is there any plan/ETA for bringing Spark 3.5 to AWS Glue. Is there any public roadmap?
2
answers
0
votes
212
views
asked 3 months agolg...
I am trying to use WHL file which contains all the packages and glue connection for etl job, The job is not getting initiliaze and is not populating or creating the logs and job run remains in running...
1
answers
0
votes
161
views
asked 3 months agolg...
AWS Glue DataBrewlg...
Hello All ,
I am trying to clean up my dataset see below ![Dataset](/media/postImages/original/IMlISUqk8QRVCZTKGDM3uwcA).
I want to remove the first row since the name is invalid and want to add it...
1
answers
0
votes
128
views
asked 3 months agolg...
I have created an External Table using Redshift Spectrum and using AWS Glue to crawl deeply nested json files coming into s3 bucket every second.
I was able to populate a redshift table by...
2
answers
0
votes
232
views
asked 3 months agolg...
I'm testing the use of MSK Connect with IcebergSinkConnector, but I'm having some difficulty making it work. If anyone has experience with MSK and can help me, that would be great.
I have a topic in...
2
answers
0
votes
295
views
asked 3 months agolg...
We have problem in determining that if a VPC Endpoint can be use by multiple AWS services. Lets say I have a S3 bucket endpoint and currently AWS Transfer Family is using it, then I want to use AWS...
1
answers
0
votes
918
views
asked 3 months agolg...
I have some json files being loaded to s3 and the events are being queued to sqs, I need to copy the information to redshift, I want to know if there is a way to load it directly from the sqs or if I...
2
answers
0
votes
436
views
asked 3 months agolg...
I have a parallelized python script running in containers which transforms data and writes it to S3 with updates to the Glue catalog. Each container runs several tasks in parallel and the overall data...
2
answers
0
votes
154
views
asked 3 months agolg...