Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

1733 results
I am keeping track of some data, based on the date on S3. The files are stored in directories like this: year=yyyy/month=mm/day=dd and inside this directory, there are multiple csv files. I want a...
Accepted Answer | Amazon Athena | AWS Glue
1 answer, 0 votes, 429 views
asked 4 months ago
I'm trying to run a Python script in AWS Glue that uses athena.get_query_runtime_statistics. When I run it on my local machine the script works, but running it in Glue returns this error
1 answer, 0 votes, 315 views
Marcelo
asked 4 months ago
I set up a job using Glue visual, connected to the appropriate table in Oracle, and the data appears to be selected perfectly from the source: ![Data looks...
0 answers, 0 votes, 200 views
RJ
asked 4 months ago
Hi, I'm getting started with AWS Glue and developing my first ETL. While doing that, I'm testing the FillMissingValues step and getting this error. ![Enter image description...
Accepted Answer | AWS Glue
1 answer, 0 votes, 198 views
Jona
asked 4 months ago
I'd be grateful for a clue on how to craft a connection string for AWS Glue to connect to a SQL Server Always On AG using the Microsoft JDBC drivers. I'm trying to use this bring your own driver...
2 answers, 0 votes, 493 views
Bryan
asked 4 months ago
I am trying to add a default to an existing field in an Avro schema in AWS Glue, but the change isn't registering as a new version. Is this behavior expected? If so, why? If not, how can I go about...
2 answers, 0 votes, 318 views
asked 4 months ago
I have data that I collect from AWS Batch and CloudWatch. I wrote a Lambda function that runs daily, collects that data, and saves the result to S3. I have a folder called 'logs' and the...
0 answers, 0 votes, 689 views
asked 4 months ago
We enabled Lake Formation for a POC, and we are unable to disable it and return to the default settings. The problem is that if I create a database in Athena and then try to create a table in the same database using...
1 answer, 0 votes, 505 views
Dasari
asked 4 months ago
Is there any version control integration for Glue Workflow (orchestration) jobs and triggers? We primarily use the visual editor and recently accidentally deleted a workflow. We need a way to version...
Accepted Answer | AWS Glue
1 answer, 0 votes, 156 views
Vince
asked 4 months ago
When creating the temporary table to perform the MERGE in Redshift, I get the error "**String length exceeds DDL length.**" I am using visual ETL. How can I make Redshift use the maximum length for a...
2 answers, 0 votes, 559 views
asked 4 months ago
Hi, I have been trying to find documentation on how to include a column with the Oracle SCN in the DMS task output ...
3 answers, 0 votes, 235 views
JT
asked 4 months ago
I'm experimenting with Amazon DataZone and encountered something unexpected. I have a simple setup with one AWS account and one DataZone domain, which includes: 1 Glue Table 1 S3 bucket with my...
1 answer, 0 votes, 418 views
Vincent
asked 4 months ago