Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
1733 results
I am keeping track of some data, based on the date on S3. The files are stored in directories like this: year=yyyy/month=mm/day=dd and inside this directory, there are multiple csv files. I want a...
I'm trying to run a Python script in aws glue that uses athena.get_query_runtime_statistics when I run i on my local machine the script works, but running at glue returns this error
I setup a job using Glue visual, connect to the appropriate table in Oracle, and the data appears to be selected perfectly from the source:
![Data looks...
Hi,
I'm getting started at AWS Glue and developing my first ETL. While doing that, I'm testing the FillMissingValues step and getting this error.
![Enter image description...
I'd be grateful for a clue on how to craft a connection string for AWS Glue to connect to a SQL Server Always on AG using the Microsoft JDBC Drivers. I'm trying to use this bring your own driver...
I am trying to add a default to an existing field in an Avro schema in AWS Glue, but the change isn't registering as a new version.
Is this behavior expected? If so, why? If not, how can I go about...
I have data that I collect from AWS Batch and CloudWatch. I made a lambda function that runs every day that collects those data daily and saves the result to S3. I have a folder called 'logs' and the...
We have enabled the lake formation for some POC and we are unable to disable it and get default setting. The problem we have is if I create a db in athena and to create table in same database using...
Is there any version control integration for Glue Workflow (orchestration) jobs and triggers?
We primarily use the visual editor and recently accidently deleted a workflow. We need a way to version...
When creating the temporary table to perform the MERGE in Redshift, I get the error "**String length exceeds DDL length.**" I am using visual ETL. How can I make Redshift use the maximum length for a...
Hi, I have been trying to find documentation on how to include a column with the Oracle SCN in the DMS task output ...
I'm experimenting with Amazon DataZone and encountered something unexpected. I have a simple setup with one AWS account and one DataZone domain, which includes:
1 Glue Table
1 S3 bucket with my...