Unanswered Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I need to pre-process some data on S3 before the Glue Crawler crawls the data. For this I created an S3 Object Lambda to do the pre-processing. If I test the Object Lambda using the CLI, it provides...
0
answers
1
votes
192
views
asked 2 years agolg...
In the [documentation](https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-python-libraries.html) I can see that we can add additional modules from pip using the following...
0
answers
0
votes
86
views
asked 2 years agolg...
Hello,
Currently I am trying to read csv files in my s3 bucket with the following format:
```
header 1, header 2, header 3
value 1, value 2, value 3
header 1, header 2, header 3, header 4
value 1,...
0
answers
1
votes
141
views
asked 2 years agolg...
Hello,
I am trying to query some files stored in s3. The files are stored in binary Amazon Ion format and have gzip compression.
When attempting to preview my table:
"SELECT * FROM...
0
answers
0
votes
87
views
asked 2 years agolg...
Hello, I am trying to use Glue to take an input file, do my required transformations, then output the columns in a specific order. I also want to output columns that may not be present in the input...
0
answers
0
votes
64
views
asked 2 years agolg...
I created glue catalog external schema on Redshift serverless .Both Glue and redshift are in same location .
----------------------
create external schema dojo
from data catalog
database...
0
answers
0
votes
170
views
asked 2 years agolg...
I was doing a POC using Glue to Migrate data from RDS MySql to RDS Postgres. I have created Connectors to both source and target, and a crawler which connected to source. Then created a job and tried...
0
answers
0
votes
77
views
asked 2 years agolg...
I have a parameterized glue job , that will be called in parallel (25 glue job) through step functions, when bookmark enabled , version mismatch exception is thrown, when disabled, it runs fine.
....
0
answers
0
votes
163
views
asked 2 years agolg...
I have a glue job that write to a Data Catalog. In the Data Catalog I originally set it up as CSV, and all works fine. Now I would like to try to use Parquet for the Data Catalog. I thought I would...
0
answers
0
votes
138
views
asked 2 years agolg...
I have a Glue ETL job which creates partitions during the job
```
additionalOptions = {"enableUpdateCatalog": True, "updateBehavior": "LOG"}
additionalOptions["partitionKeys"] = ["year",...
0
answers
0
votes
118
views
asked 2 years agolg...
Hello.
I'm trying to create an analysis from my DocumentDB instance. I'm using the aws services Glue, Athena and Quicksight.
In Glue I have created a connection to the DocumentDB and a crawler for...
0
answers
0
votes
336
views
asked 2 years agolg...
When developing some Glue scripts from a successful Crawler run from a JDBC Oracle data source, I am encountering an error that I cannot resolve.
```
An error occurred while calling...
0
answers
0
votes
139
views
asked 2 years agolg...