Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

1736 results
Steps taken: 1. Select existing ETL Job (let's call it "sample-job"). 2. Clone job. 3. New job created, called "sample-job-copy". 4. Rename job. 5. Hit enter immediately after renaming. Outcome: New...
Accepted AnswerAWS Glue
2
answers
0
votes
73
views
asked a month ago
We are new to Glue env and dealing with our huge cloud-watch bill, we changed log-level in pyspark script from INFO to ERROR. We are using both python logger and spark logger as below in pyspark (Glue...
0
answers
0
votes
39
views
asked a month ago
![My Cost](/media/postImages/original/IMQAIamlJDTlC0OjRCQSZWWg) I'm using Pretier. EC2 is using t2.micro. I understand about the crawling cost of glue. But I don't understand why the cost of vpc...
1
answers
0
votes
47
views
asked a month ago
Is GitLab running on an organization's host(URL is other than https://gitlab.com/) suported by Git integration in Glue Studio? Thank you.
Accepted AnswerAWS Glue
2
answers
0
votes
92
views
asked a month ago
Hello. Is there any Glue API which can create a Glue job with a downloaded JSON code from Glue Studio editor? Or can the downloaded code only be used to upload to Glue Studio editor manually? Thank...
1
answers
0
votes
60
views
asked a month ago
I need to execute two commands into an Aurora MySQL database, which has a connection in glue already in place. The first command is **TRUNCATE TABLE** and the second **LOAD DATA FROM S3** into a...
0
answers
0
votes
22
views
profile picture
asked a month ago
Good morning, When creating a crawler with the wizard, the following message is appearing: One crawler failed to create The following crawler failed to create: "teste-ucs" Here is the most recent...
1
answers
0
votes
38
views
asked a month ago
connection is trying to connect to postgreSQL RDS. Information like endpoint, username, password is well written. It might be timeout, given the spinning when trying to connect. But I don't know the...
2
answers
0
votes
73
views
asked a month ago
Trying to have a Glue Crawler/Athena automatically pick up a timestamp format of `yyyy-MM-dd'T'HH:mm:ss.SSS'Z'`. I have confirmed all the values in my dataset are formatted as this format, example:...
0
answers
0
votes
31
views
asked a month ago
Came across the following useful documentation: https://docs.aws.amazon.com/prescriptive-guidance/latest/apache-iceberg-on-aws/best-practices-read.html#read-sort-order I have a large table where I...
1
answers
0
votes
58
views
Kai
asked a month ago
Hi everyone! I’ve been following this guide (https://aws.amazon.com/blogs/aws/simplify-custom-contact-center-insights-with-amazon-connect-analytics-data-lake/) to gather data from Amazon Connect and...
0
answers
0
votes
28
views
Haider
asked 2 months ago
Invoking a Glue Workflow from Step Functions, from wich i need to pass the Input State Step Function ASL: ``` "StartGlueWorkflow": { "Type": "Task", "Resource":...
0
answers
0
votes
43
views
asked 2 months ago