Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

![ARCHITECTURE OVERVIEW](/media/postImages/original/IMkwpNhJtDTW2rxZ-wu0oG2g) a device that sends sensor data to RDS and then to an S3 bucket via LTE and WiFi. The sensor data is stored in the S3...
0
answers
0
votes
295
views
bud
asked 3 months ago
Hi Team, I am unable to create a Glue Crawler despite several attempts; it is showing the same issue. Below are the policies I attached for my Glue...
1
answers
0
votes
129
views
Priyam
asked 3 months ago
![Screenshot of Error](/media/postImages/original/IMmOli5QYNQ7ODktWoA70CZA) I've been trying to work around this for hours now. I've experimented with all different kinds of permissions but wasn't...
2
answers
0
votes
262
views
asked 3 months ago
I need to create a POC to implement an ETL process with SAS tables: a Glue job without business rules for 7 source tables, plus another job with business rules for each table, and after the ETL...
1
answers
0
votes
252
views
Edson
asked 3 months ago
Hi all, I'm relatively new to Glue, but I've got a Python ETL script that I've built that works pretty well. It reads two CSV files into dataframes and then unions them together into one normalized...
1
answers
0
votes
190
views
asked 3 months ago
I have an AWS Glue connection pointing at an external Kafka Cluster. I have a table declared within AWS Glue pointing at a topic on my Kafka Cluster. It references the AWS Glue connection. Within...
0
answers
0
votes
104
views
kwall
asked 3 months ago
Hey everyone, I am trying to query a set of JSON files in S3 with Athena, and I am getting the Hive cursor error for invalid JSON even though the files in question are valid single-line JSON. Is there a...
2
answers
0
votes
316
views
Mike
asked 3 months ago
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. The job errors out with the following error: ``` Error Category: UNCLASSIFIED_ERROR; An error...
0
answers
0
votes
258
views
asked 3 months ago
I have an AWS Glue workflow that is triggered when a file is dropped into an S3 bucket through an EventBridge rule. Inside this Glue workflow I have set up a Glue trigger to start an ETL job. I have...
1
answers
0
votes
205
views
asked 3 months ago
HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split...
1
answers
0
votes
369
views
MRyan
asked 3 months ago
I am crawling data from S3. The data is stored in CSV format. This is what the directory looks like: S3 Bucket - logs - north_america - year=2024/ - europe -...
1
answers
0
votes
138
views
asked 3 months ago
We need to convert the data from Firehose to Parquet format using Glue, with S3 as the final destination. Access was denied when assuming role. Please ensure that the role...
1
answers
0
votes
290
views
asked 3 months ago