Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
![ARCHITECTURE OVERVIEW](/media/postImages/original/IMkwpNhJtDTW2rxZ-wu0oG2g)
a device that sends sensor data to RDS and then to an S3 bucket via LTE and WiFi. The sensor data is stored in the S3...
Hi Team,
I am unable to create a Glue Crawler despite several attempts; it is showing the same issue. Below are the policies I attached for my Glue...
![Screenshot of Error](/media/postImages/original/IMmOli5QYNQ7ODktWoA70CZA)
I've been trying to work around this for hours now. I've experimented with all different kinds of permissions but wasn't...
I need to create a POC to implement an ETL process with SAS tables from a Glue job without business rules with 7 source tables adding another Job with business rules in each table and after the ETL...
Hi all,
I'm relatively new to Glue, but I've got a Python ETL script that I've built that works pretty well. It reads two CSV files into dataframes and then unions them together into one normalized...
I have an AWS Glue connection pointing at an external Kafka Cluster. I have a table declared within AWS Glue pointing at a topic on my Kafka Cluster. It references the AWS Glue connection.
Within...
Hey everyone I am trying to query a set of JSON files in S3 with Athena and I am getting the Hive cursor error for invalid JSON even though the files in question are valid single-line JSON. Is there a...
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. the job errors out with the following error:
```
Error Category: UNCLASSIFIED_ERROR; An error...
I have a AWS Glue workflow which is triggered when a file gets dropped into a s3 bucket thru evenbridge rule . Inside this Glue workflow I have setup a Glue trigger to trigger a ETL job. I have...
HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split...
I am crawling data from S3. The data are stored in CSV form. This is how the directory looks like:
S3 Bucket
- logs
- north_america
- year=2024/
- europe
-...
We would need to transfer the data from the firehose to parquet format using Glue and the final destination is to store in S3.
Access was denied when assuming role. Please ensure that the role...