Questions tagged with AWS Glue
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. The job fails with the following error:
```
Error Category: UNCLASSIFIED_ERROR; An error...
```
0 answers, 0 votes, 23 views, asked 17 hours ago
I have an AWS Glue workflow that is triggered through an EventBridge rule when a file is dropped into an S3 bucket. Inside this Glue workflow I have set up a Glue trigger to start an ETL job. I have...
1 answers, 0 votes, 66 views, asked a day ago
HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split...
1 answers, 0 votes, 112 views, asked 2 days ago
I am crawling data from S3. The data is stored in CSV format. This is what the directory looks like:
S3 Bucket
- logs
- north_america
- year=2024/
- europe
-...
1 answers, 0 votes, 50 views, asked 2 days ago
We need to convert the data from Firehose to Parquet format using Glue, with S3 as the final destination.
Access was denied when assuming role. Please ensure that the role...
1 answers, 0 votes, 77 views, asked 2 days ago
Hi
I am experimenting with a task about the "medallion pattern".
I have three folders in one S3 bucket:
raw
silver
gold
and two Glue jobs:
- raw_to_silver which copies a couple of files from raw to...
0 answers, 0 votes, 79 views, asked 2 days ago
I keep getting this error: Error Category: UNCLASSIFIED_ERROR; An error occurred while calling o106.getDynamicFrame. Invalid connection string
But the connection string looks fine; it looks identical to...
1 answers, 0 votes, 150 views, asked 5 days ago
I have CSV files stored in S3. The files are named as follows: {region name}_{today's date}.csv. There are multiple regions. These files are saved under the 'log/year/month/date' directory. So this...
1 answers, 0 votes, 87 views, asked 6 days ago
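For readers following the naming scheme in the question above, here is a minimal sketch of how such keys could be built and split apart again. It is an illustration only: the function names `build_log_key` and `region_from_key` are hypothetical, and the ISO date format and zero-padded path segments are assumptions, since the question does not say how the date is written.

```python
from datetime import date


def build_log_key(region: str, day: date) -> str:
    """Build a key shaped like log/year/month/date/{region}_{date}.csv.

    Assumption: the date is ISO formatted (YYYY-MM-DD) and the path
    segments are zero-padded; the question does not specify either.
    """
    file_name = f"{region}_{day.isoformat()}.csv"
    return f"log/{day.year:04d}/{day.month:02d}/{day.day:02d}/{file_name}"


def region_from_key(key: str) -> str:
    """Recover the region name from a key produced by build_log_key."""
    file_name = key.rsplit("/", 1)[-1]
    stem = file_name[: -len(".csv")] if file_name.endswith(".csv") else file_name
    # Split on the last underscore so regions such as "north_america" survive.
    return stem.rsplit("_", 1)[0]


key = build_log_key("north_america", date(2024, 5, 17))
print(key)
print(region_from_key(key))
```

Splitting on the last underscore is a deliberate choice here, because region names may themselves contain underscores.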
I have Excel workbooks with multiple sheets stored in S3. Currently, I have a separate CSV file for each sheet and crawl each CSV file. Instead of doing this, I would like to crawl from...
1 answers, 0 votes, 83 views, asked 6 days ago
Is there a way for the crawler to generate multiple metadata tables? I have multiple files that contain region names. I want to generate a separate metadata table for each region. Is there a way a crawler can...
1 answers, 0 votes, 102 views, asked 7 days ago
I am keeping track of some data on S3, organized by date. The files are stored in directories like this: year=yyyy/month=mm/day=dd, and inside each directory there are multiple CSV files. I want a...
1 answers, 0 votes, 111 views, asked 7 days ago
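The year=yyyy/month=mm/day=dd layout in the question above follows the Hive-style name=value partition convention. As a rough sketch of how those values could be read back out of an object key in plain Python (the helper name `parse_partitions` and the example key are hypothetical, not taken from the question):

```python
from typing import Dict


def parse_partitions(key: str) -> Dict[str, str]:
    """Collect name=value path segments from an S3 object key."""
    partitions: Dict[str, str] = {}
    for segment in key.split("/"):
        name, sep, value = segment.partition("=")
        # Keep only well-formed segments such as "year=2024".
        if sep and name and value:
            partitions[name] = value
    return partitions


# Hypothetical key used purely for illustration.
example_key = "year=2024/month=05/day=17/data.csv"
print(parse_partitions(example_key))
```

Segments without an equals sign, such as the file name, are ignored, so the same helper works for keys with extra prefix folders.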
I'm trying to run a Python script in AWS Glue that uses athena.get_query_runtime_statistics. When I run it on my local machine the script works, but running it in Glue returns this error
0 answers, 0 votes, 108 views, asked 9 days ago