Questions tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
1938 results
Let's say there is no data skew at the time of the join or the aggregation. In what case could executor memory still result in out-of-memory errors, considering the data is partitioned to 12...
1 answer | 0 votes | 15 views | asked a day ago
I have a Glue job with the G.1X worker type, and in it I'm using PySpark code to read a Parquet file in S3 and write the same data as Parquet and CSV files to different locations.
I need to know how this work...
1 answer | 0 votes | 22 views | asked 2 days ago
We have a DynamoDB table which has an attribute named `CustomAttributes` of [data type](https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/HowItWorks.NamingRulesDataTypes.html#HowItWorks...
1 answer | 0 votes | 21 views | asked 2 days ago
In Glue, I tried to connect to MongoDB Atlas using the Data connections option (where I found a default connector component). I provided all the required details asked for in that connector, then I tried the test ...
2 answers | 0 votes | 27 views | asked 3 days ago
Accessing Apache Iceberg data.
Limited to 200 rows for preview
2 Source nodes load previews
Each branch has a few transformations (Schema, select, etc)
Each branch's previews load up until a Left Part...
1 answer | 0 votes | 26 views | asked 4 days ago
I have a Glue job with PySpark code that reads around 50 million records and writes them to S3. It was running fine, but today I got this error: An error occurred while calling 0351.parq...
1 answer | 0 votes | 12 views | asked 5 days ago
Hi!
I need to extract a lot of data from large Excel files. Right now I'm doing it in my Java Spring Boot backend service, extracting the column names and their lists of related values and then saving them in the data...
1 answer | 0 votes | 42 views | asked 5 days ago
I've created an S3 table bucket named test-s3-table-bucket, with a namespace named backend_reconciliation. In Athena, I find the Datasource is AwsDataCatalog, and the Catalog is s3tablescatalog/test...
Accepted Answer
AWS Glue
2 answers | 0 votes | 47 views | asked 6 days ago
I am attempting to use a crawler to create a table from a CSV file I have uploaded to S3, and it isn't creating it. I have tried to query the CSV file using the S3 object and it doesn't allow me to do tha...
2 answers | 0 votes | 38 views | asked 8 days ago
Hello!
We're trying to migrate from a stand-alone Hive Metastore to Glue. We've modified the definition of some EMR clusters (v7.0.0) to use Glue as the metastore, we use Spark on Hadoop to process da...
2 answers | 0 votes | 39 views | asked 9 days ago
Hi there,
I have a Glue Crawler that is failing with an `Internal Service Exception` and no other error logs.
## Some details:
- Crawler source: JDBC connection to a SQL Server Database with the wil...
1 answer | 0 votes | 27 views | asked 10 days ago
We have an S3 bucket partitioned as:
S3://ID=123/date=2025-06-03/data.parquet
In each date partition there is only one data.parquet file. There are a few hundred different IDs.
For each ID w...
1 answer | 0 votes | 20 views | asked 12 days ago