Unanswered Questions tagged with AWS Glue
Hello,
I'm writing a custom transform where I want to use mode within pyspark.sql.functions but I get the same issue irrespective of whether I use * or import the specific module. How can I resolve...
0 answers, 0 votes, 92 views, asked 10 months ago
Hello All,
I'm doing an AWS workshop (Data Engineering Immersion Day). As part of the workshop, I have to create a SQL application (legacy), but AWS has now deprecated the feature for creating new applications. ...
0 answers, 0 votes, 121 views, asked 10 months ago
I am trying to use the DynamoDB export to S3, but when I read the data in Glue, the value of an entire column comes back as null.
I have tried the same thing multiple times. And I ran a...
0 answers, 0 votes, 102 views, asked 10 months ago
[CDK] Create a Glue Trigger that triggers a Glue Crawler after a Glue Job has finished successfully.
I want to build a Glue Trigger that triggers a Glue Crawler after a Glue Job has finished successfully.
I looked at CfnTrigger and wrote code for it.
After running cdk deploy and finishing the Glue Job...
0 answers, 0 votes, 112 views, asked 10 months ago
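For the trigger question above, here is a minimal sketch of what the properties of a conditional trigger might look like. It only builds the CloudFormation-style property dictionary for an `AWS::Glue::Trigger`; the job name `my-etl-job`, the crawler name `my-crawler`, and the helper `build_trigger_properties` are hypothetical and not taken from the question. The poster's actual code and the behaviour they saw after deployment are cut off, so this is not a diagnosis, just one shape the configuration could take.

```python
import json


def build_trigger_properties(job_name: str, crawler_name: str) -> dict:
    """Build properties for an AWS::Glue::Trigger that starts a crawler
    after the given job has reached the SUCCEEDED state.

    The helper name and its arguments are illustrative assumptions.
    """
    return {
        # CONDITIONAL triggers fire when their predicate is satisfied.
        "Type": "CONDITIONAL",
        # Ask for the trigger to be activated when it is created.
        "StartOnCreation": True,
        "Predicate": {
            "Conditions": [
                {
                    "LogicalOperator": "EQUALS",
                    "JobName": job_name,
                    "State": "SUCCEEDED",
                }
            ]
        },
        # The action starts the crawler instead of another job.
        "Actions": [{"CrawlerName": crawler_name}],
    }


# Hypothetical resource names, for illustration only.
props = build_trigger_properties("my-etl-job", "my-crawler")
print(json.dumps(props, indent=2))
```

In the Python CDK, the same fields would presumably be passed to `CfnTrigger` in snake_case (`type`, `start_on_creation`, `predicate`, `actions`). If a deployed conditional trigger never fires, it may be worth checking whether it was actually activated.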
I'm using Athena on an S3 bucket that's been crawled and get the error:
class org.apache.parquet.io.GroupColumnIO cannot be cast to class org.apache.parquet.io.PrimitiveColumnIO
I've narrowed down the...
0 answers, 0 votes, 83 views, asked 10 months ago
Hi Team,
I am trying to archive MongoDB data to S3 in Parquet format, so I have created a Spark script for that. When I execute the Spark script, I get the error below. How do I resolve this...
0 answers, 0 votes, 125 views, asked 10 months ago
Hi all,
I've noticed some limitations while using Glue Workflows that I'd like to raise, or possibly hear whether there are alternatives.
1) Suppose you have job C depending on both jobs A and B...
0 answers, 0 votes, 225 views, asked 10 months ago
I have a Glue job that reads from a raw bucket and writes to a discovery bucket. In this process I’m facing an error: the job is not able to process the files present in the raw bucket location (from logs...
0 answers, 0 votes, 77 views, asked 10 months ago
I'm trying to update an existing AWS Glue Crawler for a DocumentDB instance. Given that it won't take a wildcard to add all the collections to the crawler, I'm looking for an easy way to add several...
0 answers, 0 votes, 42 views, asked 10 months ago
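For the DocumentDB crawler question above, one possible approach is to generate the crawler's target list from a list of collection names instead of adding each one by hand. The sketch below only builds the `MongoDBTargets` structure. The connection name `docdb-connection`, the database `mydb`, the collection names, and the helper `build_mongodb_targets` are all hypothetical. The `database/collection` path format is an assumption about how the crawler is configured.

```python
def build_mongodb_targets(connection_name, database, collections, scan_all=True):
    """Return one crawler target per collection.

    Each entry uses the ConnectionName / Path / ScanAll fields of a
    Glue MongoDB target. The helper itself is illustrative only.
    """
    return [
        {
            "ConnectionName": connection_name,
            "Path": f"{database}/{name}",
            "ScanAll": scan_all,
        }
        for name in collections
    ]


# Hypothetical names, for illustration only.
targets = build_mongodb_targets(
    "docdb-connection", "mydb", ["orders", "customers", "invoices"]
)
for target in targets:
    print(target["Path"])

# Applying the list would need AWS credentials and an existing crawler,
# so the call is left commented out here:
# import boto3
# boto3.client("glue").update_crawler(
#     Name="my-crawler", Targets={"MongoDBTargets": targets}
# )
```

The list passed to `update_crawler` is assumed to be the full set of targets the crawler should have, so any collections already configured would need to be included as well.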
Hello, we have an S3 bucket with various CSV files, an AWS Glue crawler to update the Data Catalog, and finally an AWS Glue job to move the data to Redshift. The handling of data and target table is...
0 answers, 0 votes, 132 views, asked 10 months ago
I've got Athena set up to query a DocumentDB instance with the Lambda function built and AWS Glue configured. The setup was done through the data source connector for DocumentDB.
I can see the database...
0 answers, 0 votes, 191 views, asked 10 months ago
Hello everyone.
JSON data from a REST API is loaded daily by Lambda into s3-bucket-1.
Then this data should be stored in s3-bucket-2 in the form of a flat parquet table.
I did it in...
0 answers, 0 votes, 82 views, asked 10 months ago