Unanswered Questions tagged with Extract Transform & Load Data
Hello,
I'm writing a custom transform where I want to use mode from pyspark.sql.functions, but I get the same issue whether I import * or import the specific function. How can I resolve...
0 answers, 0 votes, 87 views, asked 8 months ago
Hello, we have an S3 bucket with various CSV files, an AWS Glue crawler to update the Data Catalog, and an AWS Glue job to move the data to Redshift. The handling of data and target table is...
0 answers, 0 votes, 124 views, asked 9 months ago
Hello everyone.
JSON data from a REST API is loaded daily by Lambda into s3-bucket-1.
This data should then be stored in s3-bucket-2 as a flat Parquet table.
I did it in...
0 answers, 0 votes, 71 views, asked 9 months ago
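For readers with the same problem, the flattening step in the question above (nested JSON to a flat table) can be sketched with the standard library alone. This is only an illustration under assumptions: the helper name `flatten_record`, the `_` separator, and the sample record are hypothetical and do not come from the question, and writing the result to Parquet is left out.

```python
import json


def flatten_record(record, parent_key="", sep="_"):
    """Flatten nested dicts into one level, joining key paths with `sep`."""
    flat = {}
    for key, value in record.items():
        name = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            # Recurse into nested objects and merge their flattened keys.
            flat.update(flatten_record(value, name, sep))
        elif isinstance(value, list):
            # Assumption: lists are kept as JSON strings rather than exploded into rows.
            flat[name] = json.dumps(value)
        else:
            flat[name] = value
    return flat


# Hypothetical API payload, used only to show the shape of the output.
sample = {"id": 1, "user": {"name": "a", "address": {"city": "x"}}, "tags": ["p", "q"]}
flat = flatten_record(sample)
```

Each flattened dict has only scalar values, so a list of them maps directly onto the columns of a flat table.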
Are there any known, recently (~07/18/2023) introduced performance issues with Glue crawlers?
We have recently observed excessive slowness with Glue crawlers that had been running for months without...
0 answers, 0 votes, 28 views, asked 9 months ago
I read data from S3 as follows.
```
sec_id_dyf = glueContext.create_dynamic_frame.from_options(
    connection_type="s3",
    ...
```
0 answers, 0 votes, 96 views, asked 9 months ago
Hello,
I am experiencing an issue when trying to use Glue ETL on one of the tables in my Data Catalog. I am using the visual tool with a very simple SQL transformation on the table and when clicking...
0 answers, 0 votes, 89 views, asked 10 months ago
Hi everyone, I was trying to ingest CSV data into a Timestream database with the AWS SDK (boto3). My credentials, region, Database_Name, and Table_Name are all correct, but I am still unable to connect to the endpoint of my...
0 answers, 0 votes, 78 views, asked 10 months ago
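As a rough illustration of the ingestion step in the question above, the sketch below turns CSV rows into the record dictionaries that the Timestream `WriteRecords` call expects and splits them into batches of at most 100, the per-request limit. It does not touch the endpoint problem the question asks about. The column names, the sample CSV, and the helper names `rows_to_records` and `batches` are hypothetical. It also assumes the time column already holds epoch milliseconds and that the measure is numeric.

```python
import csv
import io


def rows_to_records(csv_text, time_col, dimension_cols, measure_col):
    """Convert CSV rows into Timestream-style record dicts (all values as strings)."""
    records = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        records.append({
            "Dimensions": [{"Name": col, "Value": row[col]} for col in dimension_cols],
            "MeasureName": measure_col,
            "MeasureValue": row[measure_col],
            "MeasureValueType": "DOUBLE",   # assumption: the measure is numeric
            "Time": row[time_col],          # assumption: epoch milliseconds as text
            "TimeUnit": "MILLISECONDS",
        })
    return records


def batches(records, size=100):
    """Yield slices of at most `size` records, matching the per-request limit."""
    for start in range(0, len(records), size):
        yield records[start:start + size]


# Hypothetical CSV content, used only to show the record shape.
sample_csv = "ts,host,cpu\n1690000000000,web-1,12.5\n1690000001000,web-2,40.0\n"
records = rows_to_records(sample_csv, "ts", ["host"], "cpu")
```

Each batch could then be passed as `Records=batch` to `write_records` on a `boto3.client("timestream-write", region_name=...)` client, along with `DatabaseName` and `TableName`.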
I am using Data Quality to evaluate the dataset and I am routing the failed rules and failed records. But when I check the S3 bucket for the failed records, I am seeing empty files along with the...
0 answers, 0 votes, 113 views, asked a year ago
py4j.protocol.Py4JJavaError: An error occurred while calling o492.pyWriteDynamicFrame.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task 323 in stage 2.0 failed 4 times, most...
0 answers, 0 votes, 74 views, asked a year ago
Hi Experts,
I am new to AWS.
I now plan to move SharePoint data into MySQL (RDS) using an ETL tool.
Please suggest which tool is best.
Thanks
Shanvitha
0 answers, 0 votes, 33 views, asked a year ago
I have 2 streams and I am using an analytics Zeppelin notebook to merge them, but in the future I will be using more streams. I want to know if it's possible to scale everything so it...
0 answers, 0 votes, 66 views, asked a year ago
At present, Glue supports the Iceberg framework, but I need the MERGE INTO syntax. I configured it according to the article on the official website, but I cannot get it to work. There is always an error of...
0 answers, 0 votes, 73 views, asked a year ago