Unanswered Questions tagged with Extract Transform & Load Data
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I have an iceberg table defined like this:
CREATE TABLE IF NOT EXISTS staging (
id STRING,
staging_timestamp BIGINT,
... blah blah blah ...
)
PARTITIONED BY...
0
answers
0
votes
144
views
asked 19 days agolg...
I have multiple Visual ETL configured correctly, but if go back to the previous screen and then try to see the job again, the display editor will lost the configuration and it will highlight some...
0
answers
0
votes
72
views
asked 2 months agolg...
Scenario:
Source table: Glue Data Catalog table **study** crawled from MySQL with columns:
* id (int),
* code (varchar),
* desc (varchar)
* and 2 other columns not used in the job.
Target table:...
0
answers
0
votes
87
views
asked 4 months agolg...
I'm looking for an open-source solution that can help us make our python API more accessible.
For simplicity's sake, the data is accessed using Athena and has three string fields A, B, C.
Every...
0
answers
0
votes
130
views
asked 5 months agolg...
Hi,
The following CTAS query fails with Col not found error.
```
CREATE table <table_name>
with(
format='PARQUET'
, write_compression='SNAPPY'
, partitioned_by=ARRAY["yearMonth"]
, external_location...
0
answers
0
votes
223
views
asked 5 months agolg...
Hi
I been create Glue Data Connector using its AWS RDS option
and I also create proper IAM role, that have full access to "rds-data", "s3" and "glue"
but whenever I tried to connect (using test...
0
answers
0
votes
113
views
asked 6 months agolg...
Hi,
I am trying to migrate a table from Postgres to Redshift using a migration task
Simplified table structure:
| Name | Type |
| --- | --- |
| id | integer |
| time | timestamp with time zone |
|...
0
answers
0
votes
94
views
asked 6 months agolg...
My Glue 4.0 jobs have suddenly stopped working with error message below. As it is related to boto3, I am unable to make any changes to library config. Pls advise.
NB: I noticed that urllib3 released...
0
answers
0
votes
87
views
asked 6 months agolg...
I was trying to perform Glue ETL transformation and store it in AWS Serverless Redshift database and S3 (both) . However, even the Console generated PySpark sheet fails. Almost none of the methods...
0
answers
0
votes
156
views
asked 6 months agolg...
Hello,
I'm writing a custom transform where I want to use mode within pyspark.sql.functions but I get the same issue irrespective of whether I use * or import the specific module. How can I resolve...
0
answers
0
votes
81
views
asked 7 months agolg...
Hello, we have an S3 bucket with various CSV files and an AWS Glue crawler to update the Data Catalog and finally an AWS Glue job to move the data to RedShift. The handling of data and target table is...
0
answers
0
votes
120
views
asked 7 months agolg...
Hello everyone.
Data from the rest api in the form of JSON is loaded daily by lambda into s3-bucket-1.
Then this data should be stored in s3-bucket-2 in the form of a flat parquet table.
I did it in...
0
answers
0
votes
59
views
asked 8 months agolg...