Unanswered Questions tagged with Extract Transform & Load Data

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

AWS glue read csv file with two header rows, ignore the first one

Hello, Currently I am trying to read csv files in my s3 bucket with the following format: ``` header 1, header 2, header 3 value 1, value 2, value 3 header 1, header 2, header 3, header 4 value 1,...

AWS Glue Extract Transform & Load Data

answers

votes

133

views

WindHAWS

asked 2 years ago

How To Get Bad Records Using AWS Pydeequ - Data Quality Checks

Using AWS Pydeequ in databricks I am performing Data Quality checks. When I run this below mentioned code it provide only metrics results as my output (like Check_level, check_status, constraint,...

Analytics AWS Data Pipeline AWS Lambda AWS Glue DataBrew Extract Transform & Load Data

answers

votes

162

views

Gowtham Siddarth Jagadeesan

asked 2 years ago

AWS Glue - Map against a template

Hello, I am trying to use Glue to take an input file, do my required transformations, then output the columns in a specific order. I also want to output columns that may not be present in the input...

AWS Glue Extract Transform & Load Data

answers

votes

views

ives

asked 2 years ago

AWS parameterized Glue Concurrent jobs using step functions with enabled bookmarks - throws Version mismatch exception

I have a parameterized glue job , that will be called in parallel (25 glue job) through step functions, when bookmark enabled , version mismatch exception is thrown, when disabled, it runs fine. ....

AWS Step Functions AWS Glue Extract Transform & Load Data

answers

votes

149

views

rePost-User-2022-sim

asked 2 years ago

Generating Parquet files from Glue Data Catalog

I have a glue job that write to a Data Catalog. In the Data Catalog I originally set it up as CSV, and all works fine. Now I would like to try to use Parquet for the Data Catalog. I thought I would...

Database AWS Glue Extract Transform & Load Data

answers

votes

131

views

bfeeny

asked 2 years ago

MSCK REPAIR TABLE behaves differently when executed via Spark Context vs Athena Console/boto3

I have a Glue ETL job which creates partitions during the job ``` additionalOptions = {"enableUpdateCatalog": True, "updateBehavior": "LOG"} additionalOptions["partitionKeys"] = ["year",...

Amazon Athena AWS Glue Extract Transform & Load Data

answers

votes

113

views

bfeeny

asked 2 years ago

Decimal Precision issues when writing from DynamicFrame

When developing some Glue scripts from a successful Crawler run from a JDBC Oracle data source, I am encountering an error that I cannot resolve. ``` An error occurred while calling...

AWS Glue Data Lakes Extract Transform & Load Data

answers

votes

128

views

gearasdan

asked 2 years ago

Data Catalog schema table getting modified when I run my Glue ETL job

I created a Data Catalog with a table that I manually defined. I run my ETL job and all works well. I added partitions to both the table in the Data Catalog, as well as the ETL job. it creates the...

Amazon Athena AWS Glue Extract Transform & Load Data

answers

votes

269

views

bfeeny

asked 2 years ago

How to make changes to qualifications?

Hi! I am doing a longitudinal study and am trying to prevent previous workers from taking the second part of my survey. In order to do this, I have been following instructions on the MTurk blog...

Extract Transform & Load Data Amazon Mechanical Turk

answers

votes

views

AWS-User-7634150

asked 2 years ago

StreamingQueryException: Error while List shards

I have a Kinesis data Stream whose records I want to insert it in the AWS redshift with using AWS Glue.I created crawlers to bring source table and target table .They are working fine with . The code...

Extract Transform & Load Data Amazon Redshift Amazon Kinesis

answers

votes

124

views

AWS-User-8027014

asked 2 years ago

Data transformation not taken into account in AWS Glue

I have a S3 bucket with folders in which we have files. I want to make a database to be able to query these documents on a few keys with an API based on Lambda. But for that I need to normalize the...

AWS Glue Extract Transform & Load Data

answers

votes

views

AWS-alphacharlie

asked 2 years ago

Glue table not showing in console

A crawler reported it created a table but the table is not visible in the Glue console under tables. * I can see the table in Athena and when I query it data is returned as expected * When I use the...

Amazon Athena Database AWS Glue Extract Transform & Load Data

answers

votes

views

sm-1234

asked 2 years ago

1
•••
4
5
6
7
8
12 / page