What is the best ETL tool for ongoing loads of data into Redshift?


What would be the best AWS ETL tool that a customer would use to set up ongoing loads of data into Redshift, that would provide some sort of transform functionality, similar to Microsoft SSIS? E.g. “load data from this file into this table daily as a full replace, compute these columns, etc.”

Asked 7 years ago, viewed 329 times
2 answers
Accepted answer

Depending on the customer's anticipated timeframe, this is precisely the use case AWS Glue (https://aws.amazon.com/glue/) is intended for.
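To make the "full replace plus computed columns" example from the question concrete, here is a minimal Python sketch of the SQL that such a daily job might issue against Redshift, whichever tool schedules it. The table names, S3 URI, IAM role ARN and column names are all hypothetical placeholders, and the staging-table approach is just one common pattern, not something Glue prescribes.

```python
from typing import Dict, List


def build_full_replace_sql(
    target: str,
    staging: str,
    s3_uri: str,
    iam_role: str,
    raw_columns: List[str],
    computed_columns: Dict[str, str],
) -> List[str]:
    """Return the SQL statements for one full-replace load, in execution order."""
    # Columns read from the file, followed by the computed expressions.
    select_list = list(raw_columns) + [
        f"{expr} AS {name}" for name, expr in computed_columns.items()
    ]
    insert_columns = list(raw_columns) + list(computed_columns)
    return [
        # Empty the staging table, then bulk-load the file from S3 into it.
        f"DELETE FROM {staging};",
        f"COPY {staging} ({', '.join(raw_columns)}) FROM '{s3_uri}' "
        f"IAM_ROLE '{iam_role}' FORMAT AS CSV IGNOREHEADER 1;",
        # Replace the target's contents inside a single transaction so that
        # readers do not observe a half-loaded table.
        "BEGIN;",
        f"DELETE FROM {target};",
        f"INSERT INTO {target} ({', '.join(insert_columns)}) "
        f"SELECT {', '.join(select_list)} FROM {staging};",
        "COMMIT;",
    ]


# Hypothetical example values; none of these come from the original thread.
statements = build_full_replace_sql(
    target="sales",
    staging="sales_staging",
    s3_uri="s3://example-bucket/daily/sales.csv",
    iam_role="arn:aws:iam::123456789012:role/ExampleRedshiftCopyRole",
    raw_columns=["order_id", "quantity", "unit_price"],
    computed_columns={"total": "quantity * unit_price"},
)

for statement in statements:
    print(statement)
```

The statements could then be run by whatever executes the load (a Glue job, a scheduled script, and so on). `DELETE` is used instead of `TRUNCATE` inside the transaction because `TRUNCATE` commits implicitly in Redshift.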

Alternatively, if your customer can persuade their data source to stream its content (what is their data source, by the way?), Kinesis may be a good fit, since it can trigger Lambda functions for the "TL" piece. See, for example, https://aws.amazon.com/blogs/big-data/tag/aws-lambda/.
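As a rough illustration of the Kinesis-to-Lambda route, the sketch below shows a Python Lambda handler that decodes the base64-encoded records in a Kinesis event and applies a transform. The `transform` function, the field names and the sample payload are hypothetical, and the load step into Redshift is deliberately left out.

```python
import base64
import json
from typing import Any, Dict, List


def transform(row: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical transform: add a computed 'total' column."""
    return {**row, "total": row["quantity"] * row["unit_price"]}


def handler(event: Dict[str, Any], context: Any) -> List[Dict[str, Any]]:
    """Decode each Kinesis record in the event and transform it."""
    rows = []
    for record in event["Records"]:
        # Kinesis delivers each record's payload base64-encoded.
        payload = base64.b64decode(record["kinesis"]["data"])
        rows.append(transform(json.loads(payload)))
    # The load into Redshift is omitted here; it is an assumption that the
    # rows would be staged (for example in S3) and loaded from there.
    return rows


# Local smoke test with a hand-built event containing one JSON record.
sample_row = {"order_id": 1, "quantity": 3, "unit_price": 2.5}
sample_event = {
    "Records": [
        {
            "kinesis": {
                "data": base64.b64encode(
                    json.dumps(sample_row).encode("utf-8")
                ).decode("ascii")
            }
        }
    ]
}
result = handler(sample_event, None)
print(result)
```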

AWS
Expert
Dave_W
Answered 7 years ago

Is it possible to run an advanced SQL query in a Glue job? My SQL involves at least 15 tables, and the query itself is quite complex. Doesn't Glue only work with a small number of tables (e.g. 1-3) and simple conditions? Is there an option to run my own query, without building it from boxes in the job?

Answered 2 years ago
