Athena time travel queries with Apache Hudi tables

0

Hi,

Is there a way to perform time travel with Athena queries on Apache Hudi tables in a way similar to the one described here Implement a cdc based upsert in a data lake using Apache Iceberg and AWS Glue?
Or the only way is through spark.sql library at the moment?
Thanks

질문됨 일 년 전790회 조회
2개 답변
1
수락된 답변

Hello,

I understand that you would want to know if Athena supports time travel queries with Hudi tables. Unfortunately Athena does not currently support time travel queries for Hudi. There is an existing feature request for the same. We may not be able to provide an ETA on when this will be available. I would suggest to keep an eye on our News page/blog for future announcements. https://aws.amazon.com/new/ https://aws.amazon.com/blogs/aws/

Thank you.

AWS
지원 엔지니어
답변함 일 년 전
AWS
전문가
검토됨 9달 전
  • Perfect, thanks for confirming!

0

Are you referring to the MERGE INTO statement that is mentioned in the link? If I understood, your question is - Could the steps mentioned in the Glue job be performed in Athena? Answer is that they cannot be performed directly in Athena as they need a connector, Specific Spark configurations to be set before the spark.sql execution of the MERGE INTO statement.

profile pictureAWS
답변함 일 년 전
  • Hi @ananthtm,

    Thanks for your answer.

    I think I am asking something slightly different. I am not asking if the steps mentioned in the Glue job can be performed in Athena, I am asking if it is possible to write the Hudi version of the following Athena query (which is allowed on Iceberg tables)?

    SELECT * FROM iceberg_table FOR SYSTEM_TIME AS OF TIMESTAMP '2022-11-29 18:14:00'

    Thanks!

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠