athena query issue with apache hudi

0

recently i got this error in athena Permission denied on S3 path: s3://{S3_BUCKET}/{S3_PREFIX}/.hoodie where the location path that set in glue s3://{S3_BUCKET}/{S3_PREFIX}/{DB_NAME}/{TABLE_NAME}/

what i want to ask is why athena searching ".hoodie" directory under {S3_PREFIX}, where it should be inside the {TABLE_NAME} directory? checked in the {S3_PREFIX} directory there no file/directory name ".hoodie" after the {S3_PREFIX} and also check in emr-spark, with emr-spark i able to retrieve the data without getting permission denied issue.

noel
已提问 1 年前449 查看次数
1 回答
0

Athena Uses Presto for DML queries and to query Hudi tables metadata for the hudi table is required which is store under ".hoodie/" folder inside the table location. So suppose if the hudi table is built upon S3 location "s3://mubucket/hudi/" then after creating a hudi table in this location a ".hoodie/" folder will be created.

Here in you case if the permission denied error is coming up for a different location other than table location if I understood it correct then, This can be due miss-configurations of the table. Hence can you try recreating another small sample hudi table and then run Athena queries over the same. Also check whether you have proper permissions over the bucket or not when running Athena queries.

AWS
支持工程师
Shubh
已回答 1 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则