Glue table and Athena: if some files in S3 are compressed and others are not, will anything break?


I have a Glue table that points to a location in S3. I noticed that when Firehose writes data there in Parquet format, the files are not always compressed.

Will this cause issues when the Glue table, a crawler, or Athena accesses the data (a mix of compressed and uncompressed files)?

asked a year ago · 377 views
1 Answer
Accepted Answer

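Parquet files are self-describing: the compression codec is recorded in each file's own metadata, per column chunk, so it is even possible to use different compression for different columns within a single file. A mix of compressed and uncompressed files should therefore work. Since this is not the usual case, though, it is possible that some tool has a bug and assumes a single codec is used everywhere.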

AWS
answered a year ago
