Getting HIVE_CURSOR_ERROR: Failed to read Parquet file: s3://<my_bucket>/<my_directory>/output.parquet
I have created an external table on Parquet files generated by Pandas & Pyarrow library in Python.
If I execute SELECT count(*) FROM <table> it's giving me correct output
But
If I execute SELECT * FROM <table> It's throwing the above exception
What's going wrong ?
The Athena table structure is below .
CREATE EXTERNAL TABLE mmx_india_parquet_edr_part
(
uid
string COMMENT 'from deserializer',
customerid
string COMMENT 'from deserializer',
productid
string COMMENT 'from deserializer',
edrmode
string COMMENT 'from deserializer',
destinationid
string COMMENT 'from deserializer',
protocolid
string COMMENT 'from deserializer',
host
string COMMENT 'from deserializer',
servicetype
string COMMENT 'from deserializer',
t1
bigint COMMENT 'from deserializer',
t2
bigint COMMENT 'from deserializer',
status
string COMMENT 'from deserializer',
ruleid
string COMMENT 'from deserializer',
supplierid
string COMMENT 'from deserializer',
commandstatus
string COMMENT 'from deserializer',
httpstatus
string COMMENT 'from deserializer',
messageid
string COMMENT 'from deserializer',
fragmented
string COMMENT 'from deserializer',
fragnumber
string COMMENT 'from deserializer',
fragtotal
string COMMENT 'from deserializer',
messagestate
string COMMENT 'from deserializer')
PARTITIONED BY (
traffic_date
date)
STORED AS PARQUET
LOCATION
's3://<my_bucket>/<my_directory>/output.parquet'