1 Antwort
- Neueste
- Die meisten Stimmen
- Die meisten Kommentare
0
I managed to fix the issues.
The first one with JSON is because I was creating the table with Spark and it's using a different SerDe by default.
I tried using this one instead and it works:
CREATE EXTERNAL TABLE `test`.`product_created` (`data` STRUCT<`id`: STRING, `type`: STRING>, `meta` STRUCT<`created_at`: BIGINT>)
ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe' LOCATION 's3a://my_bucket/product_created'
PARTITIONED BY (`year` INT, `month` INT, `day` INT, `hour` INT)
More info: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-glue-data-catalog-hive.html
For the second error with Parquet files, it's strange, I just recreated the table in Hive with the same query and gave it a different name and it works well now.
beantwortet vor 4 Jahren
Relevanter Inhalt
- AWS OFFICIALAktualisiert vor einem Jahr
- AWS OFFICIALAktualisiert vor einem Jahr
- AWS OFFICIALAktualisiert vor 2 Jahren