Can Glue Spark timezone be changed?

0

We have upstream systems that all use Central US time zone, but our pyspark/sparkSQL jobs in Glue is UTC and current_timestamp() is giving UTC time. Can we direct glue to use a different timezone? We tried adding a configuration to SparkConf: ("spark.sql.session.timeZone", "America/Chicago")

We also tried adding --java-options -Duser.timezone="America/Chicago" from the dashboard Run with Parameters feature.

Neither had the effect of updating spark's timezone. Any help here?

gefragt vor einem Jahr967 Aufrufe
1 Antwort
0

A timestamp doesn't have a timezone, by definition is based on UTC. The timezone you are configuring comes into play when you parse a date or you format that timestamp into a string.

If you do a "show()" on a timestamp column, you should see it in the timezone configured, if not maybe it's not correctly configured, notice that properly spark.sql.session.timeZone has to be set for SparkSession, not context.

profile pictureAWS
EXPERTE
beantwortet vor einem Jahr

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen