AWS Glue Studio Visual Editor Data Preview changing schema data types incorrectly

We used the default XML crawler to crawl a file, and it correctly created a table and schema for the data (relevant column shown in screenshot: Correct schema).

However, when we create a job in the Visual ETL tool, the data preview session changes the column's data type from bigint to choice and shows the values as mostly long, with a couple of int values (screenshot: Choice data).

Adding a Change Schema node doesn't let me change the choice type (screenshot: No change schema option).

Applying any data transform (such as Flatten) results in an error (screenshot: Error).

Things I've tried to do to correct the issue:

  1. Changing the schema type in the table from bigint to int
  2. Using the resolveChoice function
  3. Restarting data preview sessions
  4. Recreating the job / data catalog source node
  5. Recrawling the data
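
For reference, attempt 2 looked roughly like the sketch below. It is not a fix for the preview problem, only an illustration of the kind of call that was tried. The column name `record_id` and the helper `build_cast_specs` are hypothetical (the real column appears only in the screenshot). The helper just builds the `(field_path, action)` pairs that `DynamicFrame.resolveChoice` accepts in its `specs` argument, so it can be run outside Glue.

```python
from typing import Iterable, List, Tuple


def build_cast_specs(columns: Iterable[str], target_type: str = "long") -> List[Tuple[str, str]]:
    """Build (field_path, action) pairs in the shape resolveChoice's `specs` expects.

    Hypothetical helper for illustration; "cast:<type>" is one of the
    resolveChoice actions (others include make_cols, make_struct, project:<type>).
    """
    return [(col, f"cast:{target_type}") for col in columns]


# "record_id" is a placeholder for the column that shows up as `choice`.
specs = build_cast_specs(["record_id"])
print(specs)  # [('record_id', 'cast:long')]

# Inside a Glue job script, where `source_dyf` is the DynamicFrame read from
# the Data Catalog (assumed name), the specs would be applied like this:
#
#   resolved_dyf = source_dyf.resolveChoice(specs=specs)
```

Casting to long rather than int is a choice made here because the preview shows mostly long values; casting to int instead only requires passing `target_type="int"`.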

Also, if I skip the data preview session and just run the job, it runs successfully and outputs the data as expected. So the problem appears to be limited to the data preview, but it renders the data preview useless.

Is there any way to resolve this issue? It has shown up not just in this data set but in multiple data sets over the last few days, and we hadn't seen it before last Friday (Jan 19, 2024).

jeff
asked 4 months ago, 135 views
No answers
