AWS Glue Studio Visual Editor Data Preview changing schema data types incorrectly

0

We have a file that we used the default XML crawler to crawl the data for, and it correctly created a table and schema for the data (relevant column shown): Correct schema

However when creating a job in the Visual ETL tool it changes the datatype from bigint to choice in the data preview session, and shows the data as mostly longs, with a couple of int values: Choice data

Attempting to do a change schema node doesn't allow me to change the choice value: No change schema option

Trying to do any data transform (such as flatten) results in an error: Error

Things I've tried to do to correct the issue:

  1. Change the schema type in the table from bigint to int.
  2. Use the resolveChoice function
  3. Restarting data preview sessions
  4. Recreating the job / data catalog source node
  5. Recrawl the data

Also, if i don't use the data preview session, but just run the job, it runs successfully and outputs the data as expected, so it appears to just be a problem with the data preview (however this renders data preview useless).

Is there any way to resolve this issue? We've had it show up not just in this set of data, but multiple sets of data over the last few days, and we hadn't seen it before last friday (Jan 19, 2024).

jeff
asked 3 months ago126 views
No Answers

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions