"Unable to parse file" error in AWS Glue


We are trying to read and process a CSV file with AWS Glue, and we get the following error message:

Py4JJavaError: An error occurred while calling o91.schema. : org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 3.0 failed 4 times, most recent failure: Lost task 0.3 in stage 3.0 (TID 8) executor 3): com.amazonaws.services.glue.util.FatalException: Unable to parse file:

We have already verified the encoding: it is UTF-8, so that should not be the problem. When we read the CSV using the from_options function, it was able to show the data, but printSchema() raised an error. We could not read the data using the from_catalog function.
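A minimal sketch of how such an encoding check might be done locally in Python, assuming the file is small enough to read into memory. The helper name find_invalid_utf8 is hypothetical and is not part of the original Glue job; it only reports where strict UTF-8 decoding first fails.

```python
def find_invalid_utf8(data: bytes):
    """Return (byte_offset, line_number) of the first invalid UTF-8
    sequence in data, or None if the whole buffer decodes cleanly."""
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError as exc:
        # exc.start is the byte offset where decoding failed; count the
        # newlines before it to get a 1-based line number.
        line_number = data.count(b"\n", 0, exc.start) + 1
        return exc.start, line_number
    return None


if __name__ == "__main__":
    # Illustrative in-memory sample; for a real file, read it in binary
    # mode, e.g. open(path, "rb").read().
    sample = b"a,b\n1,\xff\n"
    print(find_invalid_utf8(sample))  # (6, 2)
```

A result of None only shows that the bytes are valid UTF-8. It does not rule out characters that are valid UTF-8 but still confuse a CSV parser.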

We would appreciate any help resolving this.

sravan
Asked 2 months ago, 300 views
1 Answer
Accepted Answer

The issue could be caused by special characters in the file. See this post for a similar issue and its resolution: https://repost.aws/questions/QUe2LFXWv6Rh-GhxJajJKcoQ/aws-glue-jobs-3-0-unable-to-parse-csv-file
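One way to look for such characters before running the job is to scan the file locally. The sketch below is only an illustration under assumptions: the helper name scan_csv_text is hypothetical, the file is assumed to be comma-delimited and already decoded to text, and "special" is taken to mean a byte order mark, control characters other than tab, or rows whose field count differs from the first row. It is not the resolution from the linked post.

```python
import csv
import io
import unicodedata


def scan_csv_text(text, delimiter=","):
    """Return a list of (line_number, problem) tuples for a CSV string."""
    problems = []

    # Pass 1: flag a byte order mark or control characters (tab excluded).
    for line_number, line in enumerate(text.split("\n"), start=1):
        for ch in line.rstrip("\r"):
            if ch == "\ufeff" or (ch != "\t" and unicodedata.category(ch) == "Cc"):
                problems.append((line_number, f"unexpected character U+{ord(ch):04X}"))

    # Pass 2: flag rows whose field count differs from the first row.
    reader = csv.reader(io.StringIO(text, newline=""), delimiter=delimiter)
    expected = None
    try:
        for row in reader:
            if not row:
                continue  # skip blank lines
            if expected is None:
                expected = len(row)
            elif len(row) != expected:
                problems.append(
                    (reader.line_num, f"expected {expected} fields, found {len(row)}")
                )
    except csv.Error as exc:
        problems.append((reader.line_num, f"csv error: {exc}"))
    return problems


if __name__ == "__main__":
    sample = "a,b\n1,\x012\n3,4,5\n"
    for line_number, problem in scan_csv_text(sample):
        print(line_number, problem)
```

Any line it reports is a candidate to inspect or clean before the file is read again in Glue.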

AWS
answered 2 months ago
EXPERT
reviewed 2 months ago
