Unable to Parse file Error in AWS Glue

0

We are trying to read a CSV file to process the data using AWS Glue and we are getting an error message as below:

Py4JJavaError: An error occurred while calling o91.schema. : org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 3.0 failed 4 times, most recent failure: Lost task 0.3 in stage 3.0 (TID 8) executor 3): com.amazonaws.services.glue.util.FatalException: Unable to parse file:

We have already verified the codification, it is UTF-8 so there should be no problem. When we read the CSV using from_options function it was able show the data but printSchema() raised an error. We could not read the data using from_catalog function.

Appreciate any help on resolving this

sravan
posta 2 mesi fa302 visualizzazioni
1 Risposta
2
Risposta accettata

Issue could be with special characters in the file. Refer the post - https://repost.aws/questions/QUe2LFXWv6Rh-GhxJajJKcoQ/aws-glue-jobs-3-0-unable-to-parse-csv-file for similar issue and resolution.

AWS
con risposta 2 mesi fa
profile picture
ESPERTO
verificato 2 mesi fa
profile picture
ESPERTO
verificato 2 mesi fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande