Glue job keeps running but does not write results

0

Running a glue job to fetch records from Microsoft sql server but glue jobs keeps running and does not show any results. Job is scheduled with G.2X worker with 5 works with auto scheduling.

Logs:- 23/02/27 09:02:45 WARN TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources

2 Risposte
2

Can check the worker nodes to ensure that they have enough resources to run the job. The warning message in the logs suggests that the worker nodes may not have enough resources to execute the job.

con risposta un anno fa
0

The driver logs should tell you bit what the driver is doing, the warning could not be the cause.
It would be good to enable SparkUI logs and view then in a History Server to check what the driver is doing, has the driver started a Spark job/stage and doesn't get resources?

The most likely cause is that you have multiple stages in the job and the first stage is reading the data with a single task, SparkUI will tell you that. Also check this: https://docs.aws.amazon.com/glue/latest/dg/run-jdbc-parallel-read-job.html

profile pictureAWS
ESPERTO
con risposta un anno fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande