Kill yarn jobs that not running automatically

1

I want to know if the option exist in EMR or automated way to kill the spark job that says initial job not accepted any resources. Some of my cluster is transient that run off batch hour without progressing stuck in accepted state.

Thanks in advance

Mark
質問済み 7ヶ月前285ビュー
1回答
3
承認された回答

Hello,

There is no option to kill the application automatically when initial job has not accepted any resources. Because this is not an error on yarn side but the job is waiting for resource allocation and it will run after getting resources to launch executors.

So you may consider to create a script that periodically polls application logs and kill the application if there are logs "Initial job has not accepted any resources. Check your cluster UI to ensure that workers are registered and have sufficient resources" and kill it if the progress keeps 0% for the duration you specified.

Found this external reference that might help to achieve this. You can set to run the script on the primary node configuring bootstrap action when creating the cluster. Hope this answered your query.

AWS
サポートエンジニア
回答済み 7ヶ月前
profile picture
エキスパート
レビュー済み 7ヶ月前
profile pictureAWS
エキスパート
レビュー済み 7ヶ月前
  • Thank you for the detailed information. I shared a feedback for this as feature request. 👏

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ