Stuck at training cell in Notebook Instance in SageMaker


Hello, I have started running a command to train a model using Ultralytics YOLOv8.2.4. Most of the prerequisites should already be installed. However, whenever I run the cell, it gets stuck at the following output:

Starting training for 100 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size
  0%|          | 0/143 [00:00<?, ?it/s]

After that, it stays there for hours with nothing changing.

On one occasion I stopped the notebook and tried to restart it, and I got the following error message:

IOStream.flush timed out

Does anyone know what the issue is? My Jupyter notebook instance is currently running on ml.t3.medium.

No answers
