Stuck at training cell in Notebook Instance in SageMaker

Hello, I am trying to train a model using Ultralytics YOLOv8.2.4. Most of the prerequisites should already be installed. However, whenever I run the cell, it gets stuck at the following:

Starting training for 100 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size
  0%|          | 0/143 [00:00<?, ?it/s]

After that, it stays stuck there for hours with nothing changing.

Once, I stopped the notebook and tried to restart it, and I got the following error message:

IOStream.flush timed out

Does anyone know what the issue is? My Jupyter notebook instance is currently running on ml.t3.medium.
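
One way to narrow this down might be to check what the instance actually offers before training starts. An ml.t3.medium has 2 vCPUs, 4 GiB of memory, and no GPU, so the run may be falling back to CPU or running short of memory while the data loader starts up. The snippet below is only a diagnostic sketch under those assumptions, not a confirmed fix: `describe_host` is a hypothetical helper name, the memory lookup assumes a Linux host, and the CUDA check is skipped if PyTorch cannot be imported.

```python
import os


def describe_host():
    """Return basic resource facts for the current notebook instance.

    Hypothetical helper for illustration; not part of Ultralytics or SageMaker.
    """
    # Number of logical CPUs visible to Python (may be None on some platforms).
    cpu_count = os.cpu_count()

    # Total physical memory in GiB via sysconf; only available on Unix-like hosts.
    try:
        page_size = os.sysconf("SC_PAGE_SIZE")
        phys_pages = os.sysconf("SC_PHYS_PAGES")
        total_mem_gib = round(page_size * phys_pages / (1024 ** 3), 2)
    except (AttributeError, ValueError, OSError):
        total_mem_gib = None

    # CUDA availability as reported by PyTorch; None means PyTorch is not installed.
    try:
        import torch
        cuda_available = torch.cuda.is_available()
    except ImportError:
        cuda_available = None

    return {
        "cpu_count": cpu_count,
        "total_mem_gib": total_mem_gib,
        "cuda_available": cuda_available,
    }


if __name__ == "__main__":
    for key, value in describe_host().items():
        print(f"{key}: {value}")
```

If `cuda_available` comes back `False` and memory is around 4 GiB, it may be worth trying a smaller batch size and fewer data loader workers (for example the `batch` and `workers` training arguments in Ultralytics), or switching to a GPU-backed instance type. Whether that resolves this particular hang is an assumption.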

No answers
