Relative Content

Tag Archive for pytorchpytorch-lightninglightning

How to run pytorch lightning with multiple GPUS?

When using 2 GPUs on a single node, or multiple nodes on multiple nodes the training does not start while the job keeps running. I use a container (Apptainer) to deploy the environment and then submit the script to SLURM. The job starts but then stalls.