SLURM torch distributed: The client socket cannot be initialized to connect to [localhost]:29500 (errno: 97 – Address family not supported by protocol)
I am new to PyTorch distributed, and any input will help. My code works on a single GPU, and I am trying to make it distributed, but I get a socket connect error. Below is the code (I have left out the parts that are probably unrelated to the issue). I suppose it's a socket error.