pytorch matrix multiplication accuracy depends on tensor size
I have the following code where I multiply tensor X
by a matrix C
. Depending on the size of X
and whether C
is attached to the computation graph, I get different results when I compare batched multiplication vs looping over each slice of X
.