Comparing performance among cuda_runtime, cublas and cutensor
I’ve made the following CUDA tests to compare the performance numbers of matrix multiplication, running on Ubuntu 24.04 with the GPU Quadro T1000 Mobile of compute compatibility 7.5.
I’ve made the following CUDA tests to compare the performance numbers of matrix multiplication, running on Ubuntu 24.04 with the GPU Quadro T1000 Mobile of compute compatibility 7.5.