Custom vector class with CUDA backend
I’ve been learning CUDA on my own, and one project I was trying to do is implementing a linear algebra library. I created a mathematical vector class in C++ that computes the dot product via a GPU. I’m running into issues when it comes to freeing memory on the device via my class destructor.