Performance Discrepancy between GPU and CPU for Matrix Multiplication: Eigen vs. ViennaCL
I’m facing a performance issue when performing matrix multiplication operations using the Eigen and ViennaCL libraries in C++. I’m comparing the performance between executing these operations on the integrated GPU of my system and on the CPU.