How to calculate the speed of different cores in the NVIDIA Ampere architecture?
Since the NVIDIA Ampere architecture has different numbers of INT, FLOAT, and DOUBLE cores, I want to see this difference reflected when I run computations.
How can I calculate the difference in speed between the different core types in the NVIDIA Ampere architecture?
So I am trying to test how fast kernels execute on the different cores. Here is the script: