Run parallel function on Apple GPU/metal in Python
I have a rather simple function that runs quite fast in parallel using Numba
, and I would like to know if I can run it on my Apple M3 Max GPU. However I have never worked on GPU code before (coming from Macs), so I am a little lost..