Better way to synchronize threads

I am trying to optimize a CUDA program. In this program, thread i needs to wait for thread i-1 to store data in shared memory before it can proceed. Is there a better synchronization method than __syncthreads()?
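
To make the pattern concrete, here is a minimal sketch of one way to read the question. It is not the asker's code. It assumes a single block of 32 threads (one warp) and that what thread i takes from thread i-1 is a running sum; neither detail is stated above. The names `chain_syncthreads`, `chain_warp_scan`, and `run_both_and_compare` are made up for illustration. The first kernel is the straightforward version with one `__syncthreads()` per link in the chain; the second replaces the shared-memory hand-off with `__shfl_up_sync`, which may help when the dependency stays inside a warp and the operation is associative.

```cuda
#include <cuda_runtime.h>

constexpr int kThreads = 32;  // assumption: one block holding exactly one warp

// Baseline: thread i reads what thread i-1 stored in shared memory.
// One __syncthreads() per link, so the block runs the chain serially.
__global__ void chain_syncthreads(const int *in, int *out) {
    __shared__ int buf[kThreads];
    const int i = threadIdx.x;
    for (int step = 0; step < kThreads; ++step) {
        if (i == step) {
            buf[i] = in[i] + (i > 0 ? buf[i - 1] : 0);
        }
        __syncthreads();  // outside the branch, so every thread reaches it
    }
    out[i] = buf[i];
}

// Warp-level alternative: inclusive scan through register shuffles.
// Five shuffle steps replace 32 barriers; no shared memory is used.
__global__ void chain_warp_scan(const int *in, int *out) {
    const unsigned full_mask = 0xffffffffu;  // all 32 lanes take part
    const int lane = threadIdx.x;
    int value = in[lane];
    for (int offset = 1; offset < kThreads; offset <<= 1) {
        // Fetch the partial sum held by the lane `offset` positions below.
        const int from_lower = __shfl_up_sync(full_mask, value, offset);
        if (lane >= offset) value += from_lower;
    }
    out[lane] = value;
}

// Hypothetical host helper: runs both kernels on the same input and
// checks each result against a running sum computed on the CPU.
bool run_both_and_compare() {
    int h_in[kThreads], h_a[kThreads], h_b[kThreads];
    for (int i = 0; i < kThreads; ++i) h_in[i] = i + 1;

    const size_t bytes = kThreads * sizeof(int);
    int *d_in = nullptr, *d_a = nullptr, *d_b = nullptr;
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_a, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_b, bytes) != cudaSuccess) return false;
    cudaMemcpy(d_in, h_in, bytes, cudaMemcpyHostToDevice);

    chain_syncthreads<<<1, kThreads>>>(d_in, d_a);
    chain_warp_scan<<<1, kThreads>>>(d_in, d_b);

    // Blocking copies on the default stream wait for both kernels.
    bool ok = cudaMemcpy(h_a, d_a, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    ok = ok && cudaMemcpy(h_b, d_b, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;

    int running = 0;
    for (int i = 0; ok && i < kThreads; ++i) {
        running += h_in[i];
        ok = (h_a[i] == running) && (h_b[i] == running);
    }

    cudaFree(d_in);
    cudaFree(d_a);
    cudaFree(d_b);
    return ok;
}
```

Calling `run_both_and_compare()` from `main` and checking that it returns `true` is enough to try this out. The shuffle version only covers dependencies inside one warp; a chain that crosses warp boundaries would still need `__syncthreads()` between warps, or a block-level scan built on top of the per-warp one. If the real operation is not associative, the chain cannot be reordered this way and the serial structure remains.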