how could I let cmake support cuda __half types?
I am trying to write a lib based on CUDA, and I need to use FP16 (the __half
type) ops. A basic code is like this:
I am trying to write a lib based on CUDA, and I need to use FP16 (the __half
type) ops. A basic code is like this: