Why does removing instructions from my SSE intrinsic function make it slower?
Please note that this question is not about YUV422 to RGB conversion!
Why does removing statements from my SSE intrinsic function makes it slower?
Please note that this question is not about YUV422 to RGB conversion!