Skipping zero multiplications in CNN inference
I have a pre-trained CNN model on MNIST and each time load the trained weights and biases to run inference. Is there any way to skip zero operations in the conv and fc layers in only inference phase (I dont want to retrain it so it doesnt need backpropagate)?
As the MNIST images are sparse so I should be expecting to have much less execution time when skipping zero operations. The optimality of work is not that much important for me, only I want to see how much execution time differs in different rate of zeros in input.