I did not go through this link, thank you @masahi! However, it seems to be more complex than what we need. We will only be adding a single cpp file which performs optimized depthwise convolution. I see a comment by @comaniac which suggests an easier way. Is it possible to get a link to learn more about this method? Thank you for your replies!!