[RFC][Tensor Core] Optimization of CNNs on Tensor Core

We are pleased to share the code. Please see the link below:

conv2d_nhwc.py

This code uses the same layout as the Tensor Core conv2d.
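For readers unfamiliar with the layout named in the file, here is a minimal NumPy sketch of a conv2d in NHWC. It is not the code from the linked file and has no Tensor Core scheduling in it; the function name conv2d_nhwc_reference and the HWIO kernel layout are assumptions made for illustration. It only shows that in NHWC the channel axis is innermost, so each output pixel is a contraction over (kernel height, kernel width, input channels).

```python
import numpy as np


def conv2d_nhwc_reference(data, kernel, stride=1, padding=0):
    """Naive conv2d reference (illustrative only, not the linked code).

    data:   (N, H, W, C_in)         NHWC layout
    kernel: (KH, KW, C_in, C_out)   HWIO layout (assumed for this sketch)
    """
    n, h, w, c_in = data.shape
    kh, kw, k_in, c_out = kernel.shape
    assert c_in == k_in, "input channel counts must match"

    # Zero-pad only the two spatial axes.
    padded = np.pad(
        data, ((0, 0), (padding, padding), (padding, padding), (0, 0))
    )
    out_h = (h + 2 * padding - kh) // stride + 1
    out_w = (w + 2 * padding - kw) // stride + 1
    out = np.zeros((n, out_h, out_w, c_out), dtype=data.dtype)

    for y in range(out_h):
        for x in range(out_w):
            y0, x0 = y * stride, x * stride
            window = padded[:, y0:y0 + kh, x0:x0 + kw, :]
            # Contract over (KH, KW, C_in); the result has shape (N, C_out).
            out[:, y, x, :] = np.tensordot(
                window, kernel, axes=([1, 2, 3], [0, 1, 2])
            )
    return out
```

For example, a (1, 4, 4, 16) input of ones with a (3, 3, 16, 32) kernel of ones gives a (1, 2, 2, 32) output in which every element is 3 * 3 * 16 = 144.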

If you have any questions, please feel free to let me know.