How to auto tune a convolutional network for nvidia gpus

I am cleaning up the code.
I think I can send the code, benchmark and tutorial in two weeks.