Use custom C++ code with TVM

See, for example, how we integrate cuBLAS:
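
The snippet below is not TVM's cuBLAS code. It is a minimal, TVM-free sketch of the general pattern such an integration tends to follow: a C++ function is registered under a string name in a global table, and the caller looks it up by that name at run time. Everything in it is an assumption made for illustration: `Registry`, `Registrar`, `Lookup`, `NaiveMatmul`, and the name `demo.contrib.matmul` are hypothetical, and the naive matrix multiply only stands in for a real library call. TVM's own registration macros and headers differ between versions, so they are deliberately left out here.

```cpp
#include <cstddef>
#include <functional>
#include <stdexcept>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical signature for an externally provided kernel:
// computes c = a * b for row-major n x n matrices.
using ExternFn = std::function<void(const std::vector<float>& a,
                                    const std::vector<float>& b,
                                    std::vector<float>& c,
                                    std::size_t n)>;

// Global name -> function table. The function-local static avoids
// static-initialization-order problems across translation units.
std::unordered_map<std::string, ExternFn>& Registry() {
  static std::unordered_map<std::string, ExternFn> table;
  return table;
}

// Constructing a Registrar at namespace scope registers the function
// before main() starts.
struct Registrar {
  Registrar(const std::string& name, ExternFn fn) {
    Registry()[name] = std::move(fn);
  }
};

// The "custom C++ code": a naive matmul standing in for a library call.
void NaiveMatmul(const std::vector<float>& a, const std::vector<float>& b,
                 std::vector<float>& c, std::size_t n) {
  c.assign(n * n, 0.0f);
  for (std::size_t i = 0; i < n; ++i) {
    for (std::size_t k = 0; k < n; ++k) {
      for (std::size_t j = 0; j < n; ++j) {
        c[i * n + j] += a[i * n + k] * b[k * n + j];
      }
    }
  }
}

// Register the kernel under an illustrative name (not a real TVM name).
static Registrar reg_matmul("demo.contrib.matmul", NaiveMatmul);

// Look a function up by name; throws if nothing was registered.
const ExternFn& Lookup(const std::string& name) {
  auto it = Registry().find(name);
  if (it == Registry().end()) {
    throw std::runtime_error("unregistered function: " + name);
  }
  return it->second;
}
```

A caller would fetch the kernel with `Lookup("demo.contrib.matmul")` and invoke it on its buffers. The point of the indirection is that the calling side only needs the name and the calling convention, not a link-time dependency on the kernel itself.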