How can I directly invoke my existing CUDA code within the TVM framework for an operation?
It seems that I can rewrite the op in the form of relay op like tutorial here. Yet the example in scan.py shows that I probalbly need to rewrite the cuda code into python using te
. Is there a way that I can directly invoke my existing CUDA code ?